Harness Kit

Governance

Controls first, scale second.

Operational agent work needs evals, traces, approvals, least-privilege tools, rollback, and escalation paths before it earns trust.

Evals

Define what good looks like before a model run becomes production behavior.

Traces

Record provider attempts, failures, lead verdicts, and accepted synthesis without exposing raw transcripts.

Approvals

Put humans at the points where policy, money, reputation, or irreversible actions enter the workflow.

Least privilege

Keep tool permissions explicit and narrow enough for the workflow's actual needs.

Rollback

Know which commit, config, or harness change to revert when behavior regresses.

Escalation

Name the condition that moves work from agent automation back to human review.

What to verify

Sources: .harness-kit/agents.yaml, docs/positioning.md