Skip to content

Wire verifier preview to emit hunt verdicts (pass/partial/fail → hunted/wounded/escaped) #2093

@Hmbown

Description

@Hmbown

Wire the verifier preview to emit hunt verdicts. Composes the v0.8.46 verifier-preview scope with the verdict vocabulary that ships in v0.8.45 (#2092).

Concept

The v0.8.46 milestone already scopes a verifier preview that auto-spawns a fresh-context, read-only, time-boxed sub-agent on claim-of-done and returns pass / partial / fail. After #2092, codewhale has a shared verdict vocabulary (hunting / hunted / wounded / escaped) and the data structure to hold it. This issue maps verifier verdicts onto hunt verdicts and routes them through the same code path.

Mapping

Verifier verdict Hunt verdict Action
pass hunted Trophy card written. Session may auto-close per config.
partial wounded (with verifier reason text) No trophy. Verdict logged. Hunt remains open; runtime suggests follow-ups.
fail escaped (with verifier reason text) No trophy. Verdict logged. Hunt closed.

/force-complete (already in verifier-preview scope) becomes /declare-hunted — overrides the verifier and writes the trophy. Audit-logged.

In scope (v0.8.46 increment)

Out of scope (defer to v0.9.0 / #2094)

  • Three-jurisdiction policy (strict / evidentiary / permissive).
  • Judge as a separately-configurable LLM with its own model id and prompt.
  • Trajectory-aware judging (judge sees full session, not just claim-of-done snapshot).

Acceptance

  • A session with [verifier] enabled = true finishing a hunt produces a verdict; the verdict is the hunt verdict.
  • pass → trophy card on disk. partial → wounded log. fail → escaped log. All three observed in tests.
  • /declare-hunted overrides and audit-logs.
  • Verdict glyph renders in /tasks.
  • Existing verifier-preview tests pass unchanged or are updated alongside the verdict-mapping commit.

Closes / partially closes

Composes with the verifier-preview surface already scoped in the v0.8.46 milestone description.

Metadata

Metadata

Assignees

No one assigned

    Labels

    bugSomething isn't workingenhancementNew feature or requestrustPull requests that update rust code

    Projects

    No projects

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions