From a9fb64c209b98174bfdd0e2ee44a16af151e1caf Mon Sep 17 00:00:00 2001
From: Jonathan Jackson <jjackson@dimagi.com>
Date: Fri, 22 May 2026 02:56:42 -0600
Subject: [PATCH] docs: cleanup historical artifacts (~17k lines)
MIME-Version: 1.0
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: 8bit

Audit pass over docs/ found a lot of accumulated detritus that no
code, skill, or agent references. Removing the unreferenced
historical artifacts and fixing the broken cross-references that
result.

Deletions (~17,300 lines)
- docs/superpowers/plans/ — wholesale (11 files, ~13k lines). These
  were post-shipment implementation plans, zero references from any
  skill/agent/CLAUDE.md. The PR diff is the truth, not the plan
  checkboxes.
- docs/superpowers/specs/ — 8 unreferenced design docs (~2,100 lines):
  ace-solicitations-phase-design, app-multimedia-coverage-design,
  skills-audit-findings, mobile-cloud-runner-poc + api-gaps,
  ace-sweep-atom-contracts + design, work-order-skill-design.
  Kept the 5 specs that ARE referenced from skills/CLAUDE.md
  (shallow-deep-qa-split, decisions-log, qa-eval-migration,
  state-consolidation, focus-group-archetype-redefinition).
- docs/generated/playbook.md — 16-day-stale derived artifact
  (claimed "8-phase orchestration"; pipeline has been 10-phase for
  weeks). Regenerated by `/ace:docs` when next needed.

Broken cross-references fixed
- skills/README.md — two refs to deleted 2026-04-01-ace-design.md
  replaced with pointers to CLAUDE.md + agents/orchestrator-reference.md
  and (for the dry-run paragraph) absorbed inline.
- skills/upload-transcript/SKILL.md — ref to deleted
  2026-05-02-ace-run-multi-run-revival-design.md dropped; the
  sentence stands on its own.
- README.md — Documentation section refreshed; broken refs to
  ace-design.md and ace-web-harness-design.md replaced with
  pointers to CLAUDE.md, agents/orchestrator-reference.md, the
  integrations playbook, and the ace-web sibling repo.

Followups not in this PR (called out in the analysis but deferred):
- SKILL.md ## Change Log compression (every skill carries 5-13
  historical entries that bloat its context every time it dispatches).
  Bigger code change; deserves its own PR.
- .claude/pm/runs/ early-April compaction. Light touch; folds into
  one durable learning when someone gets to it.

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
---
 .claude-plugin/marketplace.json               |    4 +-
 .claude-plugin/plugin.json                    |    2 +-
 README.md                                     |   13 +-
 VERSION                                       |    2 +-
 docs/generated/playbook.md                    |  564 ----
 .../2026-05-04-ace-solicitations-phase.md     | 2425 -----------------
 .../plans/2026-05-04-shallow-deep-qa-split.md | 1202 --------
 .../2026-05-05-app-multimedia-coverage.md     | 2105 --------------
 .../plans/2026-05-08-decisions-log-pr1.md     | 1124 --------
 .../plans/2026-05-08-decisions-log-pr2.md     | 1059 -------
 .../plans/2026-05-08-decisions-log-pr3.md     | 1201 --------
 .../plans/2026-05-08-decisions-log-pr4.md     |  294 --
 .../plans/2026-05-08-decisions-log-pr5.md     |  133 -
 ...026-05-10-orchestrator-structural-split.md |  818 ------
 ...5-15-ace-sweep-pr1-foundation-and-drive.md | 1148 --------
 .../plans/2026-05-21-work-order-skill.md      | 1454 ----------
 ...26-05-04-ace-solicitations-phase-design.md |  378 ---
 ...26-05-05-app-multimedia-coverage-design.md |  Bin 21583 -> 0 bytes
 .../specs/2026-05-06-skills-audit-findings.md |  316 ---
 .../2026-05-09-mobile-cloud-runner-poc.md     |  180 --
 ...2026-05-11-mobile-cloud-runner-api-gaps.md |  141 -
 .../2026-05-15-ace-sweep-atom-contracts.md    |  186 --
 .../specs/2026-05-15-ace-sweep-design.md      |  171 --
 .../2026-05-21-work-order-skill-design.md     |  233 --
 package.json                                  |    2 +-
 skills/README.md                              |    4 +-
 skills/upload-transcript/SKILL.md             |    4 +-
 27 files changed, 16 insertions(+), 15147 deletions(-)
 delete mode 100644 docs/generated/playbook.md
 delete mode 100644 docs/superpowers/plans/2026-05-04-ace-solicitations-phase.md
 delete mode 100644 docs/superpowers/plans/2026-05-04-shallow-deep-qa-split.md
 delete mode 100644 docs/superpowers/plans/2026-05-05-app-multimedia-coverage.md
 delete mode 100644 docs/superpowers/plans/2026-05-08-decisions-log-pr1.md
 delete mode 100644 docs/superpowers/plans/2026-05-08-decisions-log-pr2.md
 delete mode 100644 docs/superpowers/plans/2026-05-08-decisions-log-pr3.md
 delete mode 100644 docs/superpowers/plans/2026-05-08-decisions-log-pr4.md
 delete mode 100644 docs/superpowers/plans/2026-05-08-decisions-log-pr5.md
 delete mode 100644 docs/superpowers/plans/2026-05-10-orchestrator-structural-split.md
 delete mode 100644 docs/superpowers/plans/2026-05-15-ace-sweep-pr1-foundation-and-drive.md
 delete mode 100644 docs/superpowers/plans/2026-05-21-work-order-skill.md
 delete mode 100644 docs/superpowers/specs/2026-05-04-ace-solicitations-phase-design.md
 delete mode 100644 docs/superpowers/specs/2026-05-05-app-multimedia-coverage-design.md
 delete mode 100644 docs/superpowers/specs/2026-05-06-skills-audit-findings.md
 delete mode 100644 docs/superpowers/specs/2026-05-09-mobile-cloud-runner-poc.md
 delete mode 100644 docs/superpowers/specs/2026-05-11-mobile-cloud-runner-api-gaps.md
 delete mode 100644 docs/superpowers/specs/2026-05-15-ace-sweep-atom-contracts.md
 delete mode 100644 docs/superpowers/specs/2026-05-15-ace-sweep-design.md
 delete mode 100644 docs/superpowers/specs/2026-05-21-work-order-skill-design.md

diff --git a/.claude-plugin/marketplace.json b/.claude-plugin/marketplace.json
index 6987c3d3..447d125c 100644
--- a/.claude-plugin/marketplace.json
+++ b/.claude-plugin/marketplace.json
@@ -6,13 +6,13 @@
     "url": "https://github.com/jjackson"
   },
   "metadata": {
-    "version": "0.13.330"
+    "version": "0.13.331"
   },
   "plugins": [
     {
       "name": "ace",
       "source": "./",
-      "version": "0.13.330",
+      "version": "0.13.331",
       "description": "AI Connect Engine — orchestrates the CRISPR-Connect lifecycle from idea through app building, Connect setup, LLO management, and closeout"
     }
   ]
diff --git a/.claude-plugin/plugin.json b/.claude-plugin/plugin.json
index 13ae57c7..d519303a 100644
--- a/.claude-plugin/plugin.json
+++ b/.claude-plugin/plugin.json
@@ -1,6 +1,6 @@
 {
   "name": "ace",
-  "version": "0.13.330",
+  "version": "0.13.331",
   "description": "AI Connect Engine — orchestrates the CRISPR-Connect lifecycle from idea through app building, Connect setup, LLO management, and closeout",
   "author": {
     "name": "Jonathan Jackson",
diff --git a/README.md b/README.md
index ef567e41..9ac9973c 100644
--- a/README.md
+++ b/README.md
@@ -217,11 +217,14 @@ delegates app-build to `/nova:autobuild`. See
 
 ## Documentation
 
-- [Design Spec](docs/superpowers/specs/2026-04-01-ace-design.md) — full architecture and rationale
-- [Generated Playbook](docs/generated/playbook.md) — human-readable process flow (generated from agent/skill definitions)
-- [Integration Specs](playbook/integrations/) — what APIs exist vs. need to be built
-- [ACE Web Harness Design](docs/superpowers/specs/2026-04-07-ace-web-harness-design.md) — cross-cutting architecture spec for the browser-based ACE frontend
-- [PDD Stress-Test Observations](docs/examples/pdd-stress-test-observations.md) — how to validate an PDD and verify LLO execution, with two sample PDDs worked through end-to-end
+- [CLAUDE.md](CLAUDE.md) — agent guide, phase pipeline, conventions, gotchas
+- [agents/orchestrator-reference.md](agents/orchestrator-reference.md) — state schemas, phase write-back contract, pause points, fork points
+- [Integration Specs](playbook/integrations/) — per-MCP integration reference and durable gotcha records (OCS, Nova, Connect, CommCare, labs, mobile, slides)
+- [Generated Playbook](docs/generated/playbook.md) — derived process flow regenerated by `/ace:docs` (run the command to (re)create it)
+- [Design Specs](docs/superpowers/specs/) — date-stamped design docs for in-flight or recently-shipped work
+- [Durable Learnings](docs/learnings/) — cross-session lessons (Nova bugs, demo-user mechanics, Phase 6 validation arc, etc.)
+- [PDD Stress-Test Observations](docs/examples/pdd-stress-test-observations.md) — how to validate a PDD and verify LLO execution, with two sample PDDs worked through end-to-end
+- **`ace-web`** sibling repo — design spec for the browser-based ACE frontend lives in that repo, not here
 
 ## Related projects
 
diff --git a/VERSION b/VERSION
index 8bf173bc..7a026946 100644
--- a/VERSION
+++ b/VERSION
@@ -1 +1 @@
-0.13.330
+0.13.331
diff --git a/docs/generated/playbook.md b/docs/generated/playbook.md
deleted file mode 100644
index a6142596..00000000
--- a/docs/generated/playbook.md
+++ /dev/null
@@ -1,564 +0,0 @@
-# ACE Playbook — CRISPR-Connect Process
-
-_Generated: 2026-05-06 (ACE 0.13.43 — 8-phase orchestration)_
-
-Derived from `agents/*.md`, `skills/*/SKILL.md`, and
-`playbook/integrations/*.md`. Regenerate with `/ace:docs` after changing any
-of those sources.
-
-## Overview
-
-ACE (AI Connect Engine) orchestrates the full CRISPR-Connect lifecycle for
-a Connect opportunity. The `ace-orchestrator` agent dispatches to **eight
-phase agents** in order:
-
-1. **design-review** — iterate idea → approved PDD + opp-specific test prompts
-2. **commcare-setup** — Nova-build Learn + Deliver apps, deploy + release
-3. **connect-setup** — create Program + Opportunity in Connect
-4. **ocs-setup** — clone OCS chatbot template, attach RAG collection, smoke-test
-5. **qa-and-training** — capture screenshots, generate per-artifact training docs
-6. **solicitation-management** — publish solicitation, invite candidate LLOs
-7. **execution-manager** — onboard awarded LLO, UAT, go-live, recurring monitor
-8. **closeout** — invoices, feedback, learnings, cycle grade
-
-**Phases 1–5 run end-to-end with zero LLO involvement.** Phase 6 publishes a
-public solicitation listing (no targeted contact unless the PDD names
-preferred LLOs). Phase 7 is the first 1-1 LLO contact and starts only when
-`opp.yaml.selected_llo.org_slug` is populated by the manual `solicitation-review`
-skill that runs between Phase 6 and Phase 7.
-
-### Execution modes
-
-- **Auto (default)** — run all phases sequentially, log gates but don't
-  enforce them.
-- **Review** — pause at gate steps and use `AskUserQuestion` to get operator
-  approval before continuing. Gate steps:
-  - After `idea-to-pdd` (Phase 1) — PDD approval
-  - After `app-deploy` (Phase 2) — apps verified before Connect setup
-  - After `ocs-chatbot-eval --deep` (Phase 4) — OCS quality clears the
-    pre-launch bar
-  - After `solicitation-review` (manual, between Phase 6 and 7) — awardee
-    explicitly approved
-  - After `llo-launch` (Phase 7) — opportunity activation verified
-
-### Agent topology
-
-ACE has one architectural rule: **anything that calls `Agent` must run at
-level 0.** The orchestrator and the Phase 2 `commcare-setup` are procedure
-docs the top-level session reads inline; the other seven agents are
-subagents dispatched via `Agent(...)` from level 0. There are never two
-levels of `Agent` dispatch. See `CLAUDE.md § Agent topology` for the full
-rule and history.
-
-## Process Flow
-
-| Phase | Agent | Skills | Gate |
-|---|---|---|---|
-| 1 | design-review | idea-to-pdd → pdd-to-test-prompts + pdd-to-app-journeys | PDD approved (review mode) |
-| 2 | commcare-setup | pdd-to-learn-app + pdd-to-deliver-app → app-connect-coverage → app-test-cases → app-deploy → app-release | Apps deployed + released |
-| 3 | connect-setup | connect-program-setup → connect-opp-setup | — |
-| 4 | ocs-setup | ocs-agent-setup → ocs-chatbot-qa --quick → ocs-chatbot-eval --quick | OCS quality (deep gate at end) |
-| 5 | qa-and-training | app-screenshot-capture → training-* (6 per-artifact) → training-deck-build | — |
-| 6 | solicitation-management | solicitation-create → llo-invite → solicitation-monitor (recurring) | **HALT** — manual `solicitation-review` populates `selected_llo` |
-| 7 | execution-manager | llo-onboarding → llo-uat → llo-launch → timeline-monitor + flw-data-review + ocs-chatbot-qa --monitor (recurring) | UAT sign-off + launch verified |
-| 8 | closeout | opp-closeout → llo-feedback → learnings-summary → cycle-grade | — |
-
-Standalone skills (not part of the default `/ace:run`):
-- `app-multimedia-coverage` — manual post-Phase-2, attaches display images
-- `connect-baseline-screenshots` — cross-opp Connect-walkthrough capture
-- `ocs-tester` (agent) — ad-hoc OCS quality probe
-- `email-communicator` — utility skill, called by other skills
-- `upload-transcript` — uploads CLI stream-json to ace-web
-
-In-flow skills with a removal trigger:
-- `commcare-form-patch` — Phase 2 Step 2.8 workaround for
-  voidcraft-labs/nova-plugin#7 (Nova emits `<module>`/`<assessment>`
-  wrappers in Learn-app form XML that break the AVD's CommCare
-  runtime). Idempotent + no-op when zero wrappers found. Whole skill
-  + the backing `commcare_patch_xform` MCP atom self-delete the day
-  nova-plugin#7 ships and a clean `/ace:run` produces a wrapper-free
-  Learn CCZ.
-
----
-
-## Phase 1: Design Review
-
-**Agent:** `design-review`
-
-> Phase 1 of the CRISPR-Connect lifecycle: iterate an initial idea into an
-> approved Program Design Document (PDD) and derive opp-specific test
-> prompts for later OCS chatbot evaluation.
-
-### Skills
-
-#### idea-to-pdd
-Develop a Program Design Doc (PDD) for a Connect intervention from source
-material. Iterates a 5-question stress-test rubric until approved.
-
-#### idea-to-pdd-eval
-Independently grade a PDD against the source idea pack — re-runs the
-stress test from outside and cross-checks reviewer-comment fidelity.
-
-#### pdd-to-test-prompts
-Derive opp-specific Q&A test prompts from an approved PDD. Produces the
-ground-truth suite for the Phase 4 OCS chatbot deep gate.
-
-#### pdd-to-app-journeys
-Derive opp-specific expected user journeys from an approved PDD. Produces
-the UX-intent ground truth consumed by app-test-cases and app-ux-eval.
-
----
-
-## Phase 2: CommCare Setup
-
-**Agent:** `commcare-setup`
-
-> Phase 2 of the CRISPR-Connect lifecycle: translate the approved PDD into
-> Learn and Deliver apps via Nova, deploy them to CommCare HQ, and test.
-
-### Skills
-
-#### pdd-to-learn-app
-Build the CommCare Learn (training) app from the PDD via Nova's
-/nova:autobuild. Captures nova_app_id and writes a structure summary.
-
-#### pdd-to-learn-app-eval
-Grade a Nova-built Learn app against the PDD that specified it — module
-count, order, Assessment Score wiring, content coverage.
-
-#### pdd-to-deliver-app
-Build the CommCare Deliver (service-delivery) app from the PDD via Nova's
-/nova:autobuild. Captures nova_app_id and writes a structure summary.
-
-#### pdd-to-deliver-app-eval
-Grade a Nova-built Deliver app against the PDD that specified it — field
-count, ordering, conditional logic, Connectify wiring.
-
-#### app-connect-coverage
-Verify every form in a Nova-built Learn or Deliver app has the right
-CommCare Connect markers, auto-fix via Nova edits, loop until clean.
-
-#### app-test-cases
-Bind each PDD user journey to the Nova-built app structure and emit a
-Maestro recipe per journey with real selectors. Use after Nova finishes
-building, before app-release.
-
-#### app-deploy
-Upload Nova-built Learn + Deliver apps to CommCare HQ as draft builds via
-/nova:upload_to_hq. Captures HQ app IDs and writes a deploy summary.
-
-#### app-release
-Build and release the Learn + Deliver CommCare apps on CCHQ so Connect
-can read their form schema and surface deliver units.
-
-#### app-release-eval
-Verify every Learn + Deliver build was actually released so Connect can
-read deliver units. Provisional rubric pending 3+ real releases.
-
-#### app-multimedia-coverage (manual, not part of /ace:run)
-Attach display-only images to Connect app questions where they
-meaningfully help FLWs. Manual gate; not part of /ace:run.
-
-#### commcare-form-patch (Phase 2 Step 2.8, removal-tracked)
-Apply surgical CCHQ form-XML patches when Nova's `compile_app` emits
-output the AVD's CommCare runtime can't parse, then re-build + re-release.
-Wired into Phase 2 as Step 2.8 in 0.13.66 — auto-runs after `app-release`
-with `targets: auto` (no-op when zero wrappers in the released Learn CCZ).
-Whole skill self-deletes when voidcraft-labs/nova-plugin#7 ships per its
-own `## Removal criteria`. Workaround for jjackson/ace#115 finding 1.
-
----
-
-## Phase 3: Connect Setup
-
-**Agent:** `connect-setup`
-
-> Orchestrates Connect platform setup for a CRISPR-Connect opportunity:
-> program creation, opportunity shell, verification flags, and payment
-> units. Atom-driven via the ace-connect MCP (no HITL).
-
-### Skills
-
-#### connect-program-setup
-Create or reuse a Connect Program for the opportunity, archetype-matched
-to the PDD. Captures program_id for downstream skills.
-
-#### connect-program-setup-eval
-Grade Connect Program + Opportunity configuration against the PDD —
-reuse-vs-create, verification rules, delivery units, payment units.
-
-#### connect-opp-setup
-Create and fully configure a Connect Opportunity — opp shell, verification
-flags, payment units, ACE test-user pre-invite for emulator testing.
-
----
-
-## Phase 4: OCS Setup
-
-**Agent:** `ocs-setup`
-
-> Phase 4 of the CRISPR-Connect lifecycle: clone the ACE golden template,
-> build the opp-specific RAG collection, smoke-test the bot via a thin
-> quick chat suite, and stage the widget credentials for Connect.
-
-### Skills
-
-#### ocs-agent-setup
-Clone the ACE OCS template into a per-opp chatbot, attach a RAG
-collection from PDD + training + app summaries, publish, return embed
-credentials.
-
-#### ocs-chatbot-qa
-Exercise the per-opp OCS chatbot via its anonymous widget and capture a
-transcript with structural checks. Modes: --quick / --deep / --monitor.
-
-#### ocs-chatbot-eval
-LLM-as-Judge grader for OCS chatbot transcripts. Modes: --quick (1-dim
-smoke), --deep / --monitor (5-dim calibrated; emits gate brief).
-
-#### ocs-widget-handoff-eval
-Grade the OCS widget-handoff staging artifact for HITL paste-in — widget
-URL, embed key, opportunity-binding instructions.
-
----
-
-## Phase 5: QA and Training
-
-**Agent:** `qa-and-training`
-
-> Phase 5 of the CRISPR-Connect lifecycle: produce per-opp QA test plan +
-> walkthrough screenshots + training materials. All derived from the
-> design docs (PDD, app summaries, opp identifiers, OCS chatbot URL) so
-> the Phase runs from artifacts; no live LLO contact.
-
-### Skills
-
-#### app-screenshot-capture
-Run app smoke recipes against a local AVD and capture per-step
-screenshots for the training deck. Per-opp content only.
-
-#### app-ux-eval (deep-only, /ace:qa-deep)
-Grade the FLW experience of the built apps via LLM-as-Judge over
-captured screenshots. Deep-only — runs from /ace:qa-deep.
-
-#### training-llo-guide
-Generate the LLO-facing operations document for overseeing FLW
-deployment. Owns one artifact: llo-manager-guide.md.
-
-#### training-flw-guide
-Generate the FLW-facing step-by-step guide for the Learn and Deliver
-apps. Owns one artifact: flw-training-guide.md.
-
-#### training-quick-reference
-Generate the one-page printable pocket-card summary for FLWs in the
-field. Owns one artifact: quick-reference.md.
-
-#### training-faq
-Generate anticipated LLO + FLW questions with authoritative answers.
-Owns one artifact: faq.md.
-
-#### training-deck-outline
-Generate the slide-by-slide markdown outline that training-deck-build
-renders into a Google Slides deck. Owns one artifact.
-
-#### training-deck-build
-Render training-deck-outline.md into a Google Slides deck using the
-ACE template. Produces a presentable Slides URL.
-
-#### training-onboarding-email
-Generate the LLO onboarding email body, consumed by llo-onboarding
-and personalized per LLO at send time. Owns one artifact.
-
-#### connect-baseline-screenshots (cross-opp, manual)
-Capture the per-Connect-version baseline of "how Connect works"
-screenshots reused across every training deck. Manual, cross-opp.
-
----
-
-## Phase 6: Solicitation Management
-
-**Agent:** `solicitation-management`
-
-> Phase 6 of the CRISPR-Connect lifecycle: publish a solicitation derived
-> from the PDD, invite PDD-named candidate LLOs to it by email, and stop.
-> The review-and-award lifecycle continues via the manually-invoked
-> `solicitation-review` skill (gated on a human-in-the-loop checkpoint
-> before `award_response` is called). Phase 7 starts once an awardee is
-> recorded in `opp.yaml.selected_llo`.
-
-### Skills
-
-#### solicitation-create
-Translate the PDD into a solicitation payload, derive evaluation criteria,
-and publish via connect-labs MCP. Captures solicitation_id.
-
-#### solicitation-create-eval
-Grade a published solicitation against its source PDD — scope fidelity,
-field completeness, deadline sensibility.
-
-#### llo-invite
-Email each PDD-named candidate LLO the public solicitation URL. No-op
-when PDD has no preferred_llos.
-
-#### solicitation-monitor (recurring)
-Recurring poll for solicitation responses. Modes: --quick (count only) /
---monitor (full pull, default) / --close (final pull).
-
-#### solicitation-review (manual; HALT-and-resume)
-Score solicitation responses, recommend an awardee, and (after HITL
-approval) call award_response and populate opp.yaml.selected_llo.
-
-#### solicitation-review-eval
-Compare ACE's top-ranked solicitation recommendation against the human's
-actual award. Detection-rate metric.
-
----
-
-## Phase 7: Execution Management
-
-**Agent:** `execution-manager`
-
-> Phase 7 of the CRISPR-Connect lifecycle: execute the awarded LLO's run
-> of the opportunity — onboarding, UAT, go-live, and recurring monitoring.
-> Phase 7 entry is gated on `opp.yaml.selected_llo.org_slug` being
-> populated by Phase 6's `solicitation-review` skill.
-
-### Skills
-
-#### llo-onboarding
-Issue the Connect program invite and send the awarded LLO the ACE
-onboarding email with training materials and OCS widget link.
-
-#### llo-uat
-Coordinate User Acceptance Testing with onboarded LLOs. Send UAT
-instructions, monitor feedback, compile results with sign-off status.
-
-#### llo-launch
-Activate the opportunity for live use. Verifies UAT sign-offs and
-deep-QA verdicts, activates in Connect, notifies LLOs of go-live.
-
-#### llo-launch-eval
-Grade an llo-launch activation against PDD launch preconditions — UAT
-sign-off, Connect activation, app-publish, go-live notify.
-
-#### timeline-monitor (recurring)
-Watch whether LLOs are hitting expected milestones on schedule. Email
-prompts when behind. Recurring during active opp.
-
-#### flw-data-review (recurring)
-Analyze FLW submissions to identify quality issues, trends, and
-improvement opportunities. Recurring during active opp.
-
-#### flw-data-review-eval
-Grade an flw-data-review report — signal coverage, outlier rigor,
-recommendation actionability, evidence citation, trajectory awareness.
-
-#### ocs-chatbot-qa --monitor (recurring)
-See Phase 4. Phase 7 invokes recurring `--monitor` mode.
-
-#### ocs-chatbot-eval --monitor (recurring)
-See Phase 4. Phase 7 invokes recurring `--monitor` mode.
-
----
-
-## Phase 8: Closeout
-
-**Agent:** `closeout`
-
-> Orchestrates opportunity closeout: invoice processing, LLO feedback
-> collection, learnings summary, and overall cycle grading. Triggered
-> when the opportunity reaches its end date.
-
-### Skills
-
-#### opp-closeout
-Pull invoices from the completed opportunity and create a Jira ticket to
-issue payment to the LLO.
-
-#### llo-feedback
-Prompt LLOs for feedback on application, process, and next-step
-suggestions. Collect and document responses for closeout.
-
-#### learnings-summary
-Synthesize learnings from a completed opportunity. Drafts a new PDD to
-seed the next cycle when iteration is warranted.
-
-#### cycle-grade
-Grade the closed CRISPR-Connect cycle end-to-end with concrete
-improvement recommendations for the next cycle.
-
-#### cycle-grade-eval
-Independently re-grade a closed cycle's cycle-grade output. Detects
-self-eval inflation, missing learnings, vague recommendations.
-
----
-
-## Cross-cutting Skills
-
-#### opp-eval (umbrella aggregator)
-Umbrella aggregator that rolls every per-skill -eval verdict into a
-run-level scorecard. Modes: --quick / --deep / --monitor.
-
-#### eval-calibration (methodology reference)
-Methodology reference for calibrating ACE's per-skill -eval rubrics —
-ground-truth catalogues, variance protocol, detection-rate metric.
-
-#### email-communicator (utility, called by other skills)
-Send/receive email via GOG CLI using the ACE Gmail account. Utility
-skill — other skills delegate here for any Gmail operation.
-
-#### upload-transcript (utility)
-Upload a Claude CLI stream-json transcript (.jsonl) to a deployed
-ace-web via /api/ingest/upload. Used by /ace:run --ace-web-url.
-
----
-
-## External Integrations
-
-### Connect API
-ACE talks to Connect through **two** MCP servers, scoped to distinct
-domains:
-
-1. **`connect-labs`** (lives in [`connect-labs` repo](https://github.com/dimagi/connect-labs))
-   — solicitations, reviews, awards, funds. Production-ready and
-   unrelated to the Programs/Opportunities lifecycle ACE manages.
-   Consumed via a thin local stdio proxy (`mcp/connect-labs-server.ts`)
-   that forwards JSON-RPC frames to the remote HTTP MCP at
-   `https://labs.connect.dimagi.com/mcp/`.
-2. **`ace-connect`** — composite Connect backend over `connect.dimagi.com`,
-   authenticated as `ace@dimagi-ai.com` via OAuth-with-CommCareHQ. 21
-   atoms today: 8 authoring atoms route to the REST automation API
-   (commcare-connect#1135); the rest still drive HTML form pages via
-   Playwright.
-
-See `playbook/integrations/connect-api.md` for the atom inventory.
-
-### CommCare API
-Production-ready CommCare HQ tools live in the `connect-labs` MCP
-(`list_apps`, `get_app_structure`, etc.). ACE calls them for app
-inspection during Phase 2.
-
-See `playbook/integrations/commcare-api.md`.
-
-### OCS (Open Chat Studio)
-Composite MCP backend with **22 atomic capabilities** at
-`mcp/ocs-server.ts` → `ace-ocs`. REST + Playwright + composite backends.
-Authenticate with `/ace:ocs-login` before calling tools that hit live OCS.
-
-See `playbook/integrations/ocs-integration.md`.
-
-### Nova (CommCare app builder)
-Live as a sibling Claude Code plugin (`voidcraft-labs/nova-marketplace`).
-End-to-end smoke test passed 2026-04-28. ACE consumes Nova's
-`/nova:autobuild` slash command via the Nova plugin; OAuth on first use.
-
-See `playbook/integrations/nova-integration.md`.
-
-### Mobile (CommCare Android emulation)
-The `ace-mobile` MCP server drives a local Android AVD on the operator's
-Mac via Maestro + adb + Playwright. **Mac-only, dev-machine-only** — no
-cloud device farms. Bootstrap with `/ace:mobile-bootstrap`.
-
-See `playbook/integrations/mobile-integration.md`.
-
-### Slides (Google Slides API)
-Slides atoms (`slides_get`, `slides_batch_update`, `slides_copy_template`)
-shipped 0.10.78. Back the `training-deck-build` skill, which renders
-markdown deck-outlines into editable Google Slides decks.
-
-See `playbook/integrations/slides-integration.md`.
-
----
-
-## Current Limitations
-
-`## Current Workaround` blocks across SKILL.md files document HITL
-fallbacks for capabilities not yet automated. As of 0.13.43, no skills
-ship with active workaround blocks — all previously-blocked Phase 3 / 5 /
-7 paths are atom-driven via the `ace-connect` MCP (since 0.10.47).
-
-The `commcare-form-patch` and `app-multimedia-coverage` skills ARE
-documented workarounds but for the Nova upstream, not for Connect/CCHQ.
-Both have explicit `## Removal criteria` sections naming the upstream
-ticket whose resolution will retire the skill.
-
----
-
-## Skill Reference
-
-54 ACE skills + 3 reference docs. All skills ship with
-`disable-model-invocation: true` (orchestrator-dispatched, never
-free-text invoked). See `skills/README.md` for the author contract.
-
-| Skill | Phase | Description (≤200 chars) |
-|---|---|---|
-| app-connect-coverage | 2 | Verify every form in a Nova-built Learn or Deliver app has the right CommCare Connect markers, auto-fix via Nova edits, loop until clean. |
-| app-deploy | 2 | Upload Nova-built Learn + Deliver apps to CommCare HQ as draft builds via /nova:upload_to_hq. Captures HQ app IDs and writes a deploy summary. |
-| app-multimedia-coverage | 2 (manual) | Attach display-only images to Connect app questions where they meaningfully help FLWs. Manual gate; not part of /ace:run. |
-| app-release | 2 | Build and release the Learn + Deliver CommCare apps on CCHQ so Connect can read their form schema and surface deliver units. |
-| app-release-eval | 2 | Verify every Learn + Deliver build was actually released so Connect can read deliver units. Provisional rubric pending 3+ real releases. |
-| app-screenshot-capture | 5 | Run app smoke recipes against a local AVD and capture per-step screenshots for the training deck. Per-opp content only. |
-| app-test-cases | 2 | Bind each PDD user journey to the Nova-built app structure and emit a Maestro recipe per journey with real selectors. Use after Nova finishes building, before app-release. |
-| app-ux-eval | 5 (deep) | Grade the FLW experience of the built apps via LLM-as-Judge over captured screenshots. Deep-only — runs from /ace:qa-deep. |
-| commcare-form-patch | 2 (workaround) | Apply surgical CCHQ form-XML patches when Nova's compile_app emits output Connect rejects, then re-build + re-release. Workaround skill. |
-| connect-baseline-screenshots | xcut | Capture the per-Connect-version baseline of "how Connect works" screenshots reused across every training deck. Manual, cross-opp. |
-| connect-opp-setup | 3 | Create and fully configure a Connect Opportunity — opp shell, verification flags, payment units, ACE test-user pre-invite for emulator testing. |
-| connect-program-setup | 3 | Create or reuse a Connect Program for the opportunity, archetype-matched to the PDD. Captures program_id for downstream skills. |
-| connect-program-setup-eval | 3 | Grade Connect Program + Opportunity configuration against the PDD — reuse-vs-create, verification rules, delivery units, payment units. |
-| cycle-grade | 8 | Grade the closed CRISPR-Connect cycle end-to-end with concrete improvement recommendations for the next cycle. |
-| cycle-grade-eval | 8 | Independently re-grade a closed cycle's cycle-grade output. Detects self-eval inflation, missing learnings, vague recommendations. |
-| email-communicator | xcut | Send/receive email via GOG CLI using the ACE Gmail account. Utility skill — other skills delegate here for any Gmail operation. |
-| eval-calibration | xcut | Methodology reference for calibrating ACE's per-skill -eval rubrics — ground-truth catalogues, variance protocol, detection-rate metric. |
-| flw-data-review | 7 | Analyze FLW submissions to identify quality issues, trends, and improvement opportunities. Recurring during active opp. |
-| flw-data-review-eval | 7 | Grade an flw-data-review report — signal coverage, outlier rigor, recommendation actionability, evidence citation, trajectory awareness. |
-| idea-to-pdd | 1 | Develop a Program Design Doc (PDD) for a Connect intervention from source material. Iterates a 5-question stress-test rubric until approved. |
-| idea-to-pdd-eval | 1 | Independently grade a PDD against the source idea pack — re-runs the stress test from outside and cross-checks reviewer-comment fidelity. |
-| learnings-summary | 8 | Synthesize learnings from a completed opportunity. Drafts a new PDD to seed the next cycle when iteration is warranted. |
-| llo-feedback | 8 | Prompt LLOs for feedback on application, process, and next-step suggestions. Collect and document responses for closeout. |
-| llo-invite | 6 | Email each PDD-named candidate LLO the public solicitation URL. No-op when PDD has no preferred_llos. |
-| llo-launch | 7 | Activate the opportunity for live use. Verifies UAT sign-offs and deep-QA verdicts, activates in Connect, notifies LLOs of go-live. |
-| llo-launch-eval | 7 | Grade an llo-launch activation against PDD launch preconditions — UAT sign-off, Connect activation, app-publish, go-live notify. |
-| llo-onboarding | 7 | Issue the Connect program invite and send the awarded LLO the ACE onboarding email with training materials and OCS widget link. |
-| llo-uat | 7 | Coordinate User Acceptance Testing with onboarded LLOs. Send UAT instructions, monitor feedback, compile results with sign-off status. |
-| ocs-agent-setup | 4 | Clone the ACE OCS template into a per-opp chatbot, attach a RAG collection from PDD + training + app summaries, publish, return embed credentials. |
-| ocs-chatbot-eval | 4, 7 | LLM-as-Judge grader for OCS chatbot transcripts. Modes: --quick (1-dim smoke), --deep / --monitor (5-dim calibrated; emits gate brief). |
-| ocs-chatbot-qa | 4, 7 | Exercise the per-opp OCS chatbot via its anonymous widget and capture a transcript with structural checks. Modes: --quick / --deep / --monitor. |
-| ocs-widget-handoff-eval | 4 | Grade the OCS widget-handoff staging artifact for HITL paste-in — widget URL, embed key, opportunity-binding instructions. |
-| opp-closeout | 8 | Pull invoices from the completed opportunity and create a Jira ticket to issue payment to the LLO. |
-| opp-eval | xcut | Umbrella aggregator that rolls every per-skill -eval verdict into a run-level scorecard. Modes: --quick / --deep / --monitor. |
-| pdd-to-app-journeys | 1 | Derive opp-specific expected user journeys from an approved PDD. Produces the UX-intent ground truth consumed by app-test-cases and app-ux-eval. |
-| pdd-to-deliver-app | 2 | Build the CommCare Deliver (service-delivery) app from the PDD via Nova's /nova:autobuild. Captures nova_app_id and writes a structure summary. |
-| pdd-to-deliver-app-eval | 2 | Grade a Nova-built Deliver app against the PDD that specified it — field count, ordering, conditional logic, Connectify wiring. |
-| pdd-to-learn-app | 2 | Build the CommCare Learn (training) app from the PDD via Nova's /nova:autobuild. Captures nova_app_id and writes a structure summary. |
-| pdd-to-learn-app-eval | 2 | Grade a Nova-built Learn app against the PDD that specified it — module count, order, Assessment Score wiring, content coverage. |
-| pdd-to-test-prompts | 1 | Derive opp-specific Q&A test prompts from an approved PDD. Produces the ground-truth suite for the Phase 4 OCS chatbot deep gate. |
-| solicitation-create | 6 | Translate the PDD into a solicitation payload, derive evaluation criteria, and publish via connect-labs MCP. Captures solicitation_id. |
-| solicitation-create-eval | 6 | Grade a published solicitation against its source PDD — scope fidelity, field completeness, deadline sensibility. |
-| solicitation-monitor | 6 | Recurring poll for solicitation responses. Modes: --quick (count only) / --monitor (full pull, default) / --close (final pull). |
-| solicitation-review | 6 (manual) | Score solicitation responses, recommend an awardee, and (after HITL approval) call award_response and populate opp.yaml.selected_llo. |
-| solicitation-review-eval | 6 | Compare ACE's top-ranked solicitation recommendation against the human's actual award. Detection-rate metric. |
-| timeline-monitor | 7 | Watch whether LLOs are hitting expected milestones on schedule. Email prompts when behind. Recurring during active opp. |
-| training-deck-build | 5 | Render training-deck-outline.md into a Google Slides deck using the ACE template. Produces a presentable Slides URL. |
-| training-deck-outline | 5 | Generate the slide-by-slide markdown outline that training-deck-build renders into a Google Slides deck. Owns one artifact. |
-| training-faq | 5 | Generate anticipated LLO + FLW questions with authoritative answers. Owns one artifact: faq.md. |
-| training-flw-guide | 5 | Generate the FLW-facing step-by-step guide for the Learn and Deliver apps. Owns one artifact: flw-training-guide.md. |
-| training-llo-guide | 5 | Generate the LLO-facing operations document for overseeing FLW deployment. Owns one artifact: llo-manager-guide.md. |
-| training-onboarding-email | 5 | Generate the LLO onboarding email body, consumed by llo-onboarding and personalized per LLO at send time. Owns one artifact. |
-| training-quick-reference | 5 | Generate the one-page printable pocket-card summary for FLWs in the field. Owns one artifact: quick-reference.md. |
-| upload-transcript | xcut | Upload a Claude CLI stream-json transcript (.jsonl) to a deployed ace-web via /api/ingest/upload. Used by /ace:run --ace-web-url. |
-
-### Reference docs (`skills/_*-template.md`)
-
-Three reference documents extract shared boilerplate so skills don't
-duplicate it. Excluded from the skill catalog because filenames start
-with `_`.
-
-- `_eval-template.md` — verdict YAML contract, severity rules, inflation
-  guard, stock blocks for `## MCP Tools Used / ## Mode Behavior /
-  ## Dry-Run Behavior`. Referenced by all 12 `*-eval` skills.
-- `_training-template.md` — per-artifact decomposition rationale, sibling
-  map, common Drive paths. Referenced by the 7 `training-*` skills.
-- `_solicitation-template.md` — `opp.yaml.solicitation` and
-  `opp.yaml.selected_llo` contract, connect-labs MCP atom inventory,
-  Phase 6 → Phase 7 boundary rule. Referenced by all 5 solicitation
-  skills + `llo-invite`.
diff --git a/docs/superpowers/plans/2026-05-04-ace-solicitations-phase.md b/docs/superpowers/plans/2026-05-04-ace-solicitations-phase.md
deleted file mode 100644
index 48b00109..00000000
--- a/docs/superpowers/plans/2026-05-04-ace-solicitations-phase.md
+++ /dev/null
@@ -1,2425 +0,0 @@
-# ACE Solicitations Phase Implementation Plan
-
-> **For agentic workers:** REQUIRED SUB-SKILL: Use superpowers:subagent-driven-development (recommended) or superpowers:executing-plans to implement this plan task-by-task. Steps use checkbox (`- [ ]`) syntax for tracking.
-
-**Goal:** Insert a new Phase 7 (Solicitation Management) into the ACE lifecycle, renumber the existing Phase 7 (LLO Management → Execution Management) and Phase 8 (Closeout) to 7 and 8, and wire ACE to consume the existing connect-labs remote MCP for solicitation/review/award atoms.
-
-**Architecture:** New `solicitation-management` subagent owns Phase 7. Two skills (`solicitation-create`, `llo-invite` — moved from old Phase 7 and rewritten) run in default `/ace:run`; one recurring skill (`solicitation-monitor`); one manual skill (`solicitation-review`). ACE consumes connect-labs's remote MCP at `https://labs.connect.dimagi.com/mcp/` via a thin local stdio proxy (`mcp/connect-labs-server.ts`) that forwards JSON-RPC and injects the bearer PAT. No new atoms in `ace-connect`.
-
-**Tech Stack:** TypeScript (`npx tsx` MCP subprocesses), vitest, Markdown SKILL.md prompt files, `.claude-plugin/plugin.json` for MCP wiring, 1Password for secrets.
-
-**Spec:** [`docs/superpowers/specs/2026-05-04-ace-solicitations-phase-design.md`](../specs/2026-05-04-ace-solicitations-phase-design.md)
-
----
-
-## File Structure
-
-**New files:**
-- `mcp/connect-labs-server.ts` — stdio MCP proxy forwarding to `labs.connect.dimagi.com/mcp/`
-- `agents/solicitation-management.md` — Phase 7 subagent
-- `skills/solicitation-create/SKILL.md`
-- `skills/solicitation-create-eval/SKILL.md`
-- `skills/solicitation-monitor/SKILL.md`
-- `skills/solicitation-review/SKILL.md`
-- `skills/solicitation-review-eval/SKILL.md`
-- `test/mcp/connect-labs/proxy.test.ts` — unit test for the proxy
-- `test/mcp/connect-labs/integration/e2e.integration.test.ts` — `LABS_INTEGRATION=1` end-to-end
-- `test/skills/solicitation/*.test.ts` — fixture-driven validation
-- `test/fixtures/CRISPR-Test-004-Solicitation/` — golden fixture
-
-**Renamed files:**
-- `agents/llo-manager.md` → `agents/execution-manager.md`
-
-**Heavily modified:**
-- `skills/llo-invite/SKILL.md` — rewritten; phase moves Phase 8 → Phase 7; behavior changes from Connect-roster prep to solicitation-invite email
-- `skills/llo-onboarding/SKILL.md` — reads `selected_llo` from `opp.yaml`, fails fast if empty
-- `agents/ace-orchestrator.md` — phases block, pause-points, prose
-- `lib/artifact-manifest.ts` — drop `connect-setup/invites.md` artifacts, add solicitation/* artifacts
-- `bin/ace-doctor` — new `[Connect Labs]` section
-- `templates/pdd-template.md` — three new optional fields
-- `CLAUDE.md` — phase order list, plugin overview, pause-points
-
-**Search/replace pass (low-content edits):**
-- `agents/connect-setup.md`, `agents/ocs-setup.md`, `agents/qa-and-training.md` — Phase 7/7 references
-- `skills/training-onboarding-email/SKILL.md`, `skills/training-deck-build/SKILL.md`, `skills/llo-launch-eval/SKILL.md`, `skills/cycle-grade-eval/SKILL.md`, `skills/connect-opp-setup/SKILL.md`, `skills/ocs-widget-handoff-eval/SKILL.md`
-- `commands/run.md`, `commands/step.md`
-
----
-
-### Task 1: Renumber phase ordinals across the codebase
-
-**Files:**
-- Modify: `agents/ace-orchestrator.md`
-- Modify: `agents/connect-setup.md`
-- Modify: `agents/ocs-setup.md`
-- Modify: `agents/qa-and-training.md`
-- Modify: `agents/llo-manager.md` (will be renamed in Task 2)
-- Modify: `skills/training-onboarding-email/SKILL.md`
-- Modify: `skills/training-deck-build/SKILL.md`
-- Modify: `skills/llo-launch-eval/SKILL.md`
-- Modify: `skills/cycle-grade-eval/SKILL.md`
-- Modify: `skills/connect-opp-setup/SKILL.md`
-- Modify: `skills/ocs-widget-handoff-eval/SKILL.md`
-- Modify: `bin/ace-doctor`
-
-- [ ] **Step 1: Inspect every Phase 7/7 reference**
-
-Run: `grep -rn "Phase 7\|Phase 8\|phase_ordinal: 6\|phase_ordinal: 7\|phase: 6\|phase: 7\|phase_6\|phase_7\|phases.llo_management\|llo-management" agents/ skills/ commands/ bin/ lib/ CLAUDE.md README.md`
-
-Expected: ~50 matches across the files listed above. Read each match in context — some are descriptive prose ("Phase 7 (LLO Management)"), some are frontmatter (`phase_ordinal: 6`), some are state-key names (`phase_6_backlog`).
-
-- [ ] **Step 2: Apply mechanical replacements**
-
-The renumbering is a context-aware substitution. For each file, apply:
-- `phase_ordinal: 7` (where it was 7 in `closeout`) → `phase_ordinal: 8`
-- `phase_ordinal: 6` (where it was 6 in `llo-manager`) → `phase_ordinal: 7`
-- `Phase 7` (where it referred to llo-management) → `Phase 8`
-- `Phase 8` (where it referred to closeout) → `Phase 9`
-- `phase_6_backlog` (orchestrator state key for old Phase 7) → `phase_7_backlog`
-- `phase_7_backlog` (orchestrator state key for old Phase 8) → `phase_8_backlog`
-- `phases.llo_management` → `phases.execution_management`
-
-**Do not yet touch:**
-- The `Phase 7` references in `solicitation-management` (it doesn't exist yet)
-- The `phases:` block in `ace-orchestrator.md` (handled in Task 18 — must add new entry and renumber atomically)
-
-For `bin/ace-doctor`: any `phase_6_*` / `phase_7_*` health check identifiers shift up by one. The ` [LLO Management]` section header becomes ` [Execution Management]`.
-
-- [ ] **Step 3: Verify no leftover stale references**
-
-Run: `grep -rn "phase_6_backlog\|phases.llo_management\|Phase 7 (LLO\|Phase 8 (Closeout)" agents/ skills/ commands/ bin/ lib/`
-
-Expected: zero matches. (The orchestrator's `phases:` block will still have `phase_ordinal` integers; that's fine — they get rewritten in Task 18.)
-
-- [ ] **Step 4: Run vitest to verify nothing structural broke**
-
-Run: `npm test -- --run`
-
-Expected: tests pass at the same rate as on `main` for this branch. Renumbering is documentation-level; nothing in code references phase numbers as integers except `lib/artifact-manifest.ts` (which doesn't number phases — it uses string names like `'design'`, `'connect'`).
-
-- [ ] **Step 5: Commit**
-
-```bash
-git add agents/ skills/ bin/ace-doctor
-git commit -m "refactor(phases): renumber Phase 7→8, Phase 8→9 (no behavior change)
-
-Pure rename pass: prepares the topology for the new Phase 7
-(Solicitation Management) added in subsequent commits. Touches phase
-ordinals, run-state backlog keys, and prose references in agents,
-skills, and doctor sections. The orchestrator's phases: block is
-rewritten atomically in a later task.
-
-Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>"
-```
-
----
-
-### Task 2: Rename `llo-manager` agent to `execution-manager`
-
-**Files:**
-- Rename: `agents/llo-manager.md` → `agents/execution-manager.md`
-
-- [ ] **Step 1: Rename the file via git**
-
-Run: `git mv agents/llo-manager.md agents/execution-manager.md`
-
-- [ ] **Step 2: Rewrite the frontmatter and opening prose**
-
-Edit `agents/execution-manager.md`. Replace the frontmatter block:
-
-```yaml
----
-name: execution-manager
-description: >
-  Phase 8 of the CRISPR-Connect lifecycle: execute the awarded LLO's run
-  of the opportunity — onboarding, UAT, go-live, and recurring monitoring.
-  Phase 8 entry is gated on `opp.yaml.selected_llo.org_slug` being populated
-  by Phase 7's solicitation-review skill (which the run halts before).
-model: inherit
-phase: execution-management
-phase_display: Execution Management
-phase_ordinal: 7
-skills:
-  - { name: llo-onboarding,  has_judge: false }
-  - { name: llo-uat,         has_judge: false }
-  - { name: llo-launch,      has_judge: true,  eval_skill: llo-launch-eval }
-recurring_skills:
-  - { name: timeline-monitor,   has_judge: true }
-  - { name: flw-data-review,    has_judge: true,  eval_skill: flw-data-review-eval }
-  - { name: ocs-chatbot-qa,     has_judge: false }
-  - { name: ocs-chatbot-eval,   has_judge: true }
----
-```
-
-Note: `llo-invite` is removed from the skills list (it moves to Phase 7 in Task 14). The remainder of the agent body keeps its existing prose for `llo-onboarding`, `llo-uat`, `llo-launch`, and the recurring skills — only the phase numbering and the "first LLO contact" framing get rewritten.
-
-- [ ] **Step 3: Update the agent body's opening paragraph**
-
-Replace the opening "You run the first LLO-facing phase..." paragraph with:
-
-> You run the execution phase of a CRISPR-Connect opportunity. By the time this phase starts, Phase 7 (Solicitation Management) has published a solicitation, collected responses, and (via the manual `solicitation-review` skill) awarded an org. The awardee is recorded in `opp.yaml.selected_llo` — that's the LLO this phase onboards, supports through UAT, takes to go-live, and monitors during execution.
-
-- [ ] **Step 4: Strip Step 1 (LLO Invitation List) from the body**
-
-The existing agent body has a "### Step 1: LLO Invitation List" section that calls `llo-invite`. Delete that section. Renumber the remaining steps so what was Step 2 (LLO Onboarding) becomes Step 1, Step 3 → Step 2, Step 4 → Step 3, Step 5 → Step 4. Update internal cross-references ("Step 1" / "Step 2" / etc.) accordingly.
-
-- [ ] **Step 5: Update the orchestrator-side dispatch reference**
-
-Run: `grep -rn "llo-manager\|ace:llo-manager" agents/ commands/ CLAUDE.md`
-
-For each match, replace `llo-manager` → `execution-manager` (preserve case and the `ace:` prefix where applicable). The orchestrator's `Agent(llo-manager)` calls will be rewritten when the phases block is updated in Task 18.
-
-- [ ] **Step 6: Commit**
-
-```bash
-git add agents/execution-manager.md agents/llo-manager.md commands/ CLAUDE.md
-git commit -m "refactor(agent): rename llo-manager → execution-manager
-
-Phase 8 (was Phase 7) is no longer 'first LLO contact' — that role moves
-to the new Phase 7 (Solicitation Management) which publishes solicitations
-and invites candidate LLOs. Phase 8 takes over once an awardee exists.
-
-Drops the llo-invite skill from the agent's skill list (it moves to
-Phase 7 in a later commit). Renumbers internal step numbering.
-
-Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>"
-```
-
----
-
-### Task 3: Build the connect-labs MCP stdio proxy
-
-**Files:**
-- Create: `mcp/connect-labs-server.ts`
-- Create: `test/mcp/connect-labs/proxy.test.ts`
-
-**Why a proxy.** The labs MCP runs as remote HTTP at `https://labs.connect.dimagi.com/mcp/`. ACE's existing MCP wiring (`.claude-plugin/plugin.json`) only uses stdio MCPs (`command + args`). Rather than experiment with whether plugin.json supports `type: "http"` mcpServers, write a thin local stdio proxy that forwards JSON-RPC frames to labs over HTTP and injects the bearer PAT. Same shape as `mcp/google-drive-server.ts`, `mcp/ocs-server.ts`, etc.
-
-- [ ] **Step 1: Write the failing test**
-
-Create `test/mcp/connect-labs/proxy.test.ts`:
-
-```typescript
-import { describe, it, expect, beforeEach, vi } from 'vitest';
-import { spawn } from 'node:child_process';
-import path from 'node:path';
-
-const PROXY_PATH = path.resolve(__dirname, '../../../mcp/connect-labs-server.ts');
-
-describe('connect-labs-server (stdio → HTTP proxy)', () => {
-  beforeEach(() => {
-    vi.restoreAllMocks();
-  });
-
-  it('forwards a JSON-RPC frame to labs with Bearer auth and returns the body', async () => {
-    const fetchSpy = vi.spyOn(global, 'fetch').mockResolvedValue(
-      new Response(JSON.stringify({ jsonrpc: '2.0', id: 1, result: { ok: true } }), {
-        status: 200,
-        headers: { 'Content-Type': 'application/json' },
-      }),
-    );
-
-    // The proxy is launched as a subprocess in real use; here we import its forward()
-    // function directly for unit testing.
-    const { forward } = await import('../../../mcp/connect-labs-server');
-    const out = await forward(
-      { jsonrpc: '2.0', id: 1, method: 'tools/list', params: {} },
-      { token: 'test-token', url: 'https://labs.example/mcp/' },
-    );
-
-    expect(fetchSpy).toHaveBeenCalledOnce();
-    const [calledUrl, init] = fetchSpy.mock.calls[0];
-    expect(calledUrl).toBe('https://labs.example/mcp/');
-    expect(init?.method).toBe('POST');
-    expect(init?.headers).toMatchObject({
-      'Authorization': 'Bearer test-token',
-      'Content-Type': 'application/json',
-    });
-    expect(JSON.parse(init?.body as string)).toEqual({
-      jsonrpc: '2.0',
-      id: 1,
-      method: 'tools/list',
-      params: {},
-    });
-    expect(out).toEqual({ jsonrpc: '2.0', id: 1, result: { ok: true } });
-  });
-
-  it('returns a JSON-RPC error envelope when the upstream returns 401', async () => {
-    vi.spyOn(global, 'fetch').mockResolvedValue(
-      new Response(JSON.stringify({ error: { code: 'PERMISSION_DENIED', message: 'bad token' } }), {
-        status: 401,
-      }),
-    );
-    const { forward } = await import('../../../mcp/connect-labs-server');
-    const out = await forward(
-      { jsonrpc: '2.0', id: 2, method: 'tools/list', params: {} },
-      { token: 'bad', url: 'https://labs.example/mcp/' },
-    );
-    expect(out).toMatchObject({
-      jsonrpc: '2.0',
-      id: 2,
-      error: {
-        code: -32000,
-        message: expect.stringContaining('401'),
-      },
-    });
-  });
-
-  it('throws if LABS_MCP_TOKEN is empty when invoked without an explicit token', async () => {
-    const { forward } = await import('../../../mcp/connect-labs-server');
-    await expect(
-      forward({ jsonrpc: '2.0', id: 3, method: 'tools/list' }, { token: '', url: 'https://labs.example/mcp/' }),
-    ).rejects.toThrow(/LABS_MCP_TOKEN/);
-  });
-});
-```
-
-- [ ] **Step 2: Run the test to verify it fails**
-
-Run: `npm test -- --run test/mcp/connect-labs/proxy.test.ts`
-
-Expected: FAIL with "Cannot find module '../../../mcp/connect-labs-server'".
-
-- [ ] **Step 3: Implement the proxy**
-
-Create `mcp/connect-labs-server.ts`:
-
-```typescript
-#!/usr/bin/env tsx
-/**
- * connect-labs-server: stdio MCP proxy to labs.connect.dimagi.com/mcp/.
- *
- * Reads LABS_MCP_TOKEN from ${CLAUDE_PLUGIN_DATA}/.env (legacy fallback:
- * plugin root .env), then forwards every JSON-RPC frame received on stdin
- * over HTTPS to the labs MCP, injecting `Authorization: Bearer <token>`.
- * The HTTP response body is written back to stdout as a single line.
- *
- * Stays a stdio MCP because ACE's plugin.json only wires stdio mcpServers
- * (verified via grep: every existing entry uses `command + args`). When
- * Claude Code's plugin.json gains first-class HTTP MCP support, this
- * proxy can be deleted in favor of a direct `type: "http"` entry.
- */
-
-import { readFileSync } from 'node:fs';
-import { join } from 'node:path';
-import { createInterface } from 'node:readline';
-
-export interface JsonRpcFrame {
-  jsonrpc: '2.0';
-  id?: number | string;
-  method?: string;
-  params?: unknown;
-  result?: unknown;
-  error?: { code: number; message: string; data?: unknown };
-}
-
-export interface ForwardOpts {
-  token: string;
-  url: string;
-}
-
-export async function forward(frame: JsonRpcFrame, opts: ForwardOpts): Promise<JsonRpcFrame> {
-  if (!opts.token) {
-    throw new Error('LABS_MCP_TOKEN is required to forward to labs MCP');
-  }
-  const res = await fetch(opts.url, {
-    method: 'POST',
-    headers: {
-      'Authorization': `Bearer ${opts.token}`,
-      'Content-Type': 'application/json',
-    },
-    body: JSON.stringify(frame),
-  });
-  if (!res.ok) {
-    return {
-      jsonrpc: '2.0',
-      id: frame.id,
-      error: {
-        code: -32000,
-        message: `labs MCP returned ${res.status}: ${await res.text()}`,
-      },
-    };
-  }
-  return (await res.json()) as JsonRpcFrame;
-}
-
-function loadEnvFile(path: string): Record<string, string> {
-  try {
-    const txt = readFileSync(path, 'utf8');
-    const out: Record<string, string> = {};
-    for (const line of txt.split('\n')) {
-      if (!line || line.startsWith('#')) continue;
-      const eq = line.indexOf('=');
-      if (eq <= 0) continue;
-      out[line.slice(0, eq).trim()] = line.slice(eq + 1).trim().replace(/^['"]|['"]$/g, '');
-    }
-    return out;
-  } catch {
-    return {};
-  }
-}
-
-function loadToken(): string {
-  if (process.env.LABS_MCP_TOKEN) return process.env.LABS_MCP_TOKEN;
-  const dataDir = process.env.CLAUDE_PLUGIN_DATA;
-  if (dataDir) {
-    const fromData = loadEnvFile(join(dataDir, '.env')).LABS_MCP_TOKEN;
-    if (fromData) return fromData;
-  }
-  const rootEcho = process.env.CLAUDE_PLUGIN_ROOT_ECHO;
-  if (rootEcho) {
-    const fromRoot = loadEnvFile(join(rootEcho, '.env')).LABS_MCP_TOKEN;
-    if (fromRoot) return fromRoot;
-  }
-  return '';
-}
-
-async function main() {
-  const token = loadToken();
-  const url = process.env.LABS_MCP_URL || 'https://labs.connect.dimagi.com/mcp/';
-
-  const rl = createInterface({ input: process.stdin, crlfDelay: Infinity });
-  for await (const line of rl) {
-    const trimmed = line.trim();
-    if (!trimmed) continue;
-    let frame: JsonRpcFrame;
-    try {
-      frame = JSON.parse(trimmed) as JsonRpcFrame;
-    } catch (e) {
-      process.stdout.write(JSON.stringify({
-        jsonrpc: '2.0',
-        id: null,
-        error: { code: -32700, message: `Parse error: ${(e as Error).message}` },
-      }) + '\n');
-      continue;
-    }
-    try {
-      const reply = await forward(frame, { token, url });
-      process.stdout.write(JSON.stringify(reply) + '\n');
-    } catch (e) {
-      process.stdout.write(JSON.stringify({
-        jsonrpc: '2.0',
-        id: frame.id,
-        error: { code: -32000, message: (e as Error).message },
-      }) + '\n');
-    }
-  }
-}
-
-if (import.meta.url === `file://${process.argv[1]}`) {
-  main().catch((e) => {
-    process.stderr.write(`connect-labs-server fatal: ${(e as Error).stack || e}\n`);
-    process.exit(1);
-  });
-}
-```
-
-- [ ] **Step 4: Run the test to verify it passes**
-
-Run: `npm test -- --run test/mcp/connect-labs/proxy.test.ts`
-
-Expected: 3/3 pass.
-
-- [ ] **Step 5: Commit**
-
-```bash
-git add mcp/connect-labs-server.ts test/mcp/connect-labs/proxy.test.ts
-git commit -m "feat(mcp): add connect-labs stdio proxy to labs MCP
-
-Forwards JSON-RPC frames from Claude Code (stdio) to
-labs.connect.dimagi.com/mcp/ (HTTP) with Bearer PAT injected from
-LABS_MCP_TOKEN. Same shape as the other ACE MCP servers.
-
-Plugin.json wiring + .env.tpl + doctor checks land in subsequent commits.
-
-Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>"
-```
-
----
-
-### Task 4: Wire LABS_MCP_TOKEN into `.env.tpl` and `plugin.json`
-
-**Files:**
-- Modify: `.env.tpl`
-- Modify: `.claude-plugin/plugin.json`
-
-- [ ] **Step 1: Add the env var to `.env.tpl`**
-
-Append to `.env.tpl` (after the existing Connect block):
-
-```
-# Connect Labs (solicitations / reviews / awards) — labs.connect.dimagi.com
-# Bearer PAT for the labs MCP, scoped to the ace@dimagi-ai.com labs user.
-# To rotate: a labs admin runs:
-#   python manage.py mcp_create_token --user ace@dimagi-ai.com --name ACE-plugin --ttl-days 0
-# then drops the printed token into the 1Password item below.
-LABS_MCP_TOKEN=op://Dimagi/labs-mcp-pat-ace/credential
-```
-
-- [ ] **Step 2: Add the MCP entry to `plugin.json`**
-
-In `.claude-plugin/plugin.json`, append a new entry under `mcpServers` (after the existing `ace-mobile` entry):
-
-```jsonc
-"connect-labs": {
-  "command": "npx",
-  "args": ["tsx", "${CLAUDE_PLUGIN_ROOT}/mcp/connect-labs-server.ts"],
-  "env": {
-    "CLAUDE_PLUGIN_DATA": "${CLAUDE_PLUGIN_DATA}",
-    "CLAUDE_PLUGIN_ROOT_ECHO": "${CLAUDE_PLUGIN_ROOT}"
-  }
-}
-```
-
-Note: the proxy reads `LABS_MCP_TOKEN` from `${CLAUDE_PLUGIN_DATA}/.env` itself (see Task 3 step 3) — passing `CLAUDE_PLUGIN_DATA` is enough; we do not put the token directly in plugin.json's `env` block.
-
-- [ ] **Step 3: Verify the manifest still parses**
-
-Run: `node -e "JSON.parse(require('fs').readFileSync('.claude-plugin/plugin.json', 'utf8'))"`
-
-Expected: no output, exit 0.
-
-- [ ] **Step 4: Verify the marketplace mirror is in sync**
-
-Run: `node -e "const m = JSON.parse(require('fs').readFileSync('.claude-plugin/marketplace.json', 'utf8'));"`
-
-Expected: no output, exit 0. (The version-sync hook keeps marketplace.json in sync with plugin.json on commit.)
-
-- [ ] **Step 5: Commit**
-
-```bash
-git add .env.tpl .claude-plugin/plugin.json
-git commit -m "feat(mcp): wire connect-labs MCP into plugin manifest
-
-Adds the connect-labs stdio proxy to mcpServers and LABS_MCP_TOKEN to
-.env.tpl. After op inject, ACE skills can call mcp__connect-labs__*
-atoms (create_solicitation, list_responses, award_response, etc.).
-
-Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>"
-```
-
----
-
-### Task 5: Add `[Connect Labs]` doctor section
-
-**Files:**
-- Modify: `bin/ace-doctor`
-- Create: `test/doctor/connect-labs.test.ts`
-
-- [ ] **Step 1: Find the existing pattern**
-
-Read `bin/ace-doctor` and locate the `[Connect]` section (anchor: a line beginning with `## ` or printing a `[Connect]` header). Note the style: each check prints a tag (e.g. `connect_env`, `connect_session`) followed by `OK | WARN | FAIL` and a one-liner explanation.
-
-- [ ] **Step 2: Write the failing test**
-
-Create `test/doctor/connect-labs.test.ts`:
-
-```typescript
-import { describe, it, expect, vi, beforeEach } from 'vitest';
-
-// The doctor module exports check functions (assumes bin/ace-doctor has been
-// refactored to expose ts-importable helpers; if not, this test invokes them
-// via subprocess).
-
-describe('doctor [Connect Labs] checks', () => {
-  beforeEach(() => vi.restoreAllMocks());
-
-  it('connect_labs_env: FAIL when LABS_MCP_TOKEN missing', async () => {
-    const { checkConnectLabsEnv } = await import('../../bin/checks/connect-labs');
-    const result = await checkConnectLabsEnv({ envFile: 'test/fixtures/empty.env' });
-    expect(result.tag).toBe('connect_labs_env');
-    expect(result.status).toBe('FAIL');
-    expect(result.message).toMatch(/LABS_MCP_TOKEN/);
-  });
-
-  it('connect_labs_env: OK when token present', async () => {
-    const { checkConnectLabsEnv } = await import('../../bin/checks/connect-labs');
-    const result = await checkConnectLabsEnv({ envFile: 'test/fixtures/with-labs-token.env' });
-    expect(result.status).toBe('OK');
-  });
-
-  it('connect_labs_mcp_reachable: FAIL on 401 (PAT bad)', async () => {
-    vi.spyOn(global, 'fetch').mockResolvedValue(new Response('', { status: 401 }));
-    const { checkConnectLabsReachable } = await import('../../bin/checks/connect-labs');
-    const result = await checkConnectLabsReachable({ token: 'bad', url: 'https://labs.example/mcp/' });
-    expect(result.status).toBe('FAIL');
-    expect(result.message).toMatch(/PAT|401/);
-  });
-
-  it('connect_labs_mcp_reachable: OK on 200', async () => {
-    vi.spyOn(global, 'fetch').mockResolvedValue(
-      new Response(JSON.stringify({ jsonrpc: '2.0', id: 1, result: {} }), { status: 200 }),
-    );
-    const { checkConnectLabsReachable } = await import('../../bin/checks/connect-labs');
-    const result = await checkConnectLabsReachable({ token: 'good', url: 'https://labs.example/mcp/' });
-    expect(result.status).toBe('OK');
-  });
-
-  it('connect_labs_connect_oauth: WARN with actionable hint when tool returns PERMISSION_DENIED', async () => {
-    vi.spyOn(global, 'fetch').mockResolvedValue(
-      new Response(JSON.stringify({
-        jsonrpc: '2.0',
-        id: 1,
-        error: { code: -32000, message: 'PERMISSION_DENIED: connect oauth required' },
-      }), { status: 200 }),
-    );
-    const { checkConnectLabsConnectOAuth } = await import('../../bin/checks/connect-labs');
-    const result = await checkConnectLabsConnectOAuth({ token: 'good', url: 'https://labs.example/mcp/' });
-    expect(result.status).toBe('WARN');
-    expect(result.message).toMatch(/Connect OAuth/);
-  });
-
-  it('connect_labs_connect_oauth: OK on a successful tools/call list_solicitations', async () => {
-    vi.spyOn(global, 'fetch').mockResolvedValue(
-      new Response(JSON.stringify({
-        jsonrpc: '2.0',
-        id: 1,
-        result: { content: [{ type: 'text', text: '[]' }] },
-      }), { status: 200 }),
-    );
-    const { checkConnectLabsConnectOAuth } = await import('../../bin/checks/connect-labs');
-    const result = await checkConnectLabsConnectOAuth({ token: 'good', url: 'https://labs.example/mcp/' });
-    expect(result.status).toBe('OK');
-  });
-});
-```
-
-Also create test fixture files:
-- `test/fixtures/empty.env` — empty file
-- `test/fixtures/with-labs-token.env` — single line `LABS_MCP_TOKEN=test-token`
-
-- [ ] **Step 3: Run the test to verify it fails**
-
-Run: `npm test -- --run test/doctor/connect-labs.test.ts`
-
-Expected: FAIL with "Cannot find module '../../bin/checks/connect-labs'".
-
-- [ ] **Step 4: Implement the check helpers**
-
-Create `bin/checks/connect-labs.ts`:
-
-```typescript
-import { readFileSync } from 'node:fs';
-
-export interface CheckResult {
-  tag: string;
-  status: 'OK' | 'WARN' | 'FAIL';
-  message: string;
-}
-
-function parseEnvFile(path: string): Record<string, string> {
-  try {
-    const out: Record<string, string> = {};
-    for (const line of readFileSync(path, 'utf8').split('\n')) {
-      if (!line || line.startsWith('#')) continue;
-      const eq = line.indexOf('=');
-      if (eq <= 0) continue;
-      out[line.slice(0, eq).trim()] = line.slice(eq + 1).trim().replace(/^['"]|['"]$/g, '');
-    }
-    return out;
-  } catch {
-    return {};
-  }
-}
-
-export async function checkConnectLabsEnv(opts: { envFile: string }): Promise<CheckResult> {
-  const env = parseEnvFile(opts.envFile);
-  const token = env.LABS_MCP_TOKEN;
-  if (!token || token.startsWith('op://')) {
-    return {
-      tag: 'connect_labs_env',
-      status: 'FAIL',
-      message: 'LABS_MCP_TOKEN missing or unrendered. Run: op inject -i .env.tpl -o "$CLAUDE_PLUGIN_DATA/.env" --account dimagi.1password.com',
-    };
-  }
-  return { tag: 'connect_labs_env', status: 'OK', message: 'LABS_MCP_TOKEN present' };
-}
-
-export async function checkConnectLabsReachable(opts: { token: string; url: string }): Promise<CheckResult> {
-  try {
-    const res = await fetch(opts.url, {
-      method: 'POST',
-      headers: { 'Authorization': `Bearer ${opts.token}`, 'Content-Type': 'application/json' },
-      body: JSON.stringify({ jsonrpc: '2.0', id: 1, method: 'initialize', params: {} }),
-    });
-    if (res.status === 401) {
-      return { tag: 'connect_labs_mcp_reachable', status: 'FAIL', message: 'Labs MCP returned 401 — PAT invalid or revoked. Rotate via mcp_create_token.' };
-    }
-    if (!res.ok) {
-      return { tag: 'connect_labs_mcp_reachable', status: 'FAIL', message: `Labs MCP returned ${res.status}` };
-    }
-    return { tag: 'connect_labs_mcp_reachable', status: 'OK', message: 'Labs MCP reachable + PAT accepted' };
-  } catch (e) {
-    return { tag: 'connect_labs_mcp_reachable', status: 'FAIL', message: `Cannot reach labs MCP: ${(e as Error).message}` };
-  }
-}
-
-export async function checkConnectLabsConnectOAuth(opts: { token: string; url: string }): Promise<CheckResult> {
-  try {
-    const res = await fetch(opts.url, {
-      method: 'POST',
-      headers: { 'Authorization': `Bearer ${opts.token}`, 'Content-Type': 'application/json' },
-      body: JSON.stringify({
-        jsonrpc: '2.0',
-        id: 1,
-        method: 'tools/call',
-        params: { name: 'list_solicitations', arguments: {} },
-      }),
-    });
-    const body = await res.json() as { error?: { message: string }; result?: unknown };
-    if (body.error?.message?.includes('PERMISSION_DENIED') || body.error?.message?.includes('connect')) {
-      return {
-        tag: 'connect_labs_connect_oauth',
-        status: 'WARN',
-        message: 'Labs accepts the PAT but the ace user has not completed Connect OAuth linkage. Have ace@dimagi-ai.com sign into labs once and authorize Connect.',
-      };
-    }
-    if (body.error) {
-      return { tag: 'connect_labs_connect_oauth', status: 'FAIL', message: `list_solicitations error: ${body.error.message}` };
-    }
-    return { tag: 'connect_labs_connect_oauth', status: 'OK', message: 'list_solicitations responded — Connect OAuth bridge is live' };
-  } catch (e) {
-    return { tag: 'connect_labs_connect_oauth', status: 'FAIL', message: `Probe failed: ${(e as Error).message}` };
-  }
-}
-```
-
-- [ ] **Step 5: Wire the helpers into `bin/ace-doctor`**
-
-Locate the section in `bin/ace-doctor` that prints `[Connect]` (right before or after the OCS section). After it, add a new `[Connect Labs]` section that calls the three helpers via a small TypeScript invocation. Mirror the existing pattern — if `bin/ace-doctor` is already a bash script that shells into a TS helper for the Connect section, do the same for Connect Labs. If it's pure bash that does HTTP via curl, port the three checks to bash equivalents that call the same `tag/status/message` shape.
-
-If `bin/ace-doctor` is a thin bash wrapper around `tsx`, add a single block:
-
-```bash
-echo ""
-echo "[Connect Labs]"
-npx tsx -e "
-import { checkConnectLabsEnv, checkConnectLabsReachable, checkConnectLabsConnectOAuth } from './bin/checks/connect-labs';
-import { join } from 'node:path';
-const dataDir = process.env.CLAUDE_PLUGIN_DATA || process.env.HOME + '/.ace';
-const envFile = join(dataDir, '.env');
-const tokenEntry = (await checkConnectLabsEnv({ envFile }));
-console.log(\`  \${tokenEntry.tag.padEnd(36)} \${tokenEntry.status.padEnd(4)} \${tokenEntry.message}\`);
-if (tokenEntry.status === 'OK') {
-  const env = require('fs').readFileSync(envFile, 'utf8');
-  const token = env.match(/LABS_MCP_TOKEN=(.+)/)?.[1]?.trim() || '';
-  const url = process.env.LABS_MCP_URL || 'https://labs.connect.dimagi.com/mcp/';
-  for (const check of [checkConnectLabsReachable, checkConnectLabsConnectOAuth]) {
-    const r = await check({ token, url });
-    console.log(\`  \${r.tag.padEnd(36)} \${r.status.padEnd(4)} \${r.message}\`);
-  }
-}
-"
-```
-
-(Adjust to match the actual style and indentation of the existing `[Connect]` section. The principle is: read the env file, run the three checks in order, print one line per check.)
-
-- [ ] **Step 6: Run the test to verify it passes**
-
-Run: `npm test -- --run test/doctor/connect-labs.test.ts`
-
-Expected: 6/6 pass.
-
-- [ ] **Step 7: Smoke `/ace:doctor` locally**
-
-Run: `bin/ace-doctor`
-
-Expected output includes a `[Connect Labs]` section with three `OK/WARN/FAIL` lines. With no PAT yet provisioned, `connect_labs_env` should be FAIL with the actionable `op inject` hint.
-
-- [ ] **Step 8: Commit**
-
-```bash
-git add bin/ace-doctor bin/checks/connect-labs.ts test/doctor/connect-labs.test.ts test/fixtures/empty.env test/fixtures/with-labs-token.env
-git commit -m "feat(doctor): add [Connect Labs] section with token + OAuth probes
-
-Three checks, mirroring the [Connect] section pattern:
-- connect_labs_env: LABS_MCP_TOKEN present and rendered (not op://)
-- connect_labs_mcp_reachable: PAT accepted by labs MCP (distinguishes
-  401 / network / OK)
-- connect_labs_connect_oauth: list_solicitations probe distinguishes
-  PAT-level 401 from tool-level PERMISSION_DENIED (Connect OAuth missing
-  on the ace user's labs account)
-
-Class-level preventer for silent labs misconfig.
-
-Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>"
-```
-
----
-
-### Task 6: Add three optional fields to the PDD template
-
-**Files:**
-- Modify: `templates/pdd-template.md`
-- Modify: `skills/idea-to-pdd/SKILL.md`
-
-- [ ] **Step 1: Append the new fields to the PDD template**
-
-Read `templates/pdd-template.md`. After the existing `total_budget:` field (or at the end of the PDD frontmatter / metadata block, wherever budget lives), add:
-
-```yaml
-# ── Solicitation (optional, drives Phase 7) ────────────────────────────
-# These fields are read by `solicitation-create` to build the solicitation
-# published to labs.connect.dimagi.com. Safe to omit — defaults below.
-solicitation_type: EOI                # 'EOI' (Expression of Interest) | 'RFP' (Request for Proposals)
-solicitation_deadline_days: 14        # response window from publish date
-llo_questions:                        # optional response template
-  - "Describe your prior experience deploying CHW programs in this archetype."
-  - "How will you recruit and train FLWs for this scope?"
-  - "What is your timeline for fielding once awarded?"
-  - "What is your supervision model?"
-  - "Do you have local-language capacity matching the target geography?"
-  - "Provide a budget breakdown for the proposed scope."
-
-# ── Preferred LLOs (optional, used by Phase 7 llo-invite) ──────────────
-preferred_llos: []                    # list of { name, contact_email, organization_slug }
-```
-
-(If `preferred_llos` already exists in the PDD template under another section, do not duplicate — only add the three new solicitation fields. Run `grep -n "preferred_llos" templates/pdd-template.md` first to verify.)
-
-- [ ] **Step 2: Update `idea-to-pdd` SKILL.md**
-
-In `skills/idea-to-pdd/SKILL.md`, locate the section that walks the agent through PDD field collection. Add a paragraph noting the three new optional fields and that they default sensibly:
-
-> **Solicitation fields (optional, Phase 7).** If the user names preferred LLOs or a non-default solicitation type/deadline, capture them. Defaults: `solicitation_type: EOI`, `solicitation_deadline_days: 14`, a generic 6-question response template. Skipping these is fine — Phase 7 will use the defaults. Always ask once whether a custom deadline or response template is needed; if not, leave the defaults.
-
-- [ ] **Step 3: Verify existing PDD fixtures still validate**
-
-Run: `npm test -- --run test/fixtures/`
-
-Expected: pass. The new fields are optional, so existing fixtures (`CRISPR-Test-001`, `-002`, `-003`) without them remain valid.
-
-- [ ] **Step 4: Commit**
-
-```bash
-git add templates/pdd-template.md skills/idea-to-pdd/SKILL.md
-git commit -m "feat(pdd): add three optional solicitation fields to PDD template
-
-solicitation_type (EOI|RFP, default EOI), solicitation_deadline_days
-(default 14), llo_questions (default 6-question template). All optional;
-existing PDDs without them continue to validate. Drives the new Phase 7
-solicitation-create skill.
-
-Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>"
-```
-
----
-
-### Task 7: Implement `solicitation-create` skill
-
-**Files:**
-- Create: `skills/solicitation-create/SKILL.md`
-- Modify: `lib/artifact-manifest.ts` (add solicitation/draft.md, solicitation/published.md, opp.yaml.solicitation block)
-
-- [ ] **Step 1: Add the artifact manifest entries**
-
-In `lib/artifact-manifest.ts`, locate the `ARTIFACT_MANIFEST` array. After the last existing entry, add:
-
-```typescript
-// ── Solicitation Management (Phase 7) ──────────────────────────
-
-{
-  path: 'solicitation/draft.md',
-  producedBy: 'solicitation-create',
-  consumedBy: ['solicitation-create-eval'],
-  phase: 'design',  // produced once, audit-only
-  required: false,
-  description: 'Solicitation payload pre-publish: title, type, scope, criteria, response template, deadline. Audit trail for what solicitation-create proposed before posting to labs.',
-},
-{
-  path: 'solicitation/published.md',
-  producedBy: 'solicitation-create',
-  consumedBy: ['solicitation-monitor', 'solicitation-review', 'solicitation-create-eval', 'llo-invite'],
-  phase: 'design',
-  required: false,
-  description: 'Snapshot of the published solicitation: solicitation_id, public_url, manage_url, deadline, criteria. Read by every downstream Phase 7 skill and by Phase 7 llo-invite for the URL to email.',
-},
-{
-  path: 'solicitation/invitations.md',
-  producedBy: 'llo-invite',
-  consumedBy: ['solicitation-monitor', 'solicitation-review-eval'],
-  phase: 'design',
-  required: false,
-  description: 'Per-recipient log: who got emailed the solicitation URL, when, and send status. Empty when PDD has no preferred_llos.',
-},
-{
-  path: 'solicitation/responses/',
-  producedBy: 'solicitation-monitor',
-  consumedBy: ['solicitation-review'],
-  phase: 'design',
-  required: false,
-  description: 'One file per solicitation response, written incrementally as responses arrive. Each file contains the response content plus metadata returned by labs.',
-},
-{
-  path: 'solicitation/review/scoring-rubric.md',
-  producedBy: 'solicitation-review',
-  consumedBy: ['solicitation-review-eval'],
-  phase: 'design',
-  required: false,
-  description: 'Per-response, per-criterion scores produced by solicitation-review.',
-},
-{
-  path: 'solicitation/review/recommendation.md',
-  producedBy: 'solicitation-review',
-  consumedBy: ['solicitation-review-eval'],
-  phase: 'design',
-  required: false,
-  description: 'Ranked candidates + reasoning. Input to the HITL gate before award_response is called.',
-},
-{
-  path: 'solicitation/award-record.md',
-  producedBy: 'solicitation-review',
-  consumedBy: ['solicitation-review-eval', 'opp-closeout'],
-  phase: 'design',
-  required: false,
-  description: 'Written when award_response is called (success or failure). Includes response_id, awarded_at, awarded_org_slug, and any error envelope on failure.',
-},
-```
-
-Also drop the existing entry for `connect-setup/invites.md` (the old `llo-invite` artifact). Locate it in the manifest (`grep -n "connect-setup/invites" lib/artifact-manifest.ts`) and delete that block.
-
-- [ ] **Step 2: Run the manifest validation test**
-
-Run: `npm test -- --run test/fixtures/artifact-manifest.test.ts`
-
-Expected: PASS (or fail with a clear "skill `solicitation-create` referenced by artifact but no SKILL.md found" — that's the next step). If it passes, the validation isn't strict enough to catch missing skills; that's fine, we'll add the skill next.
-
-- [ ] **Step 3: Write the SKILL.md**
-
-Create `skills/solicitation-create/SKILL.md`:
-
-```markdown
----
-name: solicitation-create
-description: >
-  Phase 7 step 1 (auto, default run). Translate the approved PDD into a
-  solicitation payload, derive evaluation criteria via labs's
-  generate_criteria endpoint, and publish the solicitation via the
-  connect-labs MCP. Captures solicitation_id and public_url for downstream
-  skills.
----
-
-# Solicitation Create
-
-Phase 7 default-run skill. Builds and publishes the solicitation in one
-shot — ACE always publishes, never drafts. The solicitation can be edited
-post-publish via the labs UI without affecting responses.
-
-## Inputs
-
-- `ACE/<opp-name>/inputs/pdd.md` — approved PDD (scope, success criteria, total_budget, optional solicitation fields)
-- `ACE/<opp-name>/opp.yaml` — program_id, archetype, opp display name
-
-## Process
-
-1. **Read the PDD.** Extract the fields per the table below. For optional
-   PDD fields (`solicitation_type`, `solicitation_deadline_days`,
-   `llo_questions`), use defaults when missing.
-
-2. **Build the solicitation payload:**
-
-   | Field | Source |
-   |---|---|
-   | `title` | `<solicitation_type>: <pdd.title> — <pdd.archetype>` |
-   | `solicitation_type` | PDD `solicitation_type` (default `EOI`) |
-   | `description` | PDD `intervention_summary` + `target_flw_profile` (concatenate with a newline) |
-   | `scope_of_work` | PDD `visit_structure` + `success_criteria` |
-   | `budget` | PDD `total_budget` |
-   | `deadline` | `now() + (solicitation_deadline_days || 14)` days, ISO-8601 |
-   | `evaluation_criteria` | derived by `generate_criteria` (see step 3) |
-   | `response_template` | PDD `llo_questions` or the default 6-question set |
-   | `status` | `published` |
-   | `program_id` | `opp.yaml.program_id` |
-
-3. **Derive evaluation criteria.** Call:
-
-   ```
-   mcp__connect-labs__generate_criteria(
-     scope_text: <description + scope_of_work>,
-     archetype: <pdd.archetype>
-   )
-   ```
-
-   Capture the structured rubric (criteria + weights) into the payload's
-   `evaluation_criteria` field.
-
-4. **Write the draft for traceability.** Save the full payload + the AI-derived rubric to:
-
-   ```
-   ACE/<opp-name>/solicitation/draft.md
-   ```
-
-5. **Publish.** Call:
-
-   ```
-   mcp__connect-labs__create_solicitation(<payload>)
-   ```
-
-   Capture the returned `solicitation_id`, `public_url`, and `manage_url`.
-
-6. **Write `published.md`.** Save:
-
-   ```
-   ACE/<opp-name>/solicitation/published.md
-   ```
-
-   Body includes the full payload, the returned IDs/URLs, and the deadline
-   in absolute ISO-8601 form.
-
-7. **Update `opp.yaml`.** Add a `solicitation:` block:
-
-   ```yaml
-   solicitation:
-     solicitation_id: <returned>
-     public_url: <returned>
-     manage_url: <returned>
-     type: <EOI|RFP>
-     published_at: <now ISO-8601>
-     deadline: <computed ISO-8601>
-     status: open
-     awarded:
-       response_id: null
-       awarded_at: null
-       awarded_org_slug: null
-       awarded_org_name: null
-       awarded_contact_email: null
-       award_amount: null
-   ```
-
-   Also stub a `selected_llo:` block:
-
-   ```yaml
-   selected_llo:
-     org_slug: null
-     contact_email: null
-     source: null
-     response_id: null
-   ```
-
-   These will be populated by `solicitation-review` on award.
-
-## Error handling
-
-- **Labs MCP unreachable** (proxy returns transport error): halt with a
-  doctor-style message pointing at `/ace:doctor`'s `[Connect Labs]`
-  section.
-- **`create_solicitation` returns 4xx**: preserve `draft.md`, halt, surface
-  the error verbatim. Do not retry — most 4xx is a payload schema mismatch
-  or the program_id is wrong.
-- **`generate_criteria` returns degenerate output** (empty list, single
-  criterion): write what was returned, mark `evaluation_criteria` as
-  `needs-review` in `published.md`, still publish. Criteria are editable
-  post-publish via labs UI without losing responses.
-
-## Output
-
-- `ACE/<opp-name>/solicitation/draft.md` (audit)
-- `ACE/<opp-name>/solicitation/published.md` (live state)
-- `opp.yaml.solicitation.{solicitation_id, public_url, deadline, status: open}` populated
-- `opp.yaml.selected_llo.*` stubbed
-```
-
-- [ ] **Step 4: Run the manifest validation test**
-
-Run: `npm test -- --run test/fixtures/artifact-manifest.test.ts`
-
-Expected: PASS.
-
-- [ ] **Step 5: Commit**
-
-```bash
-git add skills/solicitation-create/ lib/artifact-manifest.ts
-git commit -m "feat(skill): add solicitation-create (Phase 7, default run)
-
-Translates the approved PDD into a solicitation payload, derives
-evaluation criteria via labs's generate_criteria, and publishes via
-mcp__connect-labs__create_solicitation. Writes draft.md (audit) and
-published.md (live state), populates opp.yaml.solicitation.
-
-Manifest entries: drops connect-setup/invites.md (moves to Phase 7 in
-a later commit), adds solicitation/* artifacts.
-
-Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>"
-```
-
----
-
-### Task 8: Add `solicitation-create-eval` rubric
-
-**Files:**
-- Create: `skills/solicitation-create-eval/SKILL.md`
-
-- [ ] **Step 1: Read the existing eval rubric pattern**
-
-Read `skills/connect-program-setup-eval/SKILL.md` to align style and structure with the existing `-eval` family.
-
-- [ ] **Step 2: Write the SKILL.md**
-
-Create `skills/solicitation-create-eval/SKILL.md`:
-
-```markdown
----
-name: solicitation-create-eval
-description: >
-  Provisional LLM-as-Judge rubric for solicitation-create. Grades whether
-  the published solicitation faithfully reflects the PDD's intervention
-  scope, has complete fields, and ships a sensible deadline. Calibrated
-  per skills/eval-calibration once 3+ real solicitations have shipped.
----
-
-# Solicitation Create — Eval
-
-Cross-artifact LLM-as-Judge eval. Reads the source PDD plus
-`solicitation/draft.md` and `solicitation/published.md`, scores the
-result, and writes a verdict YAML in the shared QA/eval shape so
-`opp-eval` can aggregate it.
-
-**Status:** Provisional. Calibration TBD until 3+ real solicitations have
-shipped — see `skills/eval-calibration/SKILL.md`.
-
-## Inputs
-
-- `ACE/<opp-name>/inputs/pdd.md`
-- `ACE/<opp-name>/solicitation/draft.md`
-- `ACE/<opp-name>/solicitation/published.md`
-
-## Rubric
-
-Score each dimension 0-10. Hard-deduct rules listed inline.
-
-1. **PDD-fidelity (weight 0.4).** Does the solicitation's `description`
-   and `scope_of_work` actually carry the PDD's intervention summary,
-   target FLW profile, and visit structure forward? Hard-deduct -3 if
-   either field paraphrases away a PDD constraint (e.g. PDD says "weekly
-   visits" and solicitation says "regular visits"). Hard-deduct -5 if a
-   key PDD element is missing entirely.
-
-2. **Field completeness (weight 0.2).** All required fields present?
-   `evaluation_criteria` non-empty (or marked `needs-review`)?
-   `response_template` non-empty?
-
-3. **Deadline sanity (weight 0.1).** Deadline is `now + 7..30 days`. Hard-
-   deduct -5 if deadline is in the past or > 90 days out.
-
-4. **Criteria alignment (weight 0.3).** Do the evaluation criteria reflect
-   what the PDD actually cares about (e.g. archetype-specific capabilities,
-   geographic fit, language capacity)? Penalize generic criteria like
-   "demonstrate experience" when the PDD has specific archetype demands.
-
-## Verdict shape
-
-Write `verdicts/solicitation-create-<mode>.yaml` per the `lib/verdict-schema.ts`
-shape (see `skills/README.md` § QA vs Eval).
-
-```yaml
-schema_version: 1
-skill: solicitation-create
-mode: deep
-overall_score: <0-10 weighted>
-overall_verdict: pass | fail | partial
-dimensions:
-  - { name: pdd-fidelity,        score: <0-10>, weight: 0.4, notes: "..." }
-  - { name: field-completeness,  score: <0-10>, weight: 0.2, notes: "..." }
-  - { name: deadline-sanity,     score: <0-10>, weight: 0.1, notes: "..." }
-  - { name: criteria-alignment,  score: <0-10>, weight: 0.3, notes: "..." }
-hard_deduct_triggered: [ ... ]
-recommendations: [ ... ]
-```
-```
-
-- [ ] **Step 3: Run vitest to verify the manifest doesn't break**
-
-Run: `npm test -- --run test/fixtures/artifact-manifest.test.ts`
-
-Expected: PASS.
-
-- [ ] **Step 4: Commit**
-
-```bash
-git add skills/solicitation-create-eval/
-git commit -m "feat(skill): add solicitation-create-eval rubric (provisional)
-
-PDD-fidelity, field completeness, deadline sanity, criteria alignment.
-Provisional rubric — calibration TBD per skills/eval-calibration once 3+
-real solicitations have shipped.
-
-Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>"
-```
-
----
-
-### Task 9: Implement `solicitation-monitor` skill
-
-**Files:**
-- Create: `skills/solicitation-monitor/SKILL.md`
-
-- [ ] **Step 1: Write the SKILL.md**
-
-Create `skills/solicitation-monitor/SKILL.md`:
-
-```markdown
----
-name: solicitation-monitor
-description: >
-  Phase 7 recurring skill. Polls labs for new responses while the
-  solicitation is open, writes one file per response to
-  ACE/<opp>/solicitation/responses/, and appends a tick line to the
-  observation log. Three modes: --quick (count only), --monitor (full
-  pull, default), --close (final pull when deadline passes).
----
-
-# Solicitation Monitor
-
-Recurring skill that runs while `opp.yaml.solicitation.status == open`.
-Mirrors the `ocs-chatbot-qa` recurring pattern (`--quick`/`--monitor`).
-
-## Modes
-
-- **`--quick`**: just count responses; do not pull bodies. Cheap.
-  Suitable for the orchestrator's recurring check.
-- **`--monitor`** (default): for each new response, pull the body and
-  write `solicitation/responses/<response_id>.md`.
-- **`--close`**: same as `--monitor` but also flips `opp.yaml.solicitation.status`
-  from `open` to `closed`. Run once when the deadline passes.
-
-## Inputs
-
-- `opp.yaml.solicitation.solicitation_id`
-- `opp.yaml.solicitation.deadline`
-
-## Process (--monitor)
-
-1. **List responses.** Call:
-
-   ```
-   mcp__connect-labs__list_responses(solicitation_id: <id>)
-   ```
-
-2. **Diff against local state.** Read existing files in
-   `ACE/<opp-name>/solicitation/responses/` (each is named
-   `<response_id>.md`). For each new response:
-
-   ```
-   mcp__connect-labs__get_response(response_id: <id>)
-   ```
-
-   Write the body to `solicitation/responses/<response_id>.md`. Body
-   includes: response_id, submitted_at, organization, contact, the answers
-   to each question in the response template, and any attachments.
-
-3. **Summarize inflow.** Compute:
-   - Total responses received
-   - Responses received since the last monitor tick
-   - Time-to-deadline (delta between `now()` and `solicitation.deadline`)
-   - If `solicitation/invitations.md` exists: list of invitees who have
-     not yet responded.
-
-4. **Append observation.** Append a single line to
-   `ACE/<opp-name>/comms-log/observations.md`:
-
-   ```
-   <ISO-8601>  solicitation-monitor  <count> total responses (<+N> new since last tick), <H>h to deadline
-   ```
-
-5. **Update `opp.yaml`.** If mode is `--close` AND `now() > deadline`, set
-   `opp.yaml.solicitation.status: closed`.
-
-## Process (--quick)
-
-Steps 1, 3 (counts only), 4. Skip body pulls and per-response file writes.
-
-## Error handling
-
-Read-only skill from labs's perspective; failures are non-fatal.
-Log "monitor failed: <reason>" to `comms-log/observations.md` and exit
-without halting the orchestrator. Next tick will retry.
-
-## Output
-
-- New files in `ACE/<opp-name>/solicitation/responses/`
-- Tick line in `ACE/<opp-name>/comms-log/observations.md`
-- (`--close` only) `opp.yaml.solicitation.status: closed`
-
-## No eval companion
-
-`solicitation-monitor` is read-only and recurring. Quality bar is captured
-by `solicitation-review-eval` downstream.
-```
-
-- [ ] **Step 2: Run the manifest validation**
-
-Run: `npm test -- --run test/fixtures/artifact-manifest.test.ts`
-
-Expected: PASS.
-
-- [ ] **Step 3: Commit**
-
-```bash
-git add skills/solicitation-monitor/
-git commit -m "feat(skill): add solicitation-monitor (Phase 7 recurring)
-
-Polls labs for responses, writes one file per response, appends a tick
-line to comms-log/observations.md. Three modes: --quick, --monitor
-(default), --close (flip status to closed when deadline passes).
-
-Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>"
-```
-
----
-
-### Task 10: Implement `solicitation-review` skill (manual, with HITL gate)
-
-**Files:**
-- Create: `skills/solicitation-review/SKILL.md`
-
-- [ ] **Step 1: Write the SKILL.md**
-
-Create `skills/solicitation-review/SKILL.md`:
-
-```markdown
----
-name: solicitation-review
-description: >
-  Phase 7 manual skill. Reads all solicitation responses, scores each
-  against the published rubric, presents a recommendation to the human,
-  and (after explicit HITL approval) calls award_response and populates
-  opp.yaml.selected_llo. The only path that unblocks Phase 8.
----
-
-# Solicitation Review
-
-Manual skill — never runs in default `/ace:run`. Only via:
-
-```
-/ace:step solicitation-review --opp <opp-name>
-```
-
-This is the only skill that calls `award_response` (irreversible) and the
-only skill that populates `opp.yaml.selected_llo` (which gates Phase 8).
-
-## Inputs
-
-- `opp.yaml.solicitation.solicitation_id`
-- `opp.yaml.solicitation.public_url`
-- `ACE/<opp-name>/solicitation/published.md` (rubric)
-- `ACE/<opp-name>/solicitation/responses/*.md` (all responses)
-
-## Process
-
-1. **Pull all responses fresh.** Call:
-
-   ```
-   mcp__connect-labs__list_responses(solicitation_id: <id>)
-   ```
-
-   For each response, call `get_response` even if the local cache exists
-   (responses may have been edited).
-
-2. **Score each response.** Read the rubric from `published.md` (the
-   `evaluation_criteria` block). For each response, score every criterion
-   on its declared scale (typically 1-10) and compute a weighted total.
-
-3. **Optionally write to labs.** For each response, call:
-
-   ```
-   mcp__connect-labs__create_review(
-     response_id: <id>,
-     scores: { <criterion_id>: <score>, ... },
-     notes: "<reasoning>"
-   )
-   ```
-
-   This puts ACE's scores in the labs audit trail. Idempotent — call
-   `list_reviews` first and skip if a review by `ace@dimagi-ai.com` already
-   exists for this response.
-
-4. **Write `scoring-rubric.md`.** Save the per-response, per-criterion
-   scores to:
-
-   ```
-   ACE/<opp-name>/solicitation/review/scoring-rubric.md
-   ```
-
-5. **Write `recommendation.md`.** Save:
-
-   ```
-   ACE/<opp-name>/solicitation/review/recommendation.md
-   ```
-
-   Body: ranked list of candidates with reasoning. Top candidate gets a
-   `Recommended awardee` callout.
-
-6. **HITL gate.** Present `recommendation.md` to the human and ask:
-
-   > "Confirm awarding response_id=<top> ($<amount>) to <org_name>? Reply
-   > 'award <response_id> $<amount>' to confirm, or 'cancel' to halt."
-
-   Wait for an explicit reply. **Do not call `award_response` without one.**
-   If the human picks a different response_id or amount, use those.
-
-7. **Call `award_response`.** On confirm:
-
-   ```
-   mcp__connect-labs__award_response(
-     response_id: <chosen>,
-     amount: <chosen_amount>
-   )
-   ```
-
-8. **Write `award-record.md`.**
-
-   ```
-   ACE/<opp-name>/solicitation/award-record.md
-   ```
-
-   Body: `response_id`, `awarded_at`, `awarded_org_slug`, `awarded_org_name`,
-   `awarded_contact_email`, `award_amount`, and (if labs returned an error)
-   `status: failed` + the error envelope.
-
-9. **Populate `opp.yaml.selected_llo`.** Only on a successful award:
-
-   ```yaml
-   selected_llo:
-     org_slug: <returned>
-     contact_email: <returned>
-     source: solicitation
-     response_id: <chosen>
-   ```
-
-   Also flip `opp.yaml.solicitation.status: awarded` and populate the
-   `solicitation.awarded.*` block.
-
-## Error handling
-
-- **HITL gate timeout / no reply**: do not call `award_response`. Do not
-  mutate `opp.yaml`. Exit cleanly so the human can re-run the skill.
-- **`award_response` returns 4xx after approval**: write `award-record.md`
-  with `status: failed` and the error envelope. **Do not** populate
-  `selected_llo` (Phase 8 stays gated). Surface the error to the human
-  and suggest contacting a labs admin if the award call must succeed
-  out-of-band.
-- **`list_reviews` shows ACE already reviewed all responses**: skip the
-  scoring step (we don't re-score), proceed to step 4 from the existing
-  reviews.
-
-## Output
-
-- `ACE/<opp-name>/solicitation/review/scoring-rubric.md`
-- `ACE/<opp-name>/solicitation/review/recommendation.md`
-- `ACE/<opp-name>/solicitation/award-record.md`
-- `opp.yaml.selected_llo.*` populated (only on success)
-- `opp.yaml.solicitation.status: awarded` (only on success)
-```
-
-- [ ] **Step 2: Commit**
-
-```bash
-git add skills/solicitation-review/
-git commit -m "feat(skill): add solicitation-review (Phase 7 manual, HITL-gated)
-
-Scores all responses against the published rubric, presents a
-recommendation, and (after explicit human approval) calls award_response
-and populates opp.yaml.selected_llo. The only path that unblocks Phase 8.
-
-The award call is gated on a literal 'award <response_id> \$<amount>'
-reply from the human — no auto-award. Never runs in default /ace:run.
-
-Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>"
-```
-
----
-
-### Task 11: Add `solicitation-review-eval` rubric
-
-**Files:**
-- Create: `skills/solicitation-review-eval/SKILL.md`
-
-- [ ] **Step 1: Write the SKILL.md**
-
-Create `skills/solicitation-review-eval/SKILL.md`:
-
-```markdown
----
-name: solicitation-review-eval
-description: >
-  Provisional LLM-as-Judge rubric for solicitation-review. Compares ACE's
-  top-ranked recommendation against the human's actual award decision.
-  Detection-rate metric: did ACE's recommended awardee match the human's
-  pick? Calibrated per skills/eval-calibration once 3+ awards have shipped.
----
-
-# Solicitation Review — Eval
-
-Cross-artifact LLM-as-Judge eval. Compares ACE's recommendation in
-`solicitation/review/recommendation.md` against the actual outcome
-in `solicitation/award-record.md`.
-
-**Status:** Provisional. Calibration TBD until 3+ real awards have shipped.
-
-## Inputs
-
-- `ACE/<opp-name>/solicitation/review/scoring-rubric.md`
-- `ACE/<opp-name>/solicitation/review/recommendation.md`
-- `ACE/<opp-name>/solicitation/award-record.md`
-- `ACE/<opp-name>/solicitation/published.md` (rubric reference)
-
-## Rubric
-
-1. **Recommendation alignment (weight 0.4).** Did ACE's top-ranked
-   recommendation match the awarded response_id? Score 10 if yes, 5 if
-   awardee was in ACE's top 3, 0 otherwise. Hard-deduct -3 if
-   `award-record.md` has `status: failed` while `selected_llo` is populated
-   (data-integrity violation — that path should be impossible per the
-   skill's contract, and any verdict must flag it).
-
-2. **Scoring rationale quality (weight 0.3).** Are the scores in
-   `scoring-rubric.md` traceable to the criteria in `published.md`? Are
-   the per-criterion notes specific or generic? Penalize one-line "good
-   experience" justifications.
-
-3. **Recommendation specificity (weight 0.2).** Does `recommendation.md`
-   surface concrete differentiators between candidates, or is it a
-   ranked list with no narrative? Higher score for surfacing the close
-   calls.
-
-4. **Edge case coverage (weight 0.1).** Did the recommendation flag any
-   responses that were structurally unscoreable (incomplete answers,
-   wrong-archetype)? Penalize silent skipping.
-
-## Verdict shape
-
-Write `verdicts/solicitation-review-<mode>.yaml` per `lib/verdict-schema.ts`.
-```
-
-- [ ] **Step 2: Commit**
-
-```bash
-git add skills/solicitation-review-eval/
-git commit -m "feat(skill): add solicitation-review-eval rubric (provisional)
-
-Detection-rate metric (recommendation alignment with actual award) plus
-scoring rationale, specificity, and edge-case coverage. Calibration TBD
-until 3+ real awards have shipped.
-
-Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>"
-```
-
----
-
-### Task 12: Transform `llo-invite` skill (move to Phase 7, rewrite behavior)
-
-**Files:**
-- Modify: `skills/llo-invite/SKILL.md` (substantial rewrite)
-
-- [ ] **Step 1: Replace the SKILL.md content**
-
-Replace the entire content of `skills/llo-invite/SKILL.md` with:
-
-```markdown
----
-name: llo-invite
-description: >
-  Phase 7 step 2 (auto, default run). For each PDD-named candidate LLO,
-  send an invitation email with the public solicitation URL. No-op when
-  the PDD has no preferred_llos (long-term solicitation flow). Makes no
-  Connect API calls — those happen for the awardee only, in
-  llo-onboarding (Phase 8).
----
-
-# LLO Invite
-
-Phase 7 default-run skill. Runs after `solicitation-create` has captured
-`opp.yaml.solicitation.public_url`. Sends each PDD-named candidate LLO an
-email containing the solicitation URL, deadline, and a scope summary.
-
-This skill replaces the previous Phase-7 (was Phase-6) `llo-invite` that
-prepared a Connect-side invite roster. The Connect program-level invite
-(`connect_send_llo_invite`) is now `llo-onboarding`'s responsibility and
-fires only for the awardee.
-
-## Inputs
-
-- `ACE/<opp-name>/inputs/pdd.md` (specifically `preferred_llos:`)
-- `opp.yaml.solicitation.public_url`
-- `opp.yaml.solicitation.deadline`
-
-## Process
-
-1. **Read `preferred_llos`** from the PDD.
-
-2. **If empty:** write `ACE/<opp-name>/solicitation/invitations.md`:
-
-   ```markdown
-   # Solicitation Invitations
-
-   Status: empty (long-term solicitation flow — no PDD-named candidates).
-   The solicitation is publicly listed at <public_url>; orgs find it on the
-   labs portal.
-   ```
-
-   Exit successfully.
-
-3. **For each preferred LLO**, compose an email:
-
-   ```
-   Subject: Invitation to respond — <pdd.title>
-   To: <preferred_llo.contact_email>
-
-   Hi <name>,
-
-   <Dimagi greeting + program summary, 2-3 sentences from PDD>
-
-   We are inviting your organization to respond to a solicitation for
-   <pdd.title>. The full description, scope of work, and response template
-   are at:
-
-       <opp.yaml.solicitation.public_url>
-
-   Responses are due by <opp.yaml.solicitation.deadline> (UTC).
-
-   To respond, sign into labs.connect.dimagi.com with your organization
-   account, open the solicitation linked above, and click "Submit Response."
-
-   Questions? Reply to this email.
-
-   <ace signature block>
-   ```
-
-   Send via the `email-communicator` skill (uses ACE's Gmail account
-   `ace@dimagi-ai.com`).
-
-4. **Log every send** to `ACE/<opp-name>/solicitation/invitations.md`:
-
-   ```markdown
-   # Solicitation Invitations
-
-   Solicitation: <public_url>
-   Deadline: <deadline>
-
-   ## Recipients
-
-   | Recipient | Org | Sent at | Status |
-   |---|---|---|---|
-   | <name>    | <org> | <ISO>  | sent |
-   | <name>    | <org> | <ISO>  | failed: <reason> |
-   ```
-
-## Review-mode gate
-
-If invoked under `/ace:run --review` mode, present the prepared email list
-to the human before sending and pause. Default mode sends without a gate
-(the orchestrator's gate is the Phase 7→8 boundary, not here).
-
-## Error handling
-
-- Per-recipient email failure: log `status: failed: <reason>` for that
-  row, continue with the rest.
-- All recipients fail: halt with a surfaced error.
-- PDD has no `preferred_llos`: no-op per Step 2 above.
-- `opp.yaml.solicitation.public_url` empty: halt with "run
-  solicitation-create first" message.
-
-## Output
-
-- `ACE/<opp-name>/solicitation/invitations.md` — recipient log
-```
-
-- [ ] **Step 2: Verify the manifest now matches**
-
-Run: `npm test -- --run test/fixtures/artifact-manifest.test.ts`
-
-Expected: PASS. The `solicitation/invitations.md` entry from Task 7 lists `producedBy: 'llo-invite'`, which now matches.
-
-- [ ] **Step 3: Commit**
-
-```bash
-git add skills/llo-invite/
-git commit -m "refactor(skill): llo-invite — move to Phase 7, rewrite for solicitations
-
-Previously Phase 7 (was Phase 8 after renumbering): identified PDD-named
-candidates and prepared a Connect-side invite roster. Now Phase 7: same
-candidate identification, but emails each one a link to the public
-solicitation URL. Makes no Connect API calls — the Connect program-level
-invite fires only for the awardee inside llo-onboarding.
-
-Empty PDD preferred_llos → no-op (long-term flow: solicitation is public,
-orgs find it via the labs portal).
-
-Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>"
-```
-
----
-
-### Task 13: Update `llo-onboarding` to read `selected_llo`
-
-**Files:**
-- Modify: `skills/llo-onboarding/SKILL.md`
-
-- [ ] **Step 1: Read the current SKILL.md**
-
-Run: `cat skills/llo-onboarding/SKILL.md`
-
-Note where it currently reads from `connect-setup/invites.md` (the old roster). The change replaces that with reading `opp.yaml.selected_llo`.
-
-- [ ] **Step 2: Edit the inputs and process**
-
-In `skills/llo-onboarding/SKILL.md`:
-
-(a) In the Inputs section, replace any reference to `connect-setup/invites.md` with:
-
-```
-- `opp.yaml.selected_llo` — populated by Phase 7 solicitation-review on award.
-  Halt with a clear error if `org_slug` is null (Phase 8 must not start
-  without an awardee).
-```
-
-(b) In Process step 1 (or wherever the roster gets read), replace the roster-loading logic with:
-
-```
-1. Read `opp.yaml.selected_llo`. If `org_slug` is null:
-   ```
-   FATAL: Phase 8 cannot start — opp.yaml.selected_llo.org_slug is empty.
-   Run `/ace:step solicitation-review --opp <opp-name>` to score responses
-   and award an awardee. The orchestrator's pre-Phase-7 gate should have
-   caught this; if you're seeing this from a manual /ace:step invocation,
-   the gate was bypassed.
-   ```
-   Halt.
-2. Use `selected_llo.org_slug` as the target for `connect_send_llo_invite`
-   and `selected_llo.contact_email` as the recipient for the ACE
-   onboarding email.
-```
-
-(c) Drop any prose that talks about iterating a multi-LLO roster — Phase 8 onboards exactly one awardee.
-
-- [ ] **Step 3: Smoke a fixture**
-
-Run: `npm test -- --run test/fixtures/`
-
-Expected: PASS. Fixtures don't currently set `selected_llo`, but the SKILL.md change is prose; tests don't execute the skill.
-
-- [ ] **Step 4: Commit**
-
-```bash
-git add skills/llo-onboarding/
-git commit -m "refactor(skill): llo-onboarding reads opp.yaml.selected_llo
-
-Replaces the connect-setup/invites.md roster read with a single
-selected_llo lookup populated by Phase 7 solicitation-review. Fails fast
-with an actionable message if Phase 8 is reached without an awardee.
-
-The Connect program-level invite (connect_send_llo_invite) and the ACE
-onboarding email both target selected_llo.org_slug /
-selected_llo.contact_email.
-
-Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>"
-```
-
----
-
-### Task 14: Add the `solicitation-management` subagent
-
-**Files:**
-- Create: `agents/solicitation-management.md`
-
-- [ ] **Step 1: Read an existing subagent for shape**
-
-Run: `cat agents/closeout.md` (it's the simplest subagent in the codebase).
-
-- [ ] **Step 2: Write the new agent**
-
-Create `agents/solicitation-management.md`:
-
-```markdown
----
-name: solicitation-management
-description: >
-  Phase 7 of the CRISPR-Connect lifecycle: publish a solicitation derived
-  from the PDD, invite PDD-named candidate LLOs to it by email, and stop.
-  The review-and-award lifecycle continues via the manually-invoked
-  solicitation-review skill (gated on a human-in-the-loop checkpoint
-  before award_response is called). Phase 8 starts once an awardee is
-  recorded in opp.yaml.selected_llo.
-model: inherit
-phase: solicitation-management
-phase_display: Solicitation Management
-phase_ordinal: 6
-skills:
-  - { name: solicitation-create,  has_judge: true,  eval_skill: solicitation-create-eval }
-  - { name: llo-invite,           has_judge: false }
-recurring_skills:
-  - { name: solicitation-monitor, has_judge: false }
-manual_skills:
-  - { name: solicitation-review,  has_judge: true,  eval_skill: solicitation-review-eval }
----
-
-# Solicitation Management Agent (Phase 7)
-
-You run the solicitation phase of a CRISPR-Connect opportunity. By the
-time this phase starts, Phases 1–5 have produced an approved PDD,
-deployed CommCare apps, a configured Connect opportunity, a quality-gated
-OCS chatbot, and per-opp training materials. The opportunity is fully
-prepared on the ACE side — what's missing is an LLO to run it.
-
-This phase publishes a solicitation that potential LLOs can respond to.
-In default `/ace:run` mode, you publish the solicitation and email the
-PDD-named candidate LLOs (if any), then stop. The review-and-award
-lifecycle requires explicit human approval and is run manually via
-`/ace:step solicitation-review`.
-
-## Workflow (default run)
-
-### Step 1: Solicitation Create
-
-Run the `solicitation-create` skill. It translates the PDD into a
-solicitation payload, derives evaluation criteria via labs's
-`generate_criteria` endpoint, and publishes the solicitation via the
-`connect-labs` MCP. Captures `solicitation_id` and `public_url` into
-`opp.yaml.solicitation`.
-
-- Input: approved PDD, opp.yaml (program_id, total_budget)
-- Output: `solicitation/published.md`, `opp.yaml.solicitation` populated
-- Eval (unless `--no-evals`): `solicitation-create-eval`
-
-### Step 2: LLO Invite
-
-Run the `llo-invite` skill. For each PDD-named candidate LLO, send an
-invitation email pointing at the public solicitation URL.
-
-- Input: PDD `preferred_llos`, `opp.yaml.solicitation.public_url`
-- Output: `solicitation/invitations.md`
-- No-op when PDD has no `preferred_llos` (long-term solicitation flow).
-
-### Recurring: Solicitation Monitor
-
-While `opp.yaml.solicitation.status == open`, the orchestrator's recurring
-loop calls `solicitation-monitor` to pull new responses, write one file
-per response to `solicitation/responses/`, and append a tick line to
-`comms-log/observations.md`.
-
-This loop runs OUTSIDE the default `/ace:run` invocation (which exits
-after Step 2). It is meant to be scheduled (cron or manual `/ace:step
-solicitation-monitor`) until the deadline passes.
-
-### Manual: Solicitation Review
-
-Once the deadline has passed (or whenever a human decides to award), the
-human runs:
-
-```
-/ace:step solicitation-review --opp <opp-name>
-```
-
-This skill scores all responses, presents a recommendation, gates on
-explicit human approval, then calls `award_response` and populates
-`opp.yaml.selected_llo`. Only this skill unblocks Phase 8.
-
-## Pause-points
-
-- **End of Step 2** (default `/ace:run` exit): `/ace:run` halts here. Phase 8
-  cannot start until `solicitation-review` has populated `selected_llo`.
-- **Inside `solicitation-review`**: HITL gate before `award_response`.
-
-## Outputs at phase end (default run)
-
-- `ACE/<opp-name>/solicitation/draft.md`
-- `ACE/<opp-name>/solicitation/published.md`
-- `ACE/<opp-name>/solicitation/invitations.md`
-- `opp.yaml.solicitation.{solicitation_id, public_url, deadline, status: open}`
-- `opp.yaml.selected_llo.*` (stubbed, null until award)
-
-## Completion
-
-The phase is "complete" in the orchestrator's sense after Step 2. The
-recurring monitor and manual review are NOT part of phase completion —
-they happen post-`/ace:run` and gate Phase 8 entry.
-```
-
-- [ ] **Step 3: Commit**
-
-```bash
-git add agents/solicitation-management.md
-git commit -m "feat(agent): add solicitation-management subagent (Phase 7)
-
-Owns the new Phase 7: solicitation-create + llo-invite (auto, default
-run), solicitation-monitor (recurring), solicitation-review (manual,
-HITL-gated). Default /ace:run halts at the end of llo-invite; Phase 8 is
-gated on opp.yaml.selected_llo being populated by solicitation-review.
-
-Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>"
-```
-
----
-
-### Task 15: Update `ace-orchestrator` phases block and pause-points
-
-**Files:**
-- Modify: `agents/ace-orchestrator.md`
-
-- [ ] **Step 1: Locate the `phases:` block**
-
-Run: `grep -n "^phases:" agents/ace-orchestrator.md`
-
-Read the block (lines 66-105 in the current file). Note the existing entries.
-
-- [ ] **Step 2: Rewrite the phases block**
-
-Replace the existing `phases:` block in `agents/ace-orchestrator.md` with:
-
-```yaml
-phases:
-  design-review:        # Phase 1
-    # (existing entries, unchanged)
-  commcare-setup:       # Phase 2
-    # (existing entries, unchanged)
-  connect-setup:        # Phase 3
-    # (existing entries, unchanged)
-  ocs-setup:            # Phase 4
-    # (existing entries, unchanged)
-  qa-and-training:      # Phase 5
-    # (existing entries, unchanged)
-  solicitation-management:  # Phase 7 (NEW)
-    solicitation-create: pending
-    llo-invite: pending
-    # solicitation-monitor and solicitation-review run outside /ace:run.
-  execution-management:     # Phase 8 (was llo-management, Phase 7)
-    llo-onboarding: pending
-    llo-uat: pending
-    llo-launch: pending
-  closeout:             # Phase 9 (was Phase 8)
-    # (existing entries, unchanged)
-```
-
-(Preserve the existing pending/skip values inside each unchanged phase — the example above shows only the structure, not literal replacement of inner entries. Use `Edit` with surgical replacements for each block; do not blow away the inner state.)
-
-- [ ] **Step 3: Update the pause-points list**
-
-Locate the pause-points list in `agents/ace-orchestrator.md` (around lines 274-307 — search for "Phase 5→6 transition" and "After `llo-invite`"). Replace the existing pause-points text with:
-
-```markdown
-**Pause-points:**
-- After `idea-to-pdd` (Phase 1) — PDD must be approved before building apps
-- After `app-deploy` (Phase 2) — apps must be verified before Connect setup
-- After `ocs-chatbot-eval --deep` (Phase 4) — OCS quality must clear pre-launch bar
-- **Phase 7 → 7 boundary** — `/ace:run` halts here in default mode. Phase 8
-  cannot start until `opp.yaml.selected_llo.org_slug` is populated, which
-  only happens via the manual `solicitation-review` skill. This is the new
-  external-communication boundary (Phase 8 sends the first email to the
-  awardee LLO).
-- After `solicitation-review` (Phase 7, manual) — HITL gate before
-  `award_response` is called.
-- After `llo-launch` (Phase 8) — activation verified before monitoring
-- Phase 5 → 6 is **no longer mandatory pause**. Solicitation publication
-  is passive (labs portal listing); the active-outreach boundary moves
-  to Phase 7 → 7.
-```
-
-- [ ] **Step 4: Update the agent dispatch references**
-
-Run: `grep -n "Agent(llo-manager)\|Agent('llo-manager')" agents/ace-orchestrator.md`
-
-For each match, replace with `Agent(execution-manager)` / `Agent('execution-manager')`. Then add a new dispatch reference for `solicitation-management` between Phase 5 and Phase 8 dispatch sites:
-
-```markdown
-After Phase 5 (qa-and-training) completes, dispatch Phase 7:
-
-  Agent(solicitation-management)
-
-Wait for it to return. After Phase 7 completes, the orchestrator HALTS in
-default mode. The next phase requires manual intervention
-(/ace:step solicitation-review). Resume Phase 8 only after
-opp.yaml.selected_llo.org_slug is populated:
-
-  Agent(execution-manager)
-```
-
-- [ ] **Step 5: Update prose references to phase numbering**
-
-Throughout `ace-orchestrator.md`, update prose references:
-- "Phase 5→6 transition: always pause" → moved to Phase 7→8 (covered above)
-- "Phase 7 is where LLOs first hear from ACE" → "Phase 8 is where the awardee LLO first hears from ACE; Phase 7 publishes the public solicitation but does not contact specific LLOs unless the PDD names preferred_llos."
-
-- [ ] **Step 6: Commit**
-
-```bash
-git add agents/ace-orchestrator.md
-git commit -m "feat(orchestrator): wire Phase 7 (solicitation-management)
-
-phases: block now lists solicitation-management between qa-and-training
-and execution-management. Pause-points: Phase 5→6 no longer mandatory;
-Phase 7→8 is the new external-comms boundary. Agent dispatch now calls
-Agent(solicitation-management) and Agent(execution-manager).
-
-Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>"
-```
-
----
-
-### Task 16: Update commands/run.md and commands/step.md
-
-**Files:**
-- Modify: `commands/run.md`
-- Modify: `commands/step.md`
-
-- [ ] **Step 1: Find references to llo-manager**
-
-Run: `grep -n "llo-manager\|llo_management\|llo-management" commands/`
-
-- [ ] **Step 2: Apply replacements**
-
-For each match: `llo-manager` → `execution-manager`, `llo_management` → `execution_management`, `llo-management` → `execution-management`.
-
-If `commands/step.md` documents the `/ace:step` command's valid skill list, add the four new Phase 7 skills (`solicitation-create`, `llo-invite`, `solicitation-monitor`, `solicitation-review`) and the two new eval skills.
-
-- [ ] **Step 3: Commit**
-
-```bash
-git add commands/run.md commands/step.md
-git commit -m "chore(commands): rename llo-manager → execution-manager in command docs
-
-Also adds the four new Phase 7 skills to /ace:step's documented skill list.
-
-Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>"
-```
-
----
-
-### Task 17: Add CRISPR-Test-004-Solicitation fixture
-
-**Files:**
-- Create: `test/fixtures/CRISPR-Test-004-Solicitation/inputs/pdd.md`
-- Create: `test/fixtures/CRISPR-Test-004-Solicitation/opp.yaml`
-- Modify: `test/fixtures/artifact-manifest.test.ts`
-
-- [ ] **Step 1: Read an existing fixture for shape**
-
-Run: `ls test/fixtures/CRISPR-Test-001*/`, then read its `pdd.md` and `opp.yaml` for structure.
-
-- [ ] **Step 2: Create the fixture**
-
-Create `test/fixtures/CRISPR-Test-004-Solicitation/inputs/pdd.md`:
-
-```markdown
----
-title: "FLW Outreach for Maternal Health — Niger"
-archetype: atomic-visit
-intervention_summary: >
-  CHWs visit pregnant women and new mothers monthly to provide ANC/PNC
-  guidance, basic screening, and referrals. The program targets districts
-  with low facility-delivery rates.
-target_flw_profile: >
-  Existing community-elected health volunteers, primarily women, with
-  basic literacy in Hausa or French. 6-month engagement, ~30 visits per
-  month per FLW.
-visit_structure: >
-  Single-visit data collection at each woman's home. Form covers
-  demographics, pregnancy status, danger signs screening, and referral
-  log. ~15 minutes per visit.
-success_criteria:
-  - "≥80% of pregnant women in catchment receive at least 1 ANC visit"
-  - "≥60% of identified danger-sign cases referred to facility"
-  - "FLW retention ≥85% over 6 months"
-total_budget: 75000
-
-# Solicitation fields
-solicitation_type: EOI
-solicitation_deadline_days: 21
-llo_questions:
-  - "Describe your prior experience deploying CHW programs in West Africa"
-  - "How will you recruit and train 40 FLWs across 3 districts?"
-  - "What is your timeline for fielding once awarded?"
-  - "What is your supervision model for FLW visits?"
-  - "Do you have local-language capacity (Hausa or French)?"
-  - "Provide a budget breakdown for the proposed scope"
-
-preferred_llos:
-  - { name: "Niger Health Initiative", contact_email: "ops@niger-health.example", organization_slug: "niger-health-initiative" }
-  - { name: "Sahel Maternal Care", contact_email: "info@sahel-maternal.example", organization_slug: "sahel-maternal-care" }
----
-```
-
-Create `test/fixtures/CRISPR-Test-004-Solicitation/opp.yaml`:
-
-```yaml
-display_name: "Niger Maternal Health Pilot"
-slug: niger-maternal-health-pilot
-program_id: 42
-created_at: 2026-05-04T12:00:00Z
-created_by: ace@dimagi-ai.com
-last_run_id: null
-tags: [solicitation-fixture, atomic-visit]
-```
-
-- [ ] **Step 3: Update the manifest validation test**
-
-In `test/fixtures/artifact-manifest.test.ts`, add `'CRISPR-Test-004-Solicitation'` to the list of fixtures that get walked. (Look for an array like `const FIXTURES = ['CRISPR-Test-001-...', ...]` and append.)
-
-- [ ] **Step 4: Run the test**
-
-Run: `npm test -- --run test/fixtures/`
-
-Expected: PASS for all 4 fixtures.
-
-- [ ] **Step 5: Commit**
-
-```bash
-git add test/fixtures/CRISPR-Test-004-Solicitation/ test/fixtures/artifact-manifest.test.ts
-git commit -m "test(fixture): add CRISPR-Test-004-Solicitation
-
-PDD with all three new optional solicitation fields populated, two
-preferred_llos. Used by Phase 7 skill tests and the LABS_INTEGRATION
-e2e test.
-
-Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>"
-```
-
----
-
-### Task 18: Update opp-eval to include solicitation category
-
-**Files:**
-- Modify: `skills/opp-eval/SKILL.md`
-
-- [ ] **Step 1: Read the current category list**
-
-Run: `grep -n "category\|categories" skills/opp-eval/SKILL.md`
-
-Note the existing 6 categories (likely: design, commcare, connect, ocs, operate, closeout).
-
-- [ ] **Step 2: Add the solicitation category**
-
-In `skills/opp-eval/SKILL.md`, add a new category between `connect` and `ocs` (or wherever the phase-ordering sits in the document):
-
-- Category: `solicitation`
-- Eval rubrics aggregated: `solicitation-create-eval`, `solicitation-review-eval` (when present)
-- Phase: 6
-- Coverage tier rule: full coverage requires verdicts from both rubrics; partial coverage requires `solicitation-create-eval` only.
-
-If the SKILL.md has a "category coverage tier" table that says "6 of 6 = full", update to "7 of 7 = full". The "full" threshold lifts by one.
-
-- [ ] **Step 3: Commit**
-
-```bash
-git add skills/opp-eval/
-git commit -m "feat(eval): add solicitation category to opp-eval
-
-opp-eval now aggregates verdicts from solicitation-create-eval and
-solicitation-review-eval as a 7th category. Full coverage threshold
-lifts from 6 → 7.
-
-Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>"
-```
-
----
-
-### Task 19: Add LABS_INTEGRATION end-to-end test
-
-**Files:**
-- Create: `test/mcp/connect-labs/integration/e2e.integration.test.ts`
-
-- [ ] **Step 1: Write the test**
-
-Create `test/mcp/connect-labs/integration/e2e.integration.test.ts`:
-
-```typescript
-import { describe, it, expect, beforeAll } from 'vitest';
-import { forward } from '../../../../mcp/connect-labs-server';
-
-const RUN = process.env.LABS_INTEGRATION === '1';
-const URL = process.env.LABS_MCP_URL || 'https://labs.connect.dimagi.com/mcp/';
-const TOKEN = process.env.LABS_MCP_TOKEN || '';
-
-describe.runIf(RUN)('connect-labs MCP — live integration', () => {
-  beforeAll(() => {
-    if (!TOKEN) throw new Error('LABS_MCP_TOKEN required for LABS_INTEGRATION=1');
-  });
-
-  it('lists tools (sanity)', async () => {
-    const reply = await forward(
-      { jsonrpc: '2.0', id: 1, method: 'tools/list' },
-      { token: TOKEN, url: URL },
-    );
-    expect(reply.error).toBeUndefined();
-    expect((reply.result as any)?.tools?.length).toBeGreaterThan(0);
-    const names = (reply.result as any).tools.map((t: any) => t.name);
-    expect(names).toEqual(expect.arrayContaining([
-      'list_solicitations',
-      'create_solicitation',
-      'list_responses',
-      'award_response',
-    ]));
-  });
-
-  it('list_solicitations returns at least an empty list (Connect OAuth bridge live)', async () => {
-    const reply = await forward(
-      {
-        jsonrpc: '2.0',
-        id: 2,
-        method: 'tools/call',
-        params: { name: 'list_solicitations', arguments: {} },
-      },
-      { token: TOKEN, url: URL },
-    );
-    expect(reply.error).toBeUndefined();
-    // Result is the labs-side serialized list; shape may be a JSON-encoded
-    // string or a structured array depending on the labs MCP transport.
-    expect(reply.result).toBeDefined();
-  });
-
-  it('create_solicitation → list_responses → cleanup (smoke)', async () => {
-    // Create a draft solicitation in a test program. The fixture's program_id
-    // must point at a "Solicitation Test" program in labs that has no real
-    // responders. Skip the test (don't fail) if the env doesn't provide one.
-    const programId = process.env.LABS_TEST_PROGRAM_ID;
-    if (!programId) {
-      console.warn('LABS_TEST_PROGRAM_ID unset — skipping create_solicitation smoke');
-      return;
-    }
-    const create = await forward(
-      {
-        jsonrpc: '2.0',
-        id: 3,
-        method: 'tools/call',
-        params: {
-          name: 'create_solicitation',
-          arguments: {
-            program_id: programId,
-            title: `ACE integration test ${new Date().toISOString()}`,
-            solicitation_type: 'EOI',
-            description: 'integration test — please ignore',
-            scope_of_work: 'integration test',
-            budget: 1,
-            deadline: new Date(Date.now() + 24 * 3600 * 1000).toISOString(),
-            evaluation_criteria: [{ id: 'fit', weight: 1.0, scale: 10 }],
-            response_template: ['Why are you interested?'],
-            status: 'draft',  // never publish from a test
-          },
-        },
-      },
-      { token: TOKEN, url: URL },
-    );
-    expect(create.error).toBeUndefined();
-  });
-});
-```
-
-- [ ] **Step 2: Verify the test is skipped without the env var**
-
-Run: `npm test -- --run test/mcp/connect-labs/integration/`
-
-Expected: 0 tests run (the `runIf` skips when `LABS_INTEGRATION` is unset).
-
-- [ ] **Step 3: Run the test with the env var (manual verification)**
-
-Run: `LABS_INTEGRATION=1 LABS_MCP_TOKEN=<actual> npm test -- --run test/mcp/connect-labs/integration/`
-
-Expected: 3/3 pass against live labs (only if you have a labs PAT and a `LABS_TEST_PROGRAM_ID` to target).
-
-If you don't have a PAT yet, skip this verification — the test exists for CI / future runs.
-
-- [ ] **Step 4: Commit**
-
-```bash
-git add test/mcp/connect-labs/integration/
-git commit -m "test(integration): add LABS_INTEGRATION e2e for connect-labs MCP
-
-Three checks: tools/list, list_solicitations (verifies Connect OAuth
-bridge), create_solicitation smoke (skipped without LABS_TEST_PROGRAM_ID).
-Gated like OCS_INTEGRATION — does not run in default npm test.
-
-Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>"
-```
-
----
-
-### Task 20: Update CLAUDE.md and README.md prose
-
-**Files:**
-- Modify: `CLAUDE.md`
-- Modify: `README.md`
-
-- [ ] **Step 1: Update CLAUDE.md phase order list**
-
-Locate the phase listing in `CLAUDE.md` (search for "Orchestration runs 7 phases"). Update to:
-
-> **Orchestration runs 8 phases as of <next version>.** Phase order: (1) design-review → (2) commcare-setup → (3) connect-setup → (4) ocs-setup → (5) qa-and-training → (6) solicitation-management → (7) execution-management → (8) closeout. Phase 7 (new) publishes a solicitation derived from the PDD and emails PDD-named candidate LLOs the public URL. Phase 8 (renamed from llo-management) onboards the awardee chosen by the manual solicitation-review skill.
-
-- [ ] **Step 2: Update CLAUDE.md MCP section**
-
-Add a new bullet under the existing MCP-server description block:
-
-> - `connect-labs-server.ts` → `connect-labs` (stdio proxy forwarding to `labs.connect.dimagi.com/mcp/`). 10 atoms consumed: `list/get/create/update_solicitation`, `list/get_responses`, `create_review`, `list_reviews`, `award_response`, `generate_criteria`. Source under `mcp/connect-labs-server.ts` is a thin proxy — the real catalog lives in connect-labs (`commcare_connect/mcp/tools/`). Auth: Bearer PAT in `LABS_MCP_TOKEN` (1Password). Provisioned per-machine via `op inject -i .env.tpl`.
-
-- [ ] **Step 3: Update CLAUDE.md gotchas section**
-
-Add a new bullet under "Gotchas":
-
-> - **Connect Labs MCP is HTTP, but ACE consumes it via a stdio proxy.** `mcp/connect-labs-server.ts` reads `LABS_MCP_TOKEN` from `${CLAUDE_PLUGIN_DATA}/.env` and forwards JSON-RPC frames to `labs.connect.dimagi.com/mcp/`. If the labs MCP gains first-class HTTP support in `plugin.json` later, the proxy can be removed.
-> - **`solicitation` and `selected_llo` are separate blocks in `opp.yaml`.** `solicitation` is the audit trail (URLs, deadline, status); `selected_llo` is the narrow contract Phase 8 reads. Only `solicitation-review` populates `selected_llo`. If you see `selected_llo` set without a corresponding `solicitation` block, that's a contract violation.
-
-- [ ] **Step 4: Update README.md (if present)**
-
-If `README.md` lists phases, update to the 8-phase order. If it has a "What ACE does" summary, add a sentence about Phase 7 being solicitation-driven.
-
-- [ ] **Step 5: Commit**
-
-```bash
-git add CLAUDE.md README.md
-git commit -m "docs: update CLAUDE.md + README for 8-phase order
-
-Phase 7 (Solicitation Management) added between qa-and-training and the
-renamed Execution Management. New connect-labs MCP entry, new gotchas
-on the stdio proxy + opp.yaml.solicitation/selected_llo split.
-
-Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>"
-```
-
----
-
-### Task 21: Bump version, update CHANGELOG, final smoke
-
-**Files:**
-- Modify: `VERSION`
-- Modify: `CHANGELOG.md`
-
-- [ ] **Step 1: Bump VERSION (worktree-safe)**
-
-Run: `scripts/version-bump.sh`
-
-Expected: prints something like `bumped 0.11.9 → 0.12.0` (the bumper picks `max(local, origin) + patch+1`; the actual minor-vs-patch depends on the current state). For a feature this size, manually override to a minor bump if the script picked patch:
-
-If needed, edit `VERSION` to `0.12.0` and run the pre-commit-style sync:
-
-```bash
-echo "0.12.0" > VERSION
-bash scripts/sync-version.sh
-```
-
-- [ ] **Step 2: Update CHANGELOG.md**
-
-Prepend to `CHANGELOG.md`:
-
-```markdown
-## 0.12.0 — Solicitation Management (new Phase 7)
-
-**Phase topology shifts.** Inserts Phase 7 (Solicitation Management) between
-qa-and-training and the renamed Execution Management (was llo-management,
-Phase 7). Closeout shifts to Phase 9.
-
-**New phase: Solicitation Management.**
-- Default `/ace:run` publishes a solicitation derived from the PDD via the
-  new `connect-labs` MCP, then emails PDD-named candidate LLOs the public
-  URL. `/ace:run` halts at the Phase 7→8 boundary.
-- Recurring `solicitation-monitor` polls labs for responses; runs outside
-  `/ace:run`.
-- Manual `solicitation-review` (HITL-gated) scores responses, presents a
-  recommendation, and on human approval calls `award_response` and
-  populates `opp.yaml.selected_llo`. The only path that unblocks Phase 8.
-
-**New MCP: `connect-labs`.** A thin stdio proxy at
-`mcp/connect-labs-server.ts` forwards JSON-RPC frames to
-`labs.connect.dimagi.com/mcp/` with a Bearer PAT (`LABS_MCP_TOKEN`).
-Consumes 10 atoms; no new code in `ace-connect`.
-
-**Phase 8 changes:** `llo-manager` agent renamed to `execution-manager`.
-`llo-invite` skill moved to Phase 7 with rewritten behavior (sends
-solicitation invites instead of preparing a Connect roster).
-`llo-onboarding` reads `opp.yaml.selected_llo` and fails fast if empty.
-
-**Pause-points:**
-- Phase 5→6 no longer mandatory pause.
-- Phase 7→8 is the new external-communication boundary (where `/ace:run`
-  halts in default mode).
-- HITL gate inside `solicitation-review` before `award_response`.
-
-**Doctor:** new `[Connect Labs]` section with three checks
-(env / reachable / Connect OAuth bridge).
-
-**Provisional eval rubrics:** `solicitation-create-eval`,
-`solicitation-review-eval`. Calibration TBD per `eval-calibration` once
-3+ real solicitations + awards have shipped.
-
-**No migration script.** In-flight opps finish on the old code; new opps
-use the new schema.
-```
-
-- [ ] **Step 3: Run the full test suite**
-
-Run: `npm test -- --run`
-
-Expected: full pass.
-
-- [ ] **Step 4: Run /ace:doctor smoke**
-
-Run: `bin/ace-doctor`
-
-Expected: all sections OK or WARN. The new `[Connect Labs]` section will
-likely FAIL on `connect_labs_env` until a PAT is provisioned in 1Password
-— that's expected and not a blocker for the merge. The doctor exit
-status should reflect the FAIL, but the operator will provision the PAT
-during/after the merge.
-
-- [ ] **Step 5: Commit**
-
-```bash
-git add VERSION package.json .claude-plugin/plugin.json .claude-plugin/marketplace.json CHANGELOG.md
-git commit -m "release: 0.12.0 — Solicitation Management (new Phase 7)
-
-Inserts Phase 7 (Solicitation Management) between qa-and-training and
-the renamed Execution Management (was llo-management, Phase 7). Closeout
-shifts to Phase 9. Adds the connect-labs stdio proxy MCP, four new
-skills (solicitation-create, llo-invite-rewritten, solicitation-monitor,
-solicitation-review), two provisional eval rubrics, and a new doctor
-section.
-
-See CHANGELOG.md for full details.
-
-Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>"
-```
-
----
-
-## Self-review
-
-**Spec coverage:** Walked each section of the spec —
-
-- ✅ Phase topology (renumbering): Tasks 1, 2, 14, 15
-- ✅ MCP integration (proxy, auth, doctor): Tasks 3, 4, 5
-- ✅ PDD additions: Task 6
-- ✅ Solicitation skills: Tasks 7, 9, 10
-- ✅ Eval rubrics: Tasks 8, 11
-- ✅ Transformed llo-invite: Task 12
-- ✅ Updated llo-onboarding: Task 13
-- ✅ Solicitation-management agent: Task 14
-- ✅ Orchestrator phases + pause-points: Task 15
-- ✅ Commands updates: Task 16
-- ✅ Fixture: Task 17
-- ✅ opp-eval coverage: Task 18
-- ✅ Integration test: Task 19
-- ✅ Doc updates: Task 20
-- ✅ Version + CHANGELOG: Task 21
-
-**Placeholder scan:** No "TBD", "TODO", "implement later", or
-"add appropriate error handling" without specifics. Provisional rubrics
-are explicitly flagged with calibration plans.
-
-**Type consistency:** Skill names match between manifest entries
-(Task 7), the agent's `skills:` block (Task 14), and orchestrator
-references (Task 15). Atom names (`create_solicitation`, `list_responses`,
-`award_response`, etc.) match what `connect-labs/commcare_connect/mcp/tools/solicitations.py`
-registers.
-
-**Known limitation:** The doctor wiring in Task 5 step 5 ("if `bin/ace-doctor`
-is a thin bash wrapper around `tsx`") branches on the current shape of
-`ace-doctor`. The implementation step should inspect the file first and
-match its existing convention. This is documented in the task itself.
-
----
-
-Plan complete and saved to `docs/superpowers/plans/2026-05-04-ace-solicitations-phase.md`. Two execution options:
-
-1. **Subagent-Driven (recommended)** — I dispatch a fresh subagent per task, review between tasks, fast iteration.
-
-2. **Inline Execution** — Execute tasks in this session using executing-plans, batch execution with checkpoints.
-
-Which approach?
diff --git a/docs/superpowers/plans/2026-05-04-shallow-deep-qa-split.md b/docs/superpowers/plans/2026-05-04-shallow-deep-qa-split.md
deleted file mode 100644
index 3df5a419..00000000
--- a/docs/superpowers/plans/2026-05-04-shallow-deep-qa-split.md
+++ /dev/null
@@ -1,1202 +0,0 @@
-# Shallow / Deep QA Split Implementation Plan
-
-> **For agentic workers:** REQUIRED SUB-SKILL: Use superpowers:subagent-driven-development (recommended) or superpowers:executing-plans to implement this plan task-by-task. Steps use checkbox (`- [ ]`) syntax for tracking.
-
-**Goal:** Make `/ace:run` shallow-by-default (~5 LLM judge calls vs ~90 today), introduce a manual `/ace:qa-deep` command for quality assessment, and move QA-plan generation upstream to phases that know design intent (Phase 1) and built structure (Phase 2). Add a Phase 7 gate that prevents activation without fresh deep verdicts.
-
-**Architecture:** Add two new artifact-producing skills upstream (`pdd-to-app-journeys` in Phase 1, `app-test-cases` in Phase 2) so Phase 5 can become a thin executor. Add one new eval skill (`app-ux-eval`) plus a top-level `/ace:qa-deep` command that wraps deep OCS + deep app eval. Thin OCS `--quick` to a 3-prompt × 1-dimension smoke check. Drop Phase 4's `--deep` gate. Wire the deep-verdict requirement into `llo-launch` so go-live can't ship without it. Retire `qa-plan` and `app-test` once their successors are live.
-
-**Tech Stack:** TypeScript (MCP atoms, lib/), prompt-based skills (.md files), Vitest tests, Google Drive artifact layout under `ACE/<opp>/runs/<run-id>/`.
-
-**Spec:** `docs/superpowers/specs/2026-05-04-shallow-deep-qa-split-design.md`
-
----
-
-## File Structure
-
-**New files:**
-- `skills/pdd-to-app-journeys/SKILL.md` — Phase 1 producer for `expected-journeys.md`
-- `skills/app-test-cases/SKILL.md` — Phase 2 producer for `app-test-cases.yaml`
-- `skills/app-ux-eval/SKILL.md` — deep-only LLM-as-Judge over screenshots + journeys
-- `commands/qa-deep.md` — `/ace:qa-deep <opp>` slash command
-- `templates/expected-journeys-template.md` — markdown skeleton consumed by `pdd-to-app-journeys`
-- `templates/app-test-cases-template.yaml` — yaml skeleton consumed by `app-test-cases`
-- `migrations/0.x.0-shallow-deep-qa.md` — migration notes for in-flight opps
-
-**Modified files:**
-- `skills/ocs-chatbot-qa/SKILL.md` — thin `--quick` to 3 prompts; gate calls only from `/ace:qa-deep`
-- `skills/ocs-chatbot-eval/SKILL.md` — `--quick` collapses to 1 dimension (`overall_quality`)
-- `skills/app-screenshot-capture/SKILL.md` — read `app-test-cases.yaml` instead of `qa-plan/`; add 1-question UX smoke judge
-- `skills/llo-launch/SKILL.md` — gate activation on fresh deep verdicts
-- `agents/design-review.md` — add `pdd-to-app-journeys` step
-- `agents/commcare-setup.md` — add `app-test-cases` step after Nova builds, before `app-release`
-- `agents/qa-and-training.md` — drop `qa-plan` step, point at new artifacts
-- `agents/ocs-setup.md` — drop `--deep` gate; only `--quick` runs in Phase 4
-- `agents/llo-manager.md` — note the new gate in `llo-launch`
-- `lib/artifact-manifest.ts` — add new artifacts; drop `qa-plan/` and `app-test` artifacts; update `consumedBy` lists
-- `bin/ace-doctor` — add freshness check for deep verdicts
-- `commands/run.md` — note that deep QA is no longer part of `/ace:run`
-- `VERSION` — bump
-
-**Retired files (deleted at end):**
-- `skills/qa-plan/SKILL.md` and directory
-- `skills/app-test/SKILL.md` and directory
-- `test-results/` artifact entries in manifest
-
----
-
-## Task 1: New Phase 1 skill — `pdd-to-app-journeys`
-
-**Goal:** Phase 1 emits `expected-journeys.md` describing UX intent. Nothing reads it yet (Task 3 does), so this lands cleanly without breaking anything.
-
-**Files:**
-- Create: `skills/pdd-to-app-journeys/SKILL.md`
-- Create: `templates/expected-journeys-template.md`
-- Modify: `agents/design-review.md` (add a step that dispatches the new skill)
-- Modify: `lib/artifact-manifest.ts` (add `expected-journeys.md` entry under `phase: 'design'`, `required: true`, `producedBy: 'pdd-to-app-journeys'`, `consumedBy: ['app-test-cases', 'app-ux-eval']`)
-- Test: `test/fixtures/artifact-manifest.test.ts` (existing — re-run after manifest edit)
-
-- [ ] **Step 1: Read the existing Phase 1 skill `pdd-to-test-prompts` for structure**
-
-Read: `skills/pdd-to-test-prompts/SKILL.md`. The new skill mirrors its frontmatter/process layout, including the `## Archetypes` branching.
-
-- [ ] **Step 2: Write the template**
-
-Create `templates/expected-journeys-template.md`:
-
-```markdown
-# Expected User Journeys — {{opp_name}}
-
-Derived from: pdd.md (rev {{pdd_rev_date}})
-Archetype: {{archetype}}
-
-## Persona
-
-{{persona_summary — pulled verbatim from PDD's "Target FLW" section}}
-
-## Journey 1 — {{journey_name}}
-
-**Goal:** {{one-line goal of the journey}}
-
-**Happy path narrative:**
-{{2-4 sentences describing what the FLW does, in user-outcome language —
-not field/form mechanics. Example: "FLW arrives at a household, opens
-the Deliver app, confirms the household by name and phone, completes
-the screening, photographs the MTN card, and submits. They see a
-confirmation that their visit has been recorded."}}
-
-**Edge cases (UX outcomes, not error codes):**
-- {{e.g., "FLW understands why a duplicate-household submission was
-  rejected and how to proceed"}}
-- {{e.g., "FLW understands they cannot submit without GPS"}}
-
-**Pass criteria:**
-- {{e.g., "Journey completes in <3 minutes including form fill"}}
-- {{e.g., "Required-field errors are recoverable in-form"}}
-
-## Journey 2 — {{journey_name}}
-...
-```
-
-- [ ] **Step 3: Write the skill file**
-
-Create `skills/pdd-to-app-journeys/SKILL.md`. Frontmatter:
-
-```markdown
----
-name: pdd-to-app-journeys
-description: >
-  Derive opp-specific expected user journeys from an approved PDD.
-  Output is `expected-journeys.md`, the UX-intent ground truth for
-  `app-test-cases` (Phase 2) and `app-ux-eval` (deep QA). Mirrors
-  pdd-to-test-prompts but for the apps, not the chatbot.
----
-```
-
-Body must include:
-- A `## Process` section with steps: read PDD, branch on archetype, generate journeys per persona, self-evaluate coverage, write file
-- An `## Archetypes` section that mirrors `pdd-to-test-prompts`:
-  - `atomic-visit`: 2-4 journeys covering visit-flow, eligibility-edge, data-quality-error, duplicate-handling
-  - `focus-group`: 2-4 journeys covering session-setup, recruitment-failure, consent-handling, output-coherence
-  - `multi-stage`: per-stage journeys + cross-stage transition
-- A `## Coverage rules` section requiring at least one `error_recovery`-flavored edge case per journey (so `app-ux-eval`'s rubric has signal)
-- A `## Failure modes` and `## Mode behavior` block matching `pdd-to-test-prompts`
-- A `## Change log` entry
-
-The skill writes to `ACE/<opp>/runs/<run-id>/expected-journeys.md`. (Use the run-scoped path — see `lib/run-paths.ts`.)
-
-- [ ] **Step 4: Wire the skill into `design-review` agent**
-
-Modify `agents/design-review.md`. Find the existing `pdd-to-test-prompts` dispatch step. Add a parallel step right after it (same level — Phase 1 Step 3 or 4):
-
-```markdown
-### Step <N>: Generate expected user journeys
-
-Dispatch `pdd-to-app-journeys`:
-- Reads: `pdd.md`
-- Writes: `expected-journeys.md`
-- Halts on missing/empty PDD or missing target-FLW persona section
-
-This skill is the UX-intent ground truth for downstream app QA. Phase 5
-shallow execution and `/ace:qa-deep` both read it.
-```
-
-- [ ] **Step 5: Add to artifact manifest**
-
-Modify `lib/artifact-manifest.ts`. Add this entry inside the `// ── Design phase (Phase 1) ─────────` block, alongside `test-prompts.md`:
-
-```typescript
-{
-  path: 'expected-journeys.md',
-  producedBy: 'pdd-to-app-journeys',
-  consumedBy: ['app-test-cases', 'app-ux-eval', 'app-screenshot-capture'],
-  phase: 'design',
-  required: true,
-  description: 'PDD-derived user journeys + UX edge cases. Ground truth for app-test-cases (Phase 2) and app-ux-eval (deep). Each journey carries a goal, happy-path narrative, edge cases phrased as UX outcomes, and pass criteria.',
-},
-```
-
-- [ ] **Step 6: Run manifest tests**
-
-Run: `npm test -- test/fixtures/artifact-manifest.test.ts`
-Expected: PASS. If it fails saying a fixture is missing the file, that's expected — the fixture-update lands in Task 8.
-
-(If the test enforces strict fixture coverage and fails, mark this as a known-blocker for Task 8 and proceed; do not block this task on it.)
-
-- [ ] **Step 7: Commit**
-
-```bash
-git add skills/pdd-to-app-journeys/ templates/expected-journeys-template.md \
-        agents/design-review.md lib/artifact-manifest.ts
-git commit -m "feat(phase-1): add pdd-to-app-journeys skill + expected-journeys.md artifact
-
-Mirror of pdd-to-test-prompts for the app side. Emits the UX-intent
-ground truth that app-test-cases (Phase 2) and app-ux-eval (deep) will
-consume in subsequent commits. Artifact-manifest gets the new entry
-under phase=design, required=true.
-
-Spec: docs/superpowers/specs/2026-05-04-shallow-deep-qa-split-design.md"
-```
-
----
-
-## Task 2: New Phase 2 skill — `app-test-cases`
-
-**Goal:** Phase 2 emits `app-test-cases.yaml` after Nova builds. Binds Phase 1 journeys to real built structure + Maestro recipe stubs. Doesn't replace `qa-plan` yet — `qa-plan` keeps running in Phase 5 until Task 5 retires it.
-
-**Files:**
-- Create: `skills/app-test-cases/SKILL.md`
-- Create: `templates/app-test-cases-template.yaml`
-- Modify: `agents/commcare-setup.md` (add dispatch step after Nova builds, before `app-release`)
-- Modify: `lib/artifact-manifest.ts` (add `app-test-cases.yaml` entry)
-
-- [ ] **Step 1: Read existing Phase 2 producers for context**
-
-Read in parallel:
-- `skills/pdd-to-learn-app/SKILL.md` — how Phase 2 skills read `nova_app_id` and Nova's blueprint
-- `skills/qa-plan/SKILL.md` — the recipe-composition pattern we'll inherit (Steps 2 + 3)
-- `mcp/mobile/recipes/static/connect-login.yaml` (and siblings in that directory) — the static-recipe palette
-
-- [ ] **Step 2: Write the template**
-
-Create `templates/app-test-cases-template.yaml`:
-
-```yaml
-# app-test-cases.yaml — bindings of Phase 1 journeys to Phase 2 built structure.
-# Producer: app-test-cases (Phase 2)
-# Consumers: app-screenshot-capture (Phase 5 shallow), /ace:qa-deep (manual deep)
-
-opp: {{opp_name}}
-run_id: {{run_id}}
-generated_at: {{ISO}}
-pdd_rev: {{pdd_rev_date}}
-nova_apps:
-  learn: {{learn_nova_app_id}}
-  deliver: {{deliver_nova_app_id}}
-
-# Each entry binds one Journey from expected-journeys.md to:
-#   - the actual forms/fields it exercises (real IDs, not placeholders)
-#   - a Maestro recipe filled with concrete selectors (no REPLACE_*)
-#   - the structural pass criteria (boot, no crash, submit confirmation)
-#
-# `is_smoke: true` marks the recipe Phase 5 runs in shallow mode (one
-# per app — the cheapest representative happy path).
-
-journeys:
-  - id: J1
-    name: {{journey_name from expected-journeys.md}}
-    app: deliver  # or learn
-    is_smoke: false
-    forms_exercised:
-      - {{form_id_or_name}}
-    fields_exercised:
-      - {{field_id}}
-    recipe_path: app-test-cases/recipes/J1.yaml
-    structural_pass_criteria:
-      - app_boots
-      - no_crash
-      - submission_confirmed   # or "assessment_complete" for Learn
-    pdd_time_budget_seconds: {{from PDD if specified, else null}}
-```
-
-- [ ] **Step 3: Write the skill file**
-
-Create `skills/app-test-cases/SKILL.md`:
-
-```markdown
----
-name: app-test-cases
-description: >
-  After Nova builds the Learn and Deliver apps, bind each user journey
-  from expected-journeys.md to the actual built structure, emit a
-  Maestro recipe stub per journey with real selectors (not REPLACE_*),
-  and write the consolidated app-test-cases.yaml. Phase 5 reads this
-  for shallow execution; /ace:qa-deep reads it for full execution.
-  Successor to qa-plan (which is retired in this same release).
----
-
-# App Test Cases
-
-Binds Phase 1 UX intent to Phase 2 built structure. Runs after Nova
-finishes both apps, before `app-release` — so the recipes exist when
-Phase 5 needs them.
-
-## Process
-
-### Step 1: Read inputs
-
-- `expected-journeys.md`
-- `app-summaries/learn-app-summary.md`
-- `app-summaries/deliver-app-summary.md`
-- The Nova blueprints (call `mcp__plugin_nova_nova__get_app` with each
-  app id) for real form/field IDs
-- The static-recipe library at `mcp/mobile/recipes/static/`
-
-### Step 2: For each journey, decide its app + smoke flag
-
-Map each journey from `expected-journeys.md` to either Learn or Deliver
-based on whether the journey describes assessment behavior (Learn) or
-visit/delivery behavior (Deliver). Multi-stage opps may have both.
-
-**Smoke flag rules:**
-- Exactly ONE journey per app gets `is_smoke: true`
-- The smoke journey is the simplest happy-path that exercises the
-  app's primary submission/completion flow
-- If two journeys could plausibly be the smoke, pick the one with the
-  smallest `pdd_time_budget_seconds`
-
-### Step 3: For each journey, compose the Maestro recipe
-
-Use the same composition pattern as the retired `qa-plan` skill (read
-`skills/qa-plan/SKILL.md` § Step 3 for the static-recipe palette).
-Differences:
-
-- Recipes here are journey-keyed, not module-keyed (`J1.yaml`, `J2.yaml`)
-- Each journey's recipe MUST include a final
-  `takeScreenshot: "sc-J<n>-final"` for the deep UX judge to grade
-- Validate via `mobile_validate_recipe` before writing
-
-Write recipes to `ACE/<opp>/runs/<run-id>/app-test-cases/recipes/J<n>.yaml`.
-
-### Step 4: Emit the consolidated yaml
-
-Write `ACE/<opp>/runs/<run-id>/app-test-cases.yaml` per the template
-in `templates/app-test-cases-template.yaml`.
-
-### Step 5: Self-evaluate coverage
-
-(Same shape as pdd-to-test-prompts.) Verify:
-- Every journey from `expected-journeys.md` has a binding
-- Exactly one `is_smoke: true` per app
-- Every recipe passes `mobile_validate_recipe`
-- Every `forms_exercised` entry resolves to a real Nova form ID
-
-If any check fails, halt with a `[BLOCKER]` verdict.
-
-## Mode behavior
-
-- Auto: write everything, halt on blocker
-- Review: pause to show the journey→form bindings before composing recipes
-- Dry-run: write the yaml + journey bindings; stub recipe paths; state
-  tracks as `dry-run-success`
-
-## Failure modes
-
-- expected-journeys.md missing or empty → Phase 1 hasn't completed; halt
-- Nova blueprint missing for one of the apps → Phase 2 build hasn't
-  succeeded; halt with pointer to upstream skill
-- mobile_validate_recipe rejects more than 2× per journey → escalate
-  with the validator output
-
-## MCP tools used
-
-- ace-gdrive: drive_read_file, drive_create_file, drive_create_folder
-- ace-mobile: mobile_resolve_selectors, mobile_validate_recipe
-- nova: mcp__plugin_nova_nova__get_app
-
-## Change log
-
-| Date | Change | Author |
-|------|--------|--------|
-| {{today}} | Initial version. Phase 2 producer for app-test-cases.yaml; binds expected-journeys.md to Nova-built structure with Maestro recipe stubs. Successor to qa-plan (retired in same release). | ACE team |
-```
-
-- [ ] **Step 4: Wire the skill into `commcare-setup` agent**
-
-Modify `agents/commcare-setup.md`. Find the dispatch chain that goes:
-`pdd-to-learn-app` → `pdd-to-deliver-app` → `app-deploy` → `app-release` → `app-test`.
-
-Insert `app-test-cases` between `app-deploy` and `app-release` (Nova builds are uploaded via app-deploy, so the blueprint IDs are stable by then; app-release is when we can no longer rebuild the apps cheaply, so it's also the natural cutoff for "the apps are now what they are"). Step text:
-
-```markdown
-### Step <N>: Generate app-test-cases.yaml
-
-Dispatch `app-test-cases`:
-- Reads: expected-journeys.md, both app summaries, Nova blueprints
-- Writes: app-test-cases.yaml + recipes/J*.yaml under app-test-cases/
-- Halts on missing inputs or recipe-validation failure
-
-Phase 5 shallow runs the smoke recipes; /ace:qa-deep runs them all.
-```
-
-- [ ] **Step 5: Add to artifact manifest**
-
-Modify `lib/artifact-manifest.ts`. Add inside the CommCare phase block, after `deployment-summary.md`:
-
-```typescript
-{
-  path: 'app-test-cases.yaml',
-  producedBy: 'app-test-cases',
-  consumedBy: ['app-screenshot-capture', 'app-ux-eval'],
-  phase: 'commcare',
-  required: true,
-  description: 'Bindings of expected-journeys.md to Phase-2-built app structure: per-journey form/field IDs, Maestro recipe paths, smoke flags, structural pass criteria. Phase 5 shallow uses is_smoke: true entries; /ace:qa-deep uses all entries.',
-},
-```
-
-Update consumed-by lists already in the file:
-- `expected-journeys.md` → confirm `consumedBy` includes `'app-test-cases'`
-- `app-summaries/learn-app-summary.md` → add `'app-test-cases'`
-- `app-summaries/deliver-app-summary.md` → add `'app-test-cases'`
-
-- [ ] **Step 6: Run manifest tests**
-
-Run: `npm test -- test/fixtures/artifact-manifest.test.ts`
-Expected: PASS (modulo the fixture-coverage warning carried from Task 1).
-
-- [ ] **Step 7: Commit**
-
-```bash
-git add skills/app-test-cases/ templates/app-test-cases-template.yaml \
-        agents/commcare-setup.md lib/artifact-manifest.ts
-git commit -m "feat(phase-2): add app-test-cases skill + app-test-cases.yaml artifact
-
-Phase 2 producer for the journey→build binding layer. Composes Maestro
-recipes per journey with real selectors (not REPLACE_*), marks one
-smoke recipe per app for Phase 5 shallow execution. Successor to
-qa-plan; qa-plan keeps running in Phase 5 until Task 5 swaps over.
-
-Spec: docs/superpowers/specs/2026-05-04-shallow-deep-qa-split-design.md"
-```
-
----
-
-## Task 3: New deep eval skill — `app-ux-eval`
-
-**Goal:** New LLM-as-Judge skill that grades captured screenshots against `expected-journeys.md`. Deep-only — no `--quick` mode. Used by `/ace:qa-deep` (Task 4) and the Phase 7 gate (Task 7).
-
-**Files:**
-- Create: `skills/app-ux-eval/SKILL.md`
-- Modify: `lib/artifact-manifest.ts` (add `verdicts/app-ux-eval-deep.yaml`)
-- Modify: `lib/verdict-schema.ts` (only if the existing schema doesn't already cover the dimensions; prefer reusing)
-
-- [ ] **Step 1: Read existing eval skills + verdict schema**
-
-Read in parallel:
-- `skills/ocs-chatbot-eval/SKILL.md` — uniform verdict shape, hard-deduction pattern
-- `lib/verdict-schema.ts` — confirm the schema is dimension-agnostic (it should be; it just stores `dimensions: { name: string; score: number; reason: string }[]`)
-- `lib/parse-verdict.ts` — confirm parser is generic
-
-If the schema is already generic, no schema edits are needed.
-
-- [ ] **Step 2: Write the skill file**
-
-Create `skills/app-ux-eval/SKILL.md`:
-
-```markdown
----
-name: app-ux-eval
-description: >
-  LLM-as-Judge over captured screenshots + expected-journeys.md.
-  Per-journey verdict on UX dimensions: clarity, flow_predictability,
-  error_recovery, time_budget, journey_completion. Deep-only — runs from
-  /ace:qa-deep, never from /ace:run. Writes verdicts/app-ux-eval-deep.yaml
-  in the uniform verdict shape so opp-eval can aggregate.
----
-
-# App UX Eval
-
-Grades the FLW experience of the built apps. Asks: "would this be a
-good experience for the user?" and pins each judgment to concrete
-PDD-derived ground truth (the journey's stated goal, time budget, edge
-cases) so the rubric isn't unmoored.
-
-## Process
-
-### Step 1: Read inputs
-
-- `expected-journeys.md` — ground truth
-- `app-test-cases.yaml` — journey↔recipe bindings
-- The captured screenshots from the recent execution run (look up by
-  the run id passed in)
-- `pdd.md` — for persona context (the FLW the rubric is judging "good
-  experience" against)
-
-### Step 2: For each journey, score 5 dimensions (1-3)
-
-| Dimension | What to look for | Hard deduction → fail |
-|---|---|---|
-| `clarity` | Field labels and prompts unambiguous to the persona from PDD's "Target FLW" section | Any field name only a developer would understand (e.g., `q3_v2_optional`) |
-| `flow_predictability` | Conditional branches go where FLW expects; skip patterns don't surprise | A screen appears or disappears with no apparent cause from the user's perspective |
-| `error_recovery` | Validation errors tell the FLW what's wrong and how to fix | Dead-end errors with no recovery path |
-| `time_budget` | Step count + estimated input time vs. journey's `pdd_time_budget_seconds` | Recipe step count × 5s exceeds 2× the budget |
-| `journey_completion` | Recipe accomplishes the journey's stated goal end-to-end | Recipe ends without confirmation / stuck screen |
-
-### Step 3: Aggregate
-
-- Per-journey verdict: weighted average of dimensions, hard-deduction
-  on any single dimension clamps the journey to fail
-- Phase verdict: pass = all journeys pass; fail = any journey fails,
-  with summary of which journeys failed which dimensions
-
-### Step 4: Write verdict
-
-Write `ACE/<opp>/runs/<run-id>/verdicts/app-ux-eval-deep.yaml` per the
-uniform verdict shape (see `skills/README.md § Eval verdict shape` or
-`lib/verdict-schema.ts`). Required fields:
-
-- skill: app-ux-eval
-- mode: deep
-- timestamp: ISO with timezone
-- artifact_refs: { learn_build_id, deliver_build_id } — read from
-  deployment-summary.md so the Phase 7 gate can timestamp-compare
-- dimensions: per-dimension scores + reasons
-- per_unit_verdicts: per-journey verdicts
-- overall_score, status (pass | fail), failing_units
-
-Also append a row to `eval-calibration/app-ux-eval-runs.md` so
-calibration metrics keep accumulating.
-
-## Mode behavior
-
-- Deep only. There is no `--quick`.
-
-## Failure modes
-
-- Screenshots missing for a journey marked in app-test-cases.yaml →
-  halt with a `[BLOCKER]` saying which recipe didn't run
-- expected-journeys.md missing → upstream Phase 1 or migration gap;
-  halt with pointer
-- Nova builds older than the screenshots → screenshots are stale; halt
-
-## MCP tools used
-
-- ace-gdrive: drive_read_file, drive_list_folder, drive_create_file
-- (No mobile/MCP — this is pure judging over already-captured artifacts)
-
-## Change log
-
-| Date | Change | Author |
-|------|--------|--------|
-| {{today}} | Initial version. Deep-only LLM-as-Judge for app UX. Used by /ace:qa-deep and the Phase 7 gate. | ACE team |
-```
-
-- [ ] **Step 3: Add to artifact manifest**
-
-Modify `lib/artifact-manifest.ts`. Add to the operate phase block (mirroring `verdicts/ocs-chatbot-eval-deep.yaml`):
-
-```typescript
-{
-  path: 'verdicts/app-ux-eval-deep.yaml',
-  producedBy: 'app-ux-eval',
-  consumedBy: ['llo-launch', 'opp-eval'],
-  phase: 'operate',
-  required: false,
-  description: 'Machine-readable verdict from app-ux-eval (deep). Read by llo-launch (Phase 7 activation gate) for freshness check vs. latest released CommCare build, and by opp-eval for cross-skill aggregation. Required to be fresh and passing for go-live; absent if /ace:qa-deep has not been run.',
-},
-```
-
-- [ ] **Step 4: Run manifest tests + verdict-schema tests**
-
-Run in parallel:
-- `npm test -- test/fixtures/artifact-manifest.test.ts`
-- `npm test -- test/lib/verdict-schema.test.ts` (if the path exists)
-
-Expected: PASS. The new skill produces the same shape so no test changes needed.
-
-- [ ] **Step 5: Commit**
-
-```bash
-git add skills/app-ux-eval/ lib/artifact-manifest.ts
-git commit -m "feat: add app-ux-eval skill for deep app UX grading
-
-LLM-as-Judge over captured screenshots + expected-journeys.md. Five
-dimensions (clarity, flow_predictability, error_recovery, time_budget,
-journey_completion), each with a hard-deduction rule. Deep-only —
-called from /ace:qa-deep (next task) and gated by Phase 7 in Task 7.
-
-Spec: docs/superpowers/specs/2026-05-04-shallow-deep-qa-split-design.md"
-```
-
----
-
-## Task 4: New `/ace:qa-deep <opp>` slash command
-
-**Goal:** Manual deep-QA surface. Thin wrapper that dispatches deep-mode versions of the existing OCS qa+eval pair plus the new `app-ux-eval`.
-
-**Files:**
-- Create: `commands/qa-deep.md`
-- Modify: `commands/run.md` (add a one-line "deep QA is no longer part of /ace:run; see /ace:qa-deep")
-
-- [ ] **Step 1: Read existing slash commands for the format**
-
-Read in parallel:
-- `commands/run.md`
-- `commands/step.md`
-- `commands/eval.md`
-
-Note the frontmatter schema (`description`, `argument-hint`, etc.) and how multi-arg commands handle flags.
-
-- [ ] **Step 2: Write `commands/qa-deep.md`**
-
-```markdown
----
-description: Run deep QA (OCS + apps) against an existing opportunity. Manual gate, not part of /ace:run.
-argument-hint: <opp-name> [--ocs-only | --apps-only] [--since=<verdict-id>]
----
-
-# /ace:qa-deep — Manual Deep QA
-
-Triggers a full LLM-as-Judge quality assessment of an opportunity that
-already has a successful /ace:run behind it.
-
-## Inputs read from Drive (`ACE/$1/`)
-
-- `pdd.md`, `test-prompts.md` (OCS deep ground truth)
-- `expected-journeys.md`, `app-test-cases.yaml` (app deep ground truth)
-- The published OCS chatbot's current configuration
-- The latest released CommCare builds (Learn + Deliver)
-
-## What this does
-
-Run the following dispatches in this order:
-
-### Stage A — OCS deep (skip if `--apps-only`)
-
-1. Dispatch `ocs-chatbot-qa --deep` for $1
-2. Dispatch `ocs-chatbot-eval --deep` for $1
-
-Writes:
-- qa-captures/<date>-ocs-chat-deep.md
-- verdicts/ocs-chatbot-eval-deep.yaml
-- gate-briefs/ocs-chatbot-eval-deep.md
-
-### Stage B — Apps deep (skip if `--ocs-only`)
-
-1. Read `app-test-cases.yaml` for the run.
-2. If `--since=<verdict-id>` is provided: filter to journeys whose
-   `recipe_path` mtime is newer than the prior verdict at
-   `verdicts/app-ux-eval-deep.yaml@<verdict-id>`. Otherwise run all.
-3. For each journey: call `mobile_run_recipe` against a fresh AVD,
-   capture screenshots, upload to Drive under
-   `screenshots/qa-deep/<journey-id>/`.
-4. Dispatch `app-ux-eval` to grade the captured set.
-
-Writes:
-- screenshots/qa-deep/J*/*.png
-- verdicts/app-ux-eval-deep.yaml
-- eval-calibration/app-ux-eval-runs.md (appended row)
-
-## What this does NOT do
-
-- No /ace:run side effects. No Phase 7 activation, no app rebuild, no
-  training-material regeneration.
-- No FLW invites, no LLO emails.
-
-## After completion
-
-Both verdicts go to `verdicts/*-deep.yaml`. The Phase 7 `llo-launch`
-gate reads them and refuses activation if either is missing or stale.
-
-If you ran this and want to proceed to go-live, re-enter Phase 7 via
-/ace:step llo-launch $1 (or let /ace:run resume from where it left off).
-```
-
-- [ ] **Step 3: Update `commands/run.md`**
-
-Find the "What this does" section. Add a single bullet noting the change:
-
-```markdown
-- Phase 4 (OCS) and Phase 5 (apps) run **shallow** QA only. Deep
-  quality assessment is a separate command — see /ace:qa-deep <opp>.
-  Phase 7 activation will refuse to proceed without fresh deep
-  verdicts (run /ace:qa-deep before go-live).
-```
-
-- [ ] **Step 4: Sanity-test the command file lints**
-
-Run: `npx tsx scripts/sync-version.sh --dry-run` (or whatever the repo's command-validator is — check `bin/ace-doctor` for hints).
-
-If the repo doesn't have a command linter, skip — the command is just markdown frontmatter that Claude Code parses.
-
-- [ ] **Step 5: Commit**
-
-```bash
-git add commands/qa-deep.md commands/run.md
-git commit -m "feat: add /ace:qa-deep command for manual deep quality assessment
-
-Thin wrapper that dispatches OCS deep qa+eval + new app-ux-eval. Read-
-and-grade only — no run side effects. Supports --ocs-only / --apps-only
-for surgical re-runs, --since=<verdict-id> for incremental app grading.
-
-Spec: docs/superpowers/specs/2026-05-04-shallow-deep-qa-split-design.md"
-```
-
----
-
-## Task 5: Switch Phase 5 to executor-only; retire `qa-plan`
-
-**Goal:** Phase 5 stops synthesizing test plans. `app-screenshot-capture` reads `app-test-cases.yaml`, runs only smoke recipes, and adds a thin UX judge per app. The `qa-plan` skill becomes dead code (deleted in Task 8).
-
-**Files:**
-- Modify: `skills/app-screenshot-capture/SKILL.md`
-- Modify: `agents/qa-and-training.md`
-- Modify: `lib/artifact-manifest.ts` (drop qa-plan/* artifacts; add app-ux-shallow verdict)
-
-- [ ] **Step 1: Read current Phase 5 wiring**
-
-Read in parallel:
-- `agents/qa-and-training.md`
-- `skills/app-screenshot-capture/SKILL.md`
-
-Confirm the existing dispatch order. Phase 5 should look like:
-1. `qa-plan` (will be removed)
-2. `app-screenshot-capture` (modified to read new artifact)
-3. Per-artifact training skills in parallel
-4. `training-deck-build`
-
-- [ ] **Step 2: Modify `skills/app-screenshot-capture/SKILL.md`**
-
-Edits:
-- Replace the current input list ("reads `qa-plan/test-matrix.md`,
-  `qa-plan/walkthrough-recipes/manifest.yaml`...") with:
-  - `expected-journeys.md`
-  - `app-test-cases.yaml`
-- Add a new Step labeled "Filter to smoke recipes":
-  ```markdown
-  ### Step <N>: Select smoke recipes only
-  Read `app-test-cases.yaml`. Filter `journeys[]` to entries with
-  `is_smoke: true`. There MUST be exactly two (one per app — Learn
-  and Deliver). Halt with a clear pointer to `app-test-cases` if
-  fewer or more are found.
-  ```
-- After the existing screenshot-capture loop, add a new Step labeled
-  "Thin UX smoke judge":
-  ```markdown
-  ### Step <N>: Thin UX smoke judge
-
-  For each smoke recipe (Learn + Deliver), assemble the captured
-  screenshot set into a single LLM-as-Judge call:
-
-  Prompt: "These screenshots are from a smoke run of the {{app}}
-  app. The target FLW persona (from PDD) is: {{persona_summary}}.
-  Looking at these screenshots in order, would this person be able
-  to complete the journey without confusion? Rate 0-3 + one-line
-  reason. 0 = a typical persona-matching FLW would get stuck; 3 =
-  obviously usable."
-
-  Threshold: ≥ 2/3 per app. Below → halt with verdict.
-  ```
-- Update the verdict-writing section to also write
-  `ACE/<opp>/runs/<run-id>/verdicts/app-screenshot-capture-shallow.yaml`
-  with the smoke-judge dimension.
-
-- [ ] **Step 3: Modify `agents/qa-and-training.md`**
-
-Find the `qa-plan` dispatch step. Delete it. Adjust the now-first
-step (`app-screenshot-capture`) to note that its inputs come from
-upstream phases:
-
-```markdown
-### Step 1: Capture smoke screenshots + thin UX judge
-
-Dispatch `app-screenshot-capture`:
-- Reads: expected-journeys.md (Phase 1), app-test-cases.yaml (Phase 2)
-- Writes: screenshots/J*/*.png + verdicts/app-screenshot-capture-shallow.yaml
-- Halts on smoke-recipe failure or UX judge < 2/3
-```
-
-Confirm downstream training-skill dispatches still consume
-`screenshots/manifest.yaml` (they do — `app-screenshot-capture` still
-emits it).
-
-- [ ] **Step 4: Update artifact manifest**
-
-Modify `lib/artifact-manifest.ts`:
-
-(a) Drop the `qa-plan/*` entries (test-matrix, walkthrough-recipes/*, screenshot-manifest, uat-checklist, verdicts/qa-plan.yaml).
-(b) Drop the `test-results/*` entries produced by `app-test`.
-(c) Add the new shallow verdict:
-
-```typescript
-{
-  path: 'verdicts/app-screenshot-capture-shallow.yaml',
-  producedBy: 'app-screenshot-capture',
-  consumedBy: ['opp-eval'],
-  phase: 'operate',
-  required: true,
-  description: 'Shallow smoke verdict from /ace:run Phase 5 — smoke recipe pass/fail + thin UX judge ≥ 2/3 per app. Always present after a successful /ace:run.',
-},
-```
-
-(d) Update consumed-by lists: anything that listed `qa-plan` or `app-test` as consumer/producer needs the references removed.
-
-- [ ] **Step 5: Run manifest tests**
-
-Run: `npm test -- test/fixtures/artifact-manifest.test.ts`
-Expected: PASS. Fixtures may need updating if they reference the dropped paths — handle in Task 8 if so.
-
-- [ ] **Step 6: Commit**
-
-```bash
-git add skills/app-screenshot-capture/ agents/qa-and-training.md \
-        lib/artifact-manifest.ts
-git commit -m "refactor(phase-5): executor-only — drop qa-plan synthesis
-
-app-screenshot-capture now reads expected-journeys.md (Phase 1) and
-app-test-cases.yaml (Phase 2) as inputs. Runs the two smoke recipes
-flagged is_smoke: true (one per app), captures screenshots, runs a
-single-question UX judge per app (~2 LLM calls total). Drops qa-plan
-artifacts from the manifest; the qa-plan skill itself is deleted in
-Task 8 once retirement settles.
-
-Spec: docs/superpowers/specs/2026-05-04-shallow-deep-qa-split-design.md"
-```
-
----
-
-## Task 6: Thin OCS `--quick`; drop Phase 4 deep gate
-
-**Goal:** OCS shallow (Phase 4 default) collapses to 3 prompts × 1 dimension. Deep no longer runs in Phase 4 — it lives only in `/ace:qa-deep`.
-
-**Files:**
-- Modify: `skills/ocs-chatbot-qa/SKILL.md`
-- Modify: `skills/ocs-chatbot-eval/SKILL.md`
-- Modify: `agents/ocs-setup.md`
-
-- [ ] **Step 1: Read current OCS skill files**
-
-Read:
-- `skills/ocs-chatbot-qa/SKILL.md`
-- `skills/ocs-chatbot-eval/SKILL.md`
-- `agents/ocs-setup.md`
-
-Find the section in each that defines `--quick` behavior.
-
-- [ ] **Step 2: Thin `ocs-chatbot-qa` `--quick`**
-
-Modify `skills/ocs-chatbot-qa/SKILL.md`:
-- In the `--quick` mode section, change "5 smoke prompts" to "3 smoke prompts" (universal Connect-domain questions: 1 about claiming an opp, 1 about syncing data, 1 about getting paid)
-- Tighten the timeout: total cap = 90s × 3 = 270s
-- Note in the change log: "Thinned from 5 to 3 prompts (0.x.0). Phase 4 cost reduction; multi-dimensional judging moves to deep-only."
-
-- [ ] **Step 3: Thin `ocs-chatbot-eval` `--quick`**
-
-Modify `skills/ocs-chatbot-eval/SKILL.md`:
-- In the `--quick` mode section, replace the 5-dimension grading rubric with a single-dimension `overall_quality_0_to_3`
-- Pass criterion: every prompt's `overall_quality` ≥ 2/3
-- Verdict path stays `verdicts/ocs-chatbot-eval-quick.yaml` but the
-  dimensions array now has 1 entry
-- Note in the change log
-
-- [ ] **Step 4: Drop the `--deep` gate from `agents/ocs-setup.md`**
-
-In `agents/ocs-setup.md`, find the "Step 3: Deep eval" section (or
-equivalent). Delete the entire deep-eval step. Adjust step numbering
-in the rest of the agent.
-
-In the Phase 4 gate-brief section, change "deep verdict" references
-to "quick verdict" — Phase 4 → 5 only requires the quick gate now.
-
-Add a paragraph at the end of the agent's overview:
-
-```markdown
-**Note:** Deep OCS evaluation moved out of Phase 4 in 0.x.0. Run
-/ace:qa-deep <opp> after /ace:run completes to grade chatbot quality
-before go-live. The Phase 7 llo-launch gate refuses to proceed
-without a fresh, passing deep verdict.
-```
-
-- [ ] **Step 5: Run OCS-related tests**
-
-Run in parallel:
-- `npm test -- test/mcp/ocs/` (unit tests, no live OCS)
-- `npm test -- test/fixtures/artifact-manifest.test.ts`
-
-Expected: PASS. (Integration tests OCS_INTEGRATION=1 are out of scope here — those exercise live OCS and are a separate CI concern.)
-
-- [ ] **Step 6: Commit**
-
-```bash
-git add skills/ocs-chatbot-qa/ skills/ocs-chatbot-eval/ agents/ocs-setup.md
-git commit -m "refactor(phase-4): thin --quick to 3 prompts × 1 dim; drop Phase 4 deep gate
-
-OCS shallow collapses from 5 prompts × 5 dims (~25 calls) to 3 prompts
-× 1 dim (overall_quality_0_to_3, 3 calls). Phase 4 → 5 gate is now
-quick-pass-only. Deep OCS eval moves entirely to /ace:qa-deep.
-
-Spec: docs/superpowers/specs/2026-05-04-shallow-deep-qa-split-design.md"
-```
-
----
-
-## Task 7: Wire Phase 7 deep-verdict gate
-
-**Goal:** `llo-launch` reads both deep verdicts before activation. Refuses if missing, stale, or failing. Adds an override flag with audit trail.
-
-**Files:**
-- Modify: `skills/llo-launch/SKILL.md`
-- Modify: `bin/ace-doctor` (add a freshness check that mirrors the gate)
-- Modify: `agents/llo-manager.md` (note the new gate behavior)
-
-- [ ] **Step 1: Read current `llo-launch`**
-
-Read: `skills/llo-launch/SKILL.md`. Identify the step that calls
-`connect_activate_opportunity`.
-
-- [ ] **Step 2: Add the gate to `llo-launch`**
-
-Insert a new step **immediately before** the activation call:
-
-```markdown
-### Step <N>: Verify deep-QA verdicts before activation
-
-Read these two files from `ACE/<opp>/runs/<run-id>/`:
-- `verdicts/ocs-chatbot-eval-deep.yaml`
-- `verdicts/app-ux-eval-deep.yaml`
-
-For each verdict, require:
-
-1. File exists.
-2. `status: pass`.
-3. Verdict timestamp is newer than the relevant artifact:
-   - OCS verdict: newer than the chatbot's last `published_at`
-     (read via `ocs_get_chatbot`)
-   - App verdict: newer than the latest released CommCare build
-     timestamp (read from `deployment-summary.md`)
-
-If ANY check fails, halt with [BLOCKER]:
-
-> Deep QA verdicts missing or stale.
-> Run /ace:qa-deep <opp> before activation.
-> Missing: <list of failing checks>
-
-### Step <N+1>: Override (operator-only, audited)
-
-If the activation includes the flag `--override-deep-qa-gate=<reason>`,
-skip the gate. Required:
-- The flag must include a non-empty reason
-- /ace:run cannot pass this flag (only /ace:step llo-launch can)
-- Append to `comms-log/observations.md`:
-  > YYYY-MM-DD HH:MM TZ — Deep-QA gate overridden during activation.
-  > Reason: <reason>. Operator: <ace user>. Verdicts at time of override:
-  > <ocs-status> / <app-status>.
-
-### Step <N+2>: Activate the opportunity (existing step)
-
-(Preserve the existing connect_activate_opportunity call.)
-```
-
-Also update the skill's frontmatter description to mention the new gate.
-
-- [ ] **Step 3: Add a freshness check to `bin/ace-doctor`**
-
-Read `bin/ace-doctor`. Find the section reporting on per-opp verdicts
-(if any; if not, this becomes a new section).
-
-Add a new check `[deep-qa-freshness]` that, given an opp name:
-1. Reads `verdicts/ocs-chatbot-eval-deep.yaml` if present
-2. Reads `verdicts/app-ux-eval-deep.yaml` if present
-3. For each: compare timestamp to the artifact it grades
-4. Reports: PASS / WARN (one is missing) / FAIL (one is stale)
-
-This is advisory in doctor (WARN-level), not a blocker. The actual
-enforcement is the gate in `llo-launch`.
-
-- [ ] **Step 4: Update `agents/llo-manager.md`**
-
-Find the description of the `llo-launch` dispatch. Add a note:
-
-```markdown
-**Note:** llo-launch enforces a deep-QA-verdict freshness gate before
-activation in 0.x.0+. If /ace:qa-deep hasn't been run since the most
-recent app release / chatbot publish, llo-launch halts with a
-[BLOCKER] and the operator must run /ace:qa-deep <opp> before resuming.
-```
-
-- [ ] **Step 5: Run tests**
-
-Run: `npm test -- test/`
-Expected: PASS. Existing tests don't cover the new gate (it's prompt-side); integration coverage comes from a manual test on a fixture opp in Task 9.
-
-- [ ] **Step 6: Commit**
-
-```bash
-git add skills/llo-launch/ bin/ace-doctor agents/llo-manager.md
-git commit -m "feat(phase-6): gate llo-launch on fresh deep-QA verdicts
-
-Before connect_activate_opportunity, llo-launch reads both deep
-verdicts (OCS + apps), checks they exist + pass + are newer than the
-artifacts they grade. Halts with [BLOCKER] otherwise. Override flag
---override-deep-qa-gate=<reason> bypasses with audit trail in
-comms-log/observations.md (only available via /ace:step, not /ace:run).
-Doctor adds a WARN-level freshness check.
-
-Spec: docs/superpowers/specs/2026-05-04-shallow-deep-qa-split-design.md"
-```
-
----
-
-## Task 8: Retire `qa-plan` and `app-test`; migration script
-
-**Goal:** Delete the dead skills, update fixtures, write the migration doc, bump version.
-
-**Files:**
-- Delete: `skills/qa-plan/` (entire directory)
-- Delete: `skills/app-test/` (entire directory)
-- Create: `migrations/0.x.0-shallow-deep-qa.md`
-- Modify: `test/fixtures/...` (remove references to retired artifacts)
-- Modify: `VERSION` (bump per `scripts/version-bump.sh`)
-- Modify: `CHANGELOG.md` (entry for 0.x.0)
-
-- [ ] **Step 1: Find every reference to retired skills**
-
-Run in parallel:
-- `git grep -l 'qa-plan' -- skills/ agents/ commands/ lib/ test/`
-- `git grep -l 'app-test' -- skills/ agents/ commands/ lib/ test/ -- ':!skills/app-test-cases'`
-
-Note: `app-test-cases` (the new skill) shouldn't be matched by the
-second grep — that's why we exclude its directory.
-
-- [ ] **Step 2: Remove references**
-
-For each file that references `qa-plan` or `app-test` (the retired ones):
-- Skill markdown files: delete the line / paragraph referencing them
-- Agent markdown files: confirm they were already updated in Tasks 1, 2, 5; if not, update now
-- `lib/artifact-manifest.ts`: confirm Task 5's edits removed all entries for these skills (no `producedBy: 'qa-plan'` or `producedBy: 'app-test'` remaining)
-- Tests: update fixtures so they no longer expect `qa-plan/*` or `test-results/*` files
-
-- [ ] **Step 3: Delete the skill directories**
-
-```bash
-git rm -r skills/qa-plan/ skills/app-test/
-```
-
-- [ ] **Step 4: Write the migration doc**
-
-Create `migrations/0.x.0-shallow-deep-qa.md`:
-
-```markdown
-# Migration: 0.x.0 — Shallow / Deep QA Split
-
-**Date:** YYYY-MM-DD
-
-## What changed
-
-- New skills: `pdd-to-app-journeys` (Phase 1), `app-test-cases` (Phase 2),
-  `app-ux-eval` (deep, manual)
-- New artifacts: `expected-journeys.md`, `app-test-cases.yaml`,
-  `verdicts/app-ux-eval-deep.yaml`, `verdicts/app-screenshot-capture-shallow.yaml`
-- New command: `/ace:qa-deep <opp>`
-- Modified: OCS `--quick` thinned to 3×1 dim; `app-screenshot-capture`
-  reads new artifacts; `llo-launch` gates on deep verdicts
-- Retired: `qa-plan`, `app-test` skills + their artifacts
-
-## In-flight opportunities (mid-/ace:run when 0.x.0 lands)
-
-If an opp's run had completed Phase 1 but not Phase 2 before this update:
-- Re-run Phase 1 just for the new artifact:
-  `/ace:step pdd-to-app-journeys <opp>`
-- Resume from where Phase 2 left off
-
-If an opp had completed Phase 5 (qa-plan + app-screenshot-capture) on
-the old shape:
-- The old artifacts (qa-plan/*) remain in Drive; nothing reads them.
-  Safe to leave.
-- For deep QA, run `/ace:qa-deep <opp>` to populate the new verdicts.
-
-## Activation gate (Phase 7)
-
-Existing opps that completed Phase 5 on the old shape but have NOT yet
-been activated will hit the new deep-QA gate. Run `/ace:qa-deep <opp>`
-before `/ace:step llo-launch`. If you must bypass for emergency
-activation: `/ace:step llo-launch <opp> --override-deep-qa-gate="<reason>"`
-(reason is required; gets logged to `comms-log/observations.md`).
-
-## Cost impact
-
-- /ace:run shallow QA: ~5 LLM judge calls (was ~90)
-- /ace:qa-deep (manual, optional): ~65 OCS + per-journey app
-- Net: /ace:run cycles cheaper; deep grading is now opt-in
-
-## Rollback
-
-Revert to <commit before this PR>. The old qa-plan + app-test skills
-return; the new artifacts in Drive are ignored. No Drive data loss.
-```
-
-- [ ] **Step 5: Bump version**
-
-Run: `bash scripts/version-bump.sh`
-
-This fetches origin/main, picks `max(local, origin) + patch+1`, and
-syncs the four version files. Capture the new version (e.g., `0.x.0`).
-
-Edit `migrations/0.x.0-shallow-deep-qa.md` to replace `0.x.0` with the
-real version. Same for the change-log entries inside skill files
-(Task 1 step 7 commit body, Task 2 step 7 commit body, etc. — for the
-log table dates). If those changelog tables already have the literal
-`0.x.0`, replace via `git grep '0\.x\.0' | grep -v 0.x.0-shallow` and
-inspect.
-
-- [ ] **Step 6: Update `CHANGELOG.md`**
-
-Add a section at the top:
-
-```markdown
-## 0.x.0 — Shallow / Deep QA Split
-
-- New: /ace:qa-deep <opp> for manual deep quality assessment
-- New: pdd-to-app-journeys (Phase 1), app-test-cases (Phase 2),
-  app-ux-eval (deep) skills
-- Changed: /ace:run does shallow QA only — ~5 LLM judge calls vs ~90
-  before. Phase 7 llo-launch refuses activation without fresh deep
-  verdicts (override available with audit reason).
-- Retired: qa-plan, app-test skills (replaced by upstream producers)
-- Migration: see migrations/0.x.0-shallow-deep-qa.md
-- Spec: docs/superpowers/specs/2026-05-04-shallow-deep-qa-split-design.md
-```
-
-- [ ] **Step 7: Run the full test suite**
-
-Run: `npm test`
-
-Expected: PASS. Any fixture-coverage failures from earlier tasks should
-now be fixed (since fixtures were updated in Step 2 above).
-
-- [ ] **Step 8: Commit**
-
-```bash
-git add migrations/ CHANGELOG.md VERSION package.json \
-        .claude-plugin/plugin.json .claude-plugin/marketplace.json \
-        skills/qa-plan/ skills/app-test/ \
-        test/fixtures/
-git commit -m "chore(0.x.0): retire qa-plan + app-test, migration doc, version bump
-
-Wraps the shallow/deep QA split. qa-plan and app-test skills + their
-artifacts are removed (their jobs moved to pdd-to-app-journeys,
-app-test-cases, and app-ux-eval). Migration notes for in-flight opps
-in migrations/. Version bumped via scripts/version-bump.sh.
-
-Spec: docs/superpowers/specs/2026-05-04-shallow-deep-qa-split-design.md"
-```
-
----
-
-## Task 9: End-to-end smoke against a test fixture
-
-**Goal:** Verify the new path runs end-to-end on a fixture opp before
-shipping. Doesn't replace integration tests (those are CI-time); this
-is a one-time confidence check that the wiring works.
-
-**Files:**
-- Read: `test/fixtures/CRISPR-Test-001/...` (atomic-visit golden fixture)
-- Possibly modify: fixture files to include the new artifacts
-
-- [ ] **Step 1: Pick the smallest existing fixture**
-
-Read: `test/fixtures/` — find the CRISPR-Test-001 atomic-visit fixture.
-
-- [ ] **Step 2: Verify or backfill the fixture's new artifacts**
-
-Confirm or create:
-- `expected-journeys.md`
-- `app-test-cases.yaml` (with at least one `is_smoke: true` per app)
-- Sample `verdicts/app-ux-eval-deep.yaml` (passing)
-
-If any are missing, hand-write minimal versions matching the templates
-from Tasks 1+2.
-
-- [ ] **Step 3: Run manifest validation against the fixture**
-
-Run: `npm test -- test/fixtures/artifact-manifest.test.ts`
-Expected: PASS for the updated fixture.
-
-- [ ] **Step 4: Dry-run the new skills against the fixture**
-
-The dry-run paths from each skill write under `comms-log/dry-run-*`.
-Confirm:
-- `pdd-to-app-journeys` dry-run produces a non-empty journeys file
-- `app-test-cases` dry-run produces yaml with at least the bindings
-  (recipes can be stubbed)
-- `/ace:qa-deep` dry-run prints the planned dispatches without running them
-
-(If dry-run plumbing isn't wired in your skill body for a given step,
-that's fine — we're checking inputs/outputs, not exhaustive dry-run
-coverage.)
-
-- [ ] **Step 5: Commit fixture updates if any**
-
-```bash
-git add test/fixtures/
-git commit -m "test(fixture): add new shallow-deep-qa artifacts to CRISPR-Test-001
-
-expected-journeys.md, app-test-cases.yaml, app-ux-eval-deep verdict
-sample. Lets manifest validation pass and provides a known-good fixture
-for future regressions."
-```
-
----
-
-## Self-Review
-
-After writing this plan I checked it against the spec:
-
-**Spec coverage:**
-- §1 Artifact ownership → Tasks 1, 2, 5 (artifact moves + manifest edits)
-- §2 Skill changes (new/retired/changed) → Tasks 1–8 cover every entry
-- §3 Shallow path (OCS + apps) → Tasks 5, 6
-- §4 Deep app UX rubric → Task 3
-- §5 /ace:qa-deep → Task 4
-- §6 Phase 7 deep-verdict gate → Task 7
-- Migration / rollout → Task 8
-- Open questions (1)–(4) noted in the spec are intentionally not
-  blockers; they get iterated post-ship.
-
-**Placeholder scan:** No `TBD` / `TODO` / "implement later" / "add validation". The dimension table is fully filled in (Task 3 step 2). Each skill body has its `## Process` section spelled out. Version numbers are intentionally `0.x.0` until Task 8 step 5 resolves the actual bump.
-
-**Type consistency:**
-- Verdict file names align across tasks: `app-ux-eval-deep.yaml` (Tasks 3, 4, 7), `ocs-chatbot-eval-deep.yaml` (Tasks 4, 7), `app-screenshot-capture-shallow.yaml` (Task 5)
-- Smoke flag spelled `is_smoke: true` consistently (Tasks 2, 5)
-- Skill names: `pdd-to-app-journeys`, `app-test-cases`, `app-ux-eval` consistent across all tasks
-- Artifact paths use `runs/<run-id>/` shape consistently (matches `lib/run-paths.ts` convention)
-
-**Order dependency check:**
-- Tasks 1–4 are additive and don't break the running pipeline
-- Task 5 swaps Phase 5 over to the new artifacts — depends on Tasks 1, 2, 3 having shipped first
-- Task 6 thins OCS — independent of Tasks 1–5; can run in any order after Task 4
-- Task 7 wires Phase 7 — depends on Task 3 (verdict producer must exist) and Task 4 (gate references /ace:qa-deep in error messages)
-- Task 8 retires dead code — must come last
-- Task 9 verifies — must come last
-
-The plan is implementable end-to-end as written. Migration ordering preserves a working pipeline at every commit.
diff --git a/docs/superpowers/plans/2026-05-05-app-multimedia-coverage.md b/docs/superpowers/plans/2026-05-05-app-multimedia-coverage.md
deleted file mode 100644
index f05459fb..00000000
--- a/docs/superpowers/plans/2026-05-05-app-multimedia-coverage.md
+++ /dev/null
@@ -1,2105 +0,0 @@
-# app-multimedia-coverage Implementation Plan
-
-> **For agentic workers:** REQUIRED SUB-SKILL: Use superpowers:subagent-driven-development (recommended) or superpowers:executing-plans to implement this plan task-by-task. Steps use checkbox (`- [ ]`) syntax for tracking.
-
-**Goal:** Ship a manually-invoked ACE skill that takes Nova-built CommCare apps, uses an LLM judge to pick which fields deserve display-only images, generates them via Dimagi's Content Generator API, patches form XML and bundles assets into the CCZ, then re-builds and re-releases — closing the loop on multimedia attachment.
-
-**Architecture:** Sibling of `commcare-form-patch`. New skill `skills/app-multimedia-coverage/`, two new pure helpers (`lib/multimedia-judge.ts`, `lib/content-generator-client.ts`, `lib/multimedia-manifest.ts`, `lib/multimedia-prompt-hash.ts`, `lib/multimedia-xform-patch.ts`), one new MCP atom (`commcare_upload_multimedia`), `.env.tpl` additions for Content Generator credentials, doctor env-drift checks, a smoke fixture, and a Nova feature request filed against `voidcraft-labs/nova-plugin`. Spec at `docs/superpowers/specs/2026-05-05-app-multimedia-coverage-design.md`.
-
-**Tech Stack:** TypeScript / vitest / Anthropic SDK (Sonnet 4.6) / Playwright for CCHQ I/O / Zod for schemas / @xmldom/xmldom (already in repo for form-XML manipulation).
-
----
-
-## Task 0: Branch verification & worktree confirmation
-
-**Files:** none
-
-- [ ] **Step 1: Verify branch and clean tree**
-
-```bash
-git rev-parse --git-dir | grep -q worktrees && echo "in worktree: ok"
-git status --short    # expect empty
-git log --oneline -1  # expect the design-spec commit (638a855 or later)
-```
-
-Expected: in worktree, clean tree, the spec commit at HEAD.
-
-- [ ] **Step 2: Read the spec**
-
-Read `docs/superpowers/specs/2026-05-05-app-multimedia-coverage-design.md` end-to-end before continuing. The plan below assumes you've read it.
-
----
-
-## Task 1: Probe the Content Generator API contract
-
-**Files:**
-- Create: `scripts/probe-content-generator.ts`
-
-This is investigative; no test, no commit yet (the script gets committed alongside the client in Task 6 once the contract is documented). The goal is to discover: does the API return PNG bytes inline or a signed URL? What's the exact request body and auth header shape?
-
-- [ ] **Step 1: Pull credentials from 1Password**
-
-```bash
-op item get "Content Generator API" --vault AI-Agents --account dimagi.1password.com --format json
-```
-
-Expected: JSON containing fields like `url`, `apikey` (or `credential`, etc.). Note the exact field names — they go into `.env.tpl` in Task 3.
-
-- [ ] **Step 2: Write the probe script**
-
-```typescript
-// scripts/probe-content-generator.ts
-//
-// Probes Dimagi's Content Generator API to document the live contract.
-// Purely investigative — outputs:
-//   - Request shape that worked
-//   - Response shape (Content-Type, body size, structure)
-//   - Total wall-clock for one image
-//
-// Run: npx tsx scripts/probe-content-generator.ts
-
-import { writeFileSync } from 'node:fs';
-
-const URL = process.env.CONTENT_GENERATOR_URL!;
-const KEY = process.env.CONTENT_GENERATOR_API_KEY!;
-if (!URL || !KEY) {
-  console.error('Set CONTENT_GENERATOR_URL and CONTENT_GENERATOR_API_KEY');
-  process.exit(1);
-}
-
-const body = {
-  application_context:
-    'Frontline workers in Africa teaching mothers to care for Small Vulnerable Newborns with Kangaroo Mother Care. Modestly dressed, representative of context.',
-  form_text: 'Show the mother how to support the baby\'s head and neck while skin-to-skin.',
-  image_directives:
-    'Frontline worker assisting a mother holding a small newborn skin-to-skin against her chest, head supported, warm lighting.',
-};
-
-const t0 = Date.now();
-const res = await fetch(URL, {
-  method: 'POST',
-  headers: {
-    Authorization: `Bearer ${KEY}`,
-    'Content-Type': 'application/json',
-  },
-  body: JSON.stringify(body),
-});
-const elapsed = Date.now() - t0;
-
-console.log({ status: res.status, contentType: res.headers.get('content-type'), elapsedMs: elapsed });
-
-const buf = Buffer.from(await res.arrayBuffer());
-writeFileSync('/tmp/content-gen-probe-response.bin', buf);
-
-if (res.headers.get('content-type')?.startsWith('image/')) {
-  console.log('Response is image bytes inline. Saved to /tmp/content-gen-probe-response.bin (open to confirm).');
-} else if (res.headers.get('content-type')?.includes('json')) {
-  console.log('Response is JSON:', buf.toString('utf-8').slice(0, 500));
-} else {
-  console.log('Unexpected content type. Body bytes 0..200:', buf.slice(0, 200).toString());
-}
-```
-
-- [ ] **Step 3: Run the probe and capture findings**
-
-```bash
-export CONTENT_GENERATOR_URL=<from 1Password>
-export CONTENT_GENERATOR_API_KEY=<from 1Password>
-npx tsx scripts/probe-content-generator.ts
-```
-
-If status is non-200, iterate on auth header (`Authorization: Bearer X` vs `X-API-Key: X` vs `?api_key=`), body wrapper, etc., until 200.
-
-- [ ] **Step 4: Document the contract**
-
-Append a top-of-file comment block to `scripts/probe-content-generator.ts` documenting the live contract verbatim:
-
-```
-// LIVE CONTRACT (probed YYYY-MM-DD):
-//   Method: POST
-//   URL: <full URL>
-//   Auth: <header name>: <scheme> <value>
-//   Request body: { application_context, form_text, image_directives }
-//   Response: <Content-Type> — <inline bytes | {url: ...} | other>
-//   Wall-clock: ~Xs low-res / ~Ys upscaled
-```
-
-This block becomes the source of truth for `lib/content-generator-client.ts` in Task 6.
-
----
-
-## Task 2: Probe the CCHQ multimedia upload endpoint
-
-**Files:**
-- Create: `scripts/probe-multimedia-upload.ts`
-
-Same shape as Task 1: discover the live endpoint, document it, no commit yet.
-
-- [ ] **Step 1: Read existing CCHQ atoms for the auth pattern**
-
-Read `mcp/connect/backends/commcare.ts` lines 285–360 (the `patchXform` implementation). The probe script needs to use the same Playwright-session auth.
-
-- [ ] **Step 2: Write the probe script**
-
-```typescript
-// scripts/probe-multimedia-upload.ts
-//
-// Probes CCHQ's multimedia upload endpoint to document the live contract.
-// Uses the same authenticated Playwright session as commcare_patch_xform.
-//
-// Run: npx tsx scripts/probe-multimedia-upload.ts <hq_domain> <app_id>
-//
-// The endpoint is best-guess `/a/<domain>/apps/<app_id>/multimedia/uploaded/`;
-// CCHQ may use a different path. Iterate until 200.
-
-import { commcareClient } from '../mcp/connect/backends/commcare.js'; // adjust if needed
-import { readFileSync } from 'node:fs';
-
-const [, , domain, appId] = process.argv;
-if (!domain || !appId) {
-  console.error('Usage: npx tsx scripts/probe-multimedia-upload.ts <domain> <app_id>');
-  process.exit(1);
-}
-
-// 1x1 PNG (smallest valid PNG, ~67 bytes)
-const TINY_PNG = Buffer.from(
-  'iVBORw0KGgoAAAANSUhEUgAAAAEAAAABCAQAAAC1HAwCAAAAC0lEQVR42mNkYAAAAAYAAjCB0C8AAAAASUVORK5CYII=',
-  'base64'
-);
-
-const candidatePaths = [
-  `/a/${domain}/apps/${appId}/multimedia/uploaded/`,
-  `/a/${domain}/apps/${appId}/multimedia_upload/`,
-  `/a/${domain}/multimedia/upload_multimedia/${appId}/`,
-  `/a/${domain}/apps/multimedia/${appId}/uploaded/`,
-];
-
-const client = await commcareClient();
-for (const path of candidatePaths) {
-  const form = new FormData();
-  form.set('Filedata', new Blob([TINY_PNG], { type: 'image/png' }), 'probe.png');
-  form.set('media_type', 'image');
-  form.set('file_name', 'probe.png');
-
-  // Probe: try the path with a multipart POST through the existing session.
-  // Adjust the POST helper to match commcare.ts's actual API.
-  const res = await client.rawPost(path, form); // <-- helper to add if missing
-  console.log({ path, status: res.status, body: (await res.text()).slice(0, 300) });
-  if (res.status === 200) break;
-}
-```
-
-- [ ] **Step 3: Run against a test HQ project**
-
-Use an existing ACE smoke opp's HQ domain + app_id from a recent `2-commcare/app-deploy_summary.md`. Iterate the candidate paths and form-field names until one returns 200 with a multimedia_id-shaped response.
-
-- [ ] **Step 4: Document the contract**
-
-Top-of-file comment block in `scripts/probe-multimedia-upload.ts`:
-
-```
-// LIVE CONTRACT (probed YYYY-MM-DD against <hq_domain>):
-//   Method: POST
-//   Path: /a/<domain>/<exact-path>/<app_id>/...
-//   Content-Type: multipart/form-data
-//   Required fields: Filedata=<png-bytes>, media_type=image, file_name=<filename>
-//   Optional fields: <any others discovered>
-//   CSRF: required via X-CSRFToken header (per CCHQ standard)
-//   Auth: same Playwright session as patchXform (login_or_digest)
-//   Response: 200 application/json { multimedia_id, sha1, ... }
-//   Errors: 400 on bad media_type, 403 on csrf miss
-```
-
-This is the source of truth for `commcare_upload_multimedia` in Task 9.
-
----
-
-## Task 3: Add Content Generator credentials to `.env.tpl`
-
-**Files:**
-- Modify: `.env.tpl` (append a new section after CommCare HQ block)
-
-- [ ] **Step 1: Append new section**
-
-Append to `.env.tpl` (use the field names discovered in Task 1 step 1):
-
-```bash
-# ── Content Generator (image gen for app-multimedia-coverage) ───────
-#
-# Dimagi's internal image-generation service (Cloud Run, Gemini-3-Flash).
-# Used by the app-multimedia-coverage skill to attach display-only images
-# to CommCare app questions.
-#
-# 1Password item: "Content Generator API" in AI-Agents vault.
-
-CONTENT_GENERATOR_URL=op://AI-Agents/Content Generator API/url
-CONTENT_GENERATOR_API_KEY=op://AI-Agents/Content Generator API/credential
-```
-
-(Adjust `url` / `credential` to match the actual 1Password field names from Task 1 step 1.)
-
-- [ ] **Step 2: Regenerate local `.env`**
-
-```bash
-op inject -i .env.tpl -o "$CLAUDE_PLUGIN_DATA/.env" --account dimagi.1password.com 2>&1 | tail -5
-# Verify new vars are present:
-grep -c CONTENT_GENERATOR "$CLAUDE_PLUGIN_DATA/.env"   # expect 2
-```
-
-- [ ] **Step 3: Commit**
-
-```bash
-git add .env.tpl
-git commit -m "feat(env): add CONTENT_GENERATOR_URL / _API_KEY for multimedia skill"
-```
-
----
-
-## Task 4: `lib/multimedia-prompt-hash.ts` — content-addressed cache key
-
-**Files:**
-- Create: `lib/multimedia-prompt-hash.ts`
-- Test: `lib/multimedia-prompt-hash.test.ts`
-
-Pure function. Used to cache-skip image regeneration when inputs haven't changed.
-
-- [ ] **Step 1: Write the failing test**
-
-```typescript
-// lib/multimedia-prompt-hash.test.ts
-import { describe, it, expect } from 'vitest';
-import { promptHash } from './multimedia-prompt-hash.js';
-
-describe('promptHash', () => {
-  it('returns the same hash for identical inputs', () => {
-    const a = promptHash({ appContext: 'X', formText: 'Y', directive: 'Z' });
-    const b = promptHash({ appContext: 'X', formText: 'Y', directive: 'Z' });
-    expect(a).toBe(b);
-  });
-
-  it('returns a different hash when any field changes', () => {
-    const base = { appContext: 'X', formText: 'Y', directive: 'Z' };
-    const h = promptHash(base);
-    expect(promptHash({ ...base, appContext: 'X2' })).not.toBe(h);
-    expect(promptHash({ ...base, formText: 'Y2' })).not.toBe(h);
-    expect(promptHash({ ...base, directive: 'Z2' })).not.toBe(h);
-  });
-
-  it('is whitespace-insensitive on leading/trailing whitespace', () => {
-    const a = promptHash({ appContext: 'X', formText: 'Y', directive: 'Z' });
-    const b = promptHash({ appContext: '  X  ', formText: '\nY\n', directive: ' Z ' });
-    expect(a).toBe(b);
-  });
-
-  it('treats null/undefined directive as the same as empty string', () => {
-    const a = promptHash({ appContext: 'X', formText: 'Y', directive: '' });
-    const b = promptHash({ appContext: 'X', formText: 'Y', directive: null });
-    const c = promptHash({ appContext: 'X', formText: 'Y', directive: undefined });
-    expect(a).toBe(b);
-    expect(b).toBe(c);
-  });
-
-  it('returns a 64-char hex string (SHA-256)', () => {
-    const h = promptHash({ appContext: 'X', formText: 'Y', directive: 'Z' });
-    expect(h).toMatch(/^[0-9a-f]{64}$/);
-  });
-});
-```
-
-- [ ] **Step 2: Run test, verify it fails**
-
-```bash
-npm test -- lib/multimedia-prompt-hash.test.ts
-```
-
-Expected: FAIL — module does not exist.
-
-- [ ] **Step 3: Implement**
-
-```typescript
-// lib/multimedia-prompt-hash.ts
-import { createHash } from 'node:crypto';
-
-export interface PromptHashInput {
-  appContext: string;
-  formText: string;
-  directive: string | null | undefined;
-}
-
-export function promptHash(input: PromptHashInput): string {
-  const norm = (s: string | null | undefined) => (s ?? '').trim();
-  const payload = [norm(input.appContext), norm(input.formText), norm(input.directive)].join(' ');
-  return createHash('sha256').update(payload, 'utf-8').digest('hex');
-}
-```
-
-- [ ] **Step 4: Run tests, verify pass**
-
-```bash
-npm test -- lib/multimedia-prompt-hash.test.ts
-```
-
-Expected: PASS (all 5 tests).
-
-- [ ] **Step 5: Commit**
-
-```bash
-git add lib/multimedia-prompt-hash.ts lib/multimedia-prompt-hash.test.ts
-git commit -m "feat(lib): multimedia prompt-hash helper for cache-skip"
-```
-
----
-
-## Task 5: `lib/multimedia-manifest.ts` — Zod schema + I/O helpers
-
-**Files:**
-- Create: `lib/multimedia-manifest.ts`
-- Test: `lib/multimedia-manifest.test.ts`
-
-The manifest is the auth-only source of truth for what's been generated. Stored as YAML in Drive at `2-commcare/app-multimedia-coverage_manifest.yaml`.
-
-- [ ] **Step 1: Write the failing test**
-
-```typescript
-// lib/multimedia-manifest.test.ts
-import { describe, it, expect } from 'vitest';
-import {
-  multimediaManifestSchema,
-  parseManifest,
-  serializeManifest,
-  type MultimediaManifest,
-} from './multimedia-manifest.js';
-
-const sample: MultimediaManifest = {
-  app_context_hash: 'a'.repeat(64),
-  images: [
-    {
-      app: 'learn',
-      form_unique_id: 'f'.repeat(32),
-      field_id: 'kmc_position_demo',
-      prompt_hash: 'b'.repeat(64),
-      file_path:
-        'app-multimedia-coverage_generated/learn/ffffffffffffffffffffffffffffffff/kmc_position_demo__bbbbbbbb.png',
-      ccz_filename: 'kmc_position_demo.png',
-      cchq_multimedia_id: 'mm_123',
-      cchq_sha1: 'c'.repeat(40),
-      generated_at: '2026-05-05T20:00:00.000Z',
-    },
-  ],
-};
-
-describe('multimediaManifestSchema', () => {
-  it('accepts a well-formed manifest', () => {
-    expect(multimediaManifestSchema.parse(sample)).toEqual(sample);
-  });
-
-  it('rejects an unknown app value', () => {
-    const bad = { ...sample, images: [{ ...sample.images[0], app: 'feedback' }] };
-    expect(() => multimediaManifestSchema.parse(bad)).toThrow();
-  });
-
-  it('rejects a non-32-char form_unique_id', () => {
-    const bad = { ...sample, images: [{ ...sample.images[0], form_unique_id: 'short' }] };
-    expect(() => multimediaManifestSchema.parse(bad)).toThrow();
-  });
-
-  it('round-trips through YAML serialize/parse', () => {
-    const yaml = serializeManifest(sample);
-    expect(parseManifest(yaml)).toEqual(sample);
-  });
-});
-```
-
-- [ ] **Step 2: Run test, verify it fails**
-
-```bash
-npm test -- lib/multimedia-manifest.test.ts
-```
-
-Expected: FAIL — module does not exist.
-
-- [ ] **Step 3: Implement**
-
-```typescript
-// lib/multimedia-manifest.ts
-import { z } from 'zod';
-import { dump as yamlDump, load as yamlLoad } from 'js-yaml';
-
-export const multimediaImageSchema = z.object({
-  app: z.enum(['learn', 'deliver']),
-  form_unique_id: z.string().regex(/^[0-9a-f]{32}$/, '32-char hex'),
-  field_id: z.string().min(1),
-  prompt_hash: z.string().regex(/^[0-9a-f]{64}$/, '64-char hex SHA-256'),
-  file_path: z.string().min(1),
-  ccz_filename: z.string().min(1),
-  cchq_multimedia_id: z.string().nullable(),
-  cchq_sha1: z.string().regex(/^[0-9a-f]{40}$/).nullable(),
-  generated_at: z.string().datetime(),
-});
-
-export const multimediaManifestSchema = z.object({
-  app_context_hash: z.string().regex(/^[0-9a-f]{64}$/),
-  images: z.array(multimediaImageSchema),
-});
-
-export type MultimediaImage = z.infer<typeof multimediaImageSchema>;
-export type MultimediaManifest = z.infer<typeof multimediaManifestSchema>;
-
-export function parseManifest(yaml: string): MultimediaManifest {
-  return multimediaManifestSchema.parse(yamlLoad(yaml));
-}
-
-export function serializeManifest(m: MultimediaManifest): string {
-  multimediaManifestSchema.parse(m); // throw on invalid
-  return yamlDump(m, { noRefs: true, lineWidth: 100 });
-}
-```
-
-- [ ] **Step 4: Run tests, verify pass**
-
-```bash
-npm test -- lib/multimedia-manifest.test.ts
-```
-
-Expected: PASS.
-
-- [ ] **Step 5: Commit**
-
-```bash
-git add lib/multimedia-manifest.ts lib/multimedia-manifest.test.ts
-git commit -m "feat(lib): multimedia-manifest Zod schema + YAML I/O"
-```
-
----
-
-## Task 6: `lib/content-generator-client.ts` — typed wrapper for the API
-
-**Files:**
-- Create: `lib/content-generator-client.ts`
-- Test: `lib/content-generator-client.test.ts`
-
-Wrapper around the live contract documented in Task 1 step 4.
-
-- [ ] **Step 1: Write the failing test**
-
-```typescript
-// lib/content-generator-client.test.ts
-import { describe, it, expect, vi, beforeEach } from 'vitest';
-import { ContentGeneratorClient, ContentGeneratorAuthError } from './content-generator-client.js';
-
-const PNG_MAGIC = new Uint8Array([0x89, 0x50, 0x4e, 0x47, 0x0d, 0x0a, 0x1a, 0x0a]);
-
-describe('ContentGeneratorClient', () => {
-  let fetchMock: ReturnType<typeof vi.fn>;
-  beforeEach(() => {
-    fetchMock = vi.fn();
-    vi.stubGlobal('fetch', fetchMock);
-  });
-
-  it('returns PNG bytes on 200 image/png', async () => {
-    fetchMock.mockResolvedValueOnce(
-      new Response(PNG_MAGIC, { status: 200, headers: { 'content-type': 'image/png' } }),
-    );
-    const c = new ContentGeneratorClient({ url: 'https://x.test/gen', apiKey: 'k' });
-    const out = await c.generateImage({ applicationContext: 'A', formText: 'F', imageDirectives: 'D' });
-    expect(out.subarray(0, 8)).toEqual(Buffer.from(PNG_MAGIC));
-  });
-
-  it('sends Authorization Bearer header with the API key', async () => {
-    fetchMock.mockResolvedValueOnce(
-      new Response(PNG_MAGIC, { status: 200, headers: { 'content-type': 'image/png' } }),
-    );
-    const c = new ContentGeneratorClient({ url: 'https://x.test/gen', apiKey: 'k123' });
-    await c.generateImage({ applicationContext: 'A', formText: 'F' });
-    expect(fetchMock.mock.calls[0][1].headers.Authorization).toBe('Bearer k123');
-  });
-
-  it('retries once on 5xx then succeeds', async () => {
-    fetchMock
-      .mockResolvedValueOnce(new Response('fail', { status: 503 }))
-      .mockResolvedValueOnce(
-        new Response(PNG_MAGIC, { status: 200, headers: { 'content-type': 'image/png' } }),
-      );
-    const c = new ContentGeneratorClient({ url: 'https://x.test/gen', apiKey: 'k', retryDelayMs: 1 });
-    const out = await c.generateImage({ applicationContext: 'A', formText: 'F' });
-    expect(out.subarray(0, 4).toString('hex')).toBe('89504e47');
-    expect(fetchMock).toHaveBeenCalledTimes(2);
-  });
-
-  it('throws ContentGeneratorAuthError on 401/403', async () => {
-    fetchMock.mockResolvedValueOnce(new Response('forbidden', { status: 403 }));
-    const c = new ContentGeneratorClient({ url: 'https://x.test/gen', apiKey: 'bad', retryDelayMs: 1 });
-    await expect(c.generateImage({ applicationContext: 'A', formText: 'F' })).rejects.toBeInstanceOf(
-      ContentGeneratorAuthError,
-    );
-  });
-
-  it('does not retry on 4xx (other than 408/429)', async () => {
-    fetchMock.mockResolvedValueOnce(new Response('bad request', { status: 400 }));
-    const c = new ContentGeneratorClient({ url: 'https://x.test/gen', apiKey: 'k', retryDelayMs: 1 });
-    await expect(c.generateImage({ applicationContext: 'A', formText: 'F' })).rejects.toThrow();
-    expect(fetchMock).toHaveBeenCalledTimes(1);
-  });
-});
-```
-
-- [ ] **Step 2: Run test, verify it fails**
-
-```bash
-npm test -- lib/content-generator-client.test.ts
-```
-
-Expected: FAIL — module does not exist.
-
-- [ ] **Step 3: Implement (adapt to live contract from Task 1)**
-
-```typescript
-// lib/content-generator-client.ts
-//
-// Wrapper around Dimagi's internal Content Generator API. Live contract
-// documented in scripts/probe-content-generator.ts.
-
-export class ContentGeneratorAuthError extends Error {
-  constructor(public status: number, body: string) {
-    super(`Content Generator auth failed (${status}): ${body.slice(0, 200)}`);
-    this.name = 'ContentGeneratorAuthError';
-  }
-}
-
-export class ContentGeneratorClient {
-  constructor(
-    private opts: {
-      url: string;
-      apiKey: string;
-      timeoutMs?: number;       // default 60_000
-      retryDelayMs?: number;    // default 1_000
-    },
-  ) {}
-
-  async generateImage(input: {
-    applicationContext: string;
-    formText: string;
-    imageDirectives?: string;
-  }): Promise<Buffer> {
-    const body = {
-      application_context: input.applicationContext,
-      form_text: input.formText,
-      image_directives: input.imageDirectives ?? '',
-    };
-
-    const attempt = async (): Promise<Response> => {
-      const ac = new AbortController();
-      const t = setTimeout(() => ac.abort(), this.opts.timeoutMs ?? 60_000);
-      try {
-        return await fetch(this.opts.url, {
-          method: 'POST',
-          headers: {
-            Authorization: `Bearer ${this.opts.apiKey}`,
-            'Content-Type': 'application/json',
-          },
-          body: JSON.stringify(body),
-          signal: ac.signal,
-        });
-      } finally {
-        clearTimeout(t);
-      }
-    };
-
-    let res = await attempt();
-    if (res.status >= 500 || res.status === 408 || res.status === 429) {
-      await new Promise(r => setTimeout(r, this.opts.retryDelayMs ?? 1_000));
-      res = await attempt();
-    }
-
-    if (res.status === 401 || res.status === 403) {
-      throw new ContentGeneratorAuthError(res.status, await res.text());
-    }
-    if (res.status !== 200) {
-      throw new Error(`Content Generator HTTP ${res.status}: ${(await res.text()).slice(0, 300)}`);
-    }
-
-    const ct = res.headers.get('content-type') ?? '';
-    if (ct.startsWith('image/')) {
-      return Buffer.from(await res.arrayBuffer());
-    }
-    if (ct.includes('json')) {
-      // Live contract may return {url: signed} — fetch it inline.
-      const j = await res.json();
-      if (typeof j?.url === 'string') {
-        const r2 = await fetch(j.url);
-        if (r2.status !== 200) throw new Error(`signed URL fetch ${r2.status}`);
-        return Buffer.from(await r2.arrayBuffer());
-      }
-      throw new Error(`Content Generator JSON response had no .url: ${JSON.stringify(j).slice(0, 200)}`);
-    }
-    throw new Error(`Content Generator unexpected content-type: ${ct}`);
-  }
-}
-```
-
-- [ ] **Step 4: Run tests, verify pass**
-
-```bash
-npm test -- lib/content-generator-client.test.ts
-```
-
-Expected: PASS.
-
-- [ ] **Step 5: Commit (probe + client together)**
-
-```bash
-git add lib/content-generator-client.ts lib/content-generator-client.test.ts scripts/probe-content-generator.ts
-git commit -m "feat(lib): content-generator-client + probe script"
-```
-
----
-
-## Task 7: `lib/multimedia-judge.ts` — LLM judge for "image-worthy" fields
-
-**Files:**
-- Create: `lib/multimedia-judge.ts`
-- Test: `lib/multimedia-judge.test.ts`
-
-Single Anthropic SDK call per field. App Context goes in an `ephemeral`-cached block. Sonnet 4.6.
-
-- [ ] **Step 1: Write the failing test**
-
-```typescript
-// lib/multimedia-judge.test.ts
-import { describe, it, expect, vi } from 'vitest';
-import { judgeField, type JudgeInput } from './multimedia-judge.js';
-
-const fakeAnthropic = (responseText: string) => ({
-  messages: {
-    create: vi.fn().mockResolvedValue({
-      content: [{ type: 'text', text: responseText }],
-      usage: { input_tokens: 10, output_tokens: 5 },
-    }),
-  },
-});
-
-const baseInput: JudgeInput = {
-  appContext: 'African FLWs teaching mothers KMC for SVN newborns. Modestly dressed.',
-  appType: 'learn',
-  formName: 'KMC positioning',
-  formPosition: 'module 1, form 0 (instructional)',
-  field: {
-    id: 'kmc_position_demo',
-    kind: 'label',
-    label: "Show the mother how to support the baby's head and neck while skin-to-skin.",
-    hint: null,
-    options: [],
-  },
-  surroundingFields: [],
-};
-
-describe('judgeField', () => {
-  it('parses a valid yes-self-use response', async () => {
-    const fake = fakeAnthropic(
-      JSON.stringify({
-        generate: true,
-        use_case: 'flw_self_use',
-        why: 'FLW uses this to demonstrate KMC positioning.',
-        directive: 'Frontline worker assisting a mother holding a small newborn skin-to-skin.',
-      }),
-    );
-    const out = await judgeField(baseInput, fake as any);
-    expect(out.generate).toBe(true);
-    expect(out.use_case).toBe('flw_self_use');
-  });
-
-  it('parses a valid no response', async () => {
-    const fake = fakeAnthropic(JSON.stringify({ generate: false, why: 'numeric input', directive: null }));
-    const out = await judgeField(baseInput, fake as any);
-    expect(out.generate).toBe(false);
-  });
-
-  it('throws on schema-invalid LLM output', async () => {
-    const fake = fakeAnthropic(JSON.stringify({ generate: 'maybe', why: 42 }));
-    await expect(judgeField(baseInput, fake as any)).rejects.toThrow();
-  });
-
-  it('throws on non-JSON LLM output', async () => {
-    const fake = fakeAnthropic('I am sorry, I cannot');
-    await expect(judgeField(baseInput, fake as any)).rejects.toThrow();
-  });
-
-  it('places appContext in a cache_control:ephemeral block', async () => {
-    const fake = fakeAnthropic(
-      JSON.stringify({ generate: false, why: 'x', directive: null }),
-    );
-    await judgeField(baseInput, fake as any);
-    const callArgs = fake.messages.create.mock.calls[0][0];
-    const sysBlocks = Array.isArray(callArgs.system) ? callArgs.system : [];
-    const ephemeral = sysBlocks.find((b: any) => b.cache_control?.type === 'ephemeral');
-    expect(ephemeral).toBeDefined();
-    expect(ephemeral.text).toContain(baseInput.appContext);
-  });
-});
-```
-
-- [ ] **Step 2: Run test, verify it fails**
-
-```bash
-npm test -- lib/multimedia-judge.test.ts
-```
-
-Expected: FAIL — module does not exist.
-
-- [ ] **Step 3: Implement**
-
-```typescript
-// lib/multimedia-judge.ts
-import { z } from 'zod';
-import type Anthropic from '@anthropic-ai/sdk';
-
-export const judgeOutputSchema = z.object({
-  generate: z.boolean(),
-  use_case: z.enum(['flw_self_use', 'flw_shows_client', 'both']).optional().nullable(),
-  why: z.string().min(1).max(500),
-  directive: z.string().max(800).nullable(),
-});
-
-export type JudgeOutput = z.infer<typeof judgeOutputSchema>;
-
-export interface JudgeInput {
-  appContext: string;
-  appType: 'learn' | 'deliver';
-  formName: string;
-  formPosition: string;
-  field: {
-    id: string;
-    kind: string;
-    label: string;
-    hint: string | null;
-    options: string[];
-  };
-  surroundingFields: Array<{ id: string; kind: string; label: string }>;
-}
-
-const SYSTEM_HEAD = `You decide whether to generate a display-only image for a single CommCare app question.
-
-Criterion (yes if EITHER applies):
-1. The frontline worker (FLW) would use this image themselves to do their job — e.g. a step-by-step demonstration, a labeled diagram of an anatomy or device.
-2. The FLW would show the image to a client to communicate something — e.g. a visual choice card, a "what does X look like" reference.
-
-Skip if the question is purely numeric (weight, age), date/time, or a yes/no without ambiguity. Skip if the question's text alone is unambiguous and concrete.
-
-Return STRICT JSON only, matching this schema:
-{
-  "generate": boolean,
-  "use_case": "flw_self_use" | "flw_shows_client" | "both" | null,
-  "why": "short rationale, ≤200 chars",
-  "directive": "draft Image Directive for the generator, ≤500 chars, or null if generate=false"
-}
-
-Image Directive guidance: be specific about the subject, action, environment, lighting, and any modesty/representation cues from the application context. The directive will be passed verbatim to an image generator.`;
-
-export async function judgeField(
-  input: JudgeInput,
-  anthropic: Anthropic,
-  model = 'claude-sonnet-4-6',
-): Promise<JudgeOutput> {
-  const userPayload = {
-    app_type: input.appType,
-    form_name: input.formName,
-    form_position: input.formPosition,
-    field: input.field,
-    surrounding_fields: input.surroundingFields,
-  };
-
-  const res = await anthropic.messages.create({
-    model,
-    max_tokens: 600,
-    system: [
-      { type: 'text', text: SYSTEM_HEAD },
-      {
-        type: 'text',
-        text: `Application Context (constant for this opp):\n${input.appContext}`,
-        cache_control: { type: 'ephemeral' },
-      },
-    ],
-    messages: [{ role: 'user', content: JSON.stringify(userPayload) }],
-  });
-
-  const text = (res.content[0] as { type: string; text?: string }).text ?? '';
-  const trimmed = text.trim().replace(/^```(?:json)?\s*|\s*```$/g, '');
-  let parsed: unknown;
-  try {
-    parsed = JSON.parse(trimmed);
-  } catch {
-    throw new Error(`judge returned non-JSON: ${text.slice(0, 200)}`);
-  }
-  return judgeOutputSchema.parse(parsed);
-}
-```
-
-- [ ] **Step 4: Run tests, verify pass**
-
-```bash
-npm test -- lib/multimedia-judge.test.ts
-```
-
-Expected: PASS (5 tests).
-
-- [ ] **Step 5: Commit**
-
-```bash
-git add lib/multimedia-judge.ts lib/multimedia-judge.test.ts
-git commit -m "feat(lib): multimedia-judge LLM rubric for image-worthy fields"
-```
-
----
-
-## Task 8: `lib/multimedia-xform-patch.ts` — add `<image>` itext to a form
-
-**Files:**
-- Create: `lib/multimedia-xform-patch.ts`
-- Test: `lib/multimedia-xform-patch.test.ts`
-- Test fixture: `test/fixtures/cchq/multimedia-sample-form.xml` (a minimal CommCare XForm with a label question, 30-50 lines)
-
-Pure XML-DOM manipulation. No I/O.
-
-- [ ] **Step 1: Create the test fixture**
-
-```xml
-<!-- test/fixtures/cchq/multimedia-sample-form.xml -->
-<?xml version="1.0" encoding="UTF-8"?>
-<h:html xmlns:h="http://www.w3.org/1999/xhtml" xmlns="http://www.w3.org/2002/xforms"
-        xmlns:jr="http://openrosa.org/javarosa">
-  <h:head>
-    <h:title>KMC positioning</h:title>
-    <model>
-      <instance>
-        <data id="kmc_positioning" xmlns="">
-          <kmc_position_demo/>
-        </data>
-      </instance>
-      <itext>
-        <translation lang="en" default="">
-          <text id="kmc_position_demo-label">
-            <value>Show the mother how to support the baby's head and neck.</value>
-          </text>
-        </translation>
-      </itext>
-      <bind nodeset="/data/kmc_position_demo" type="xsd:string"/>
-    </model>
-  </h:head>
-  <h:body>
-    <input ref="/data/kmc_position_demo">
-      <label ref="jr:itext('kmc_position_demo-label')"/>
-    </input>
-  </h:body>
-</h:html>
-```
-
-- [ ] **Step 2: Write the failing test**
-
-```typescript
-// lib/multimedia-xform-patch.test.ts
-import { describe, it, expect } from 'vitest';
-import { readFileSync } from 'node:fs';
-import { join } from 'node:path';
-import { addImageItext } from './multimedia-xform-patch.js';
-
-const FIXTURE = readFileSync(
-  join(__dirname, '../test/fixtures/cchq/multimedia-sample-form.xml'),
-  'utf-8',
-);
-
-describe('addImageItext', () => {
-  it('adds an <image> jr:// value to the matching itext text node', () => {
-    const out = addImageItext(FIXTURE, [
-      { fieldId: 'kmc_position_demo', cczFilename: 'kmc_position_demo.png' },
-    ]);
-    expect(out.patched).toBe(true);
-    expect(out.xml).toContain('<value form="image">jr://file/commcare/image/kmc_position_demo.png</value>');
-    // The original label value must remain intact.
-    expect(out.xml).toContain("Show the mother how to support the baby's head and neck.");
-  });
-
-  it('is idempotent — re-applying does not duplicate the <image> entry', () => {
-    const once = addImageItext(FIXTURE, [
-      { fieldId: 'kmc_position_demo', cczFilename: 'kmc_position_demo.png' },
-    ]);
-    const twice = addImageItext(once.xml, [
-      { fieldId: 'kmc_position_demo', cczFilename: 'kmc_position_demo.png' },
-    ]);
-    const occurrences = (twice.xml.match(/jr:\/\/file\/commcare\/image\/kmc_position_demo\.png/g) ?? []).length;
-    expect(occurrences).toBe(1);
-    expect(twice.patched).toBe(false);
-  });
-
-  it('returns patched=false when the field has no matching itext entry', () => {
-    const out = addImageItext(FIXTURE, [{ fieldId: 'no_such_field', cczFilename: 'x.png' }]);
-    expect(out.patched).toBe(false);
-  });
-
-  it('handles multiple fields in one pass', () => {
-    // Build a form with two label-text entries
-    const twoFieldForm = FIXTURE.replace(
-      /<text id="kmc_position_demo-label">[\s\S]*?<\/text>/,
-      `<text id="a-label"><value>A</value></text><text id="b-label"><value>B</value></text>`,
-    );
-    const out = addImageItext(twoFieldForm, [
-      { fieldId: 'a', cczFilename: 'a.png' },
-      { fieldId: 'b', cczFilename: 'b.png' },
-    ]);
-    expect(out.xml).toContain('jr://file/commcare/image/a.png');
-    expect(out.xml).toContain('jr://file/commcare/image/b.png');
-  });
-});
-```
-
-- [ ] **Step 3: Run test, verify it fails**
-
-```bash
-npm test -- lib/multimedia-xform-patch.test.ts
-```
-
-Expected: FAIL — module does not exist.
-
-- [ ] **Step 4: Check available XML library**
-
-```bash
-grep -E '"@xmldom/xmldom"|"xmldom"|"fast-xml-parser"' package.json
-```
-
-If `@xmldom/xmldom` is present, use it. Otherwise `npm install --save @xmldom/xmldom` then commit `package.json` + `package-lock.json` changes alongside the implementation in step 6.
-
-- [ ] **Step 5: Implement**
-
-```typescript
-// lib/multimedia-xform-patch.ts
-//
-// Pure XML transformation: given a CommCare XForm and a list of
-// (fieldId, cczFilename) pairs, add a `<value form="image">jr://...</value>`
-// child to the matching `<text id="<fieldId>-label">` node in itext.
-// Idempotent: skips fields whose <image> value is already present.
-
-import { DOMParser, XMLSerializer } from '@xmldom/xmldom';
-
-export interface ImageBinding {
-  fieldId: string;
-  cczFilename: string;
-}
-
-export interface PatchResult {
-  patched: boolean;
-  xml: string;
-  applied: string[];      // field ids that were modified
-  skipped: string[];      // field ids whose itext was already up-to-date
-  notFound: string[];     // field ids with no matching itext text
-}
-
-export function addImageItext(xml: string, bindings: ImageBinding[]): PatchResult {
-  const doc = new DOMParser().parseFromString(xml, 'text/xml');
-  const applied: string[] = [];
-  const skipped: string[] = [];
-  const notFound: string[] = [];
-
-  // Find every <text id="..."> node anywhere; loose match handles
-  // multi-translation forms (each <translation lang="..."> has its own copy).
-  const texts = Array.from(doc.getElementsByTagName('text'));
-
-  for (const b of bindings) {
-    const targetId = `${b.fieldId}-label`;
-    const matches = texts.filter(t => t.getAttribute('id') === targetId);
-    if (matches.length === 0) {
-      notFound.push(b.fieldId);
-      continue;
-    }
-
-    const jrUrl = `jr://file/commcare/image/${b.cczFilename}`;
-    let modifiedThisField = false;
-    for (const t of matches) {
-      const existing = Array.from(t.getElementsByTagName('value')).some(
-        v => v.getAttribute('form') === 'image' && (v.textContent ?? '').trim() === jrUrl,
-      );
-      if (existing) continue;
-
-      const valueEl = doc.createElement('value');
-      valueEl.setAttribute('form', 'image');
-      valueEl.appendChild(doc.createTextNode(jrUrl));
-      t.appendChild(valueEl);
-      modifiedThisField = true;
-    }
-
-    if (modifiedThisField) applied.push(b.fieldId);
-    else skipped.push(b.fieldId);
-  }
-
-  const out = new XMLSerializer().serializeToString(doc);
-  return { patched: applied.length > 0, xml: out, applied, skipped, notFound };
-}
-```
-
-- [ ] **Step 6: Run tests, verify pass**
-
-```bash
-npm test -- lib/multimedia-xform-patch.test.ts
-```
-
-Expected: PASS (4 tests).
-
-- [ ] **Step 7: Commit**
-
-```bash
-git add lib/multimedia-xform-patch.ts lib/multimedia-xform-patch.test.ts test/fixtures/cchq/multimedia-sample-form.xml
-# include package.json/lock if dependency was added
-git commit -m "feat(lib): multimedia-xform-patch — add <image> itext entries"
-```
-
----
-
-## Task 9: `commcare_upload_multimedia` MCP atom
-
-**Files:**
-- Modify: `mcp/connect/backends/commcare.ts` (add `uploadMultimedia` method)
-- Modify: `mcp/connect/capability-map.ts` (add capability)
-- Test: `test/mcp/connect/unit/commcare-upload-multimedia.test.ts`
-- Integration test: `test/mcp/connect/integration/commcare-upload-multimedia.test.ts`
-
-**Live contract** (probed 2026-05-05; see `scripts/probe-multimedia-upload.ts` header):
-
-- **Method**: `POST`
-- **Path**: `/a/<domain>/apps/<app_id>/multimedia/uploaded/<media_type>/` where `<media_type>` ∈ `{image, audio, video, text}` derived from the `content_type` MIME prefix
-- **Body**: multipart/form-data; **required** `Filedata` (bytes) and `path` (`jr://file/commcare/<media_type>/<filename>.<ext>`); **optional** `originalPath`, `shared='t'`, `license`, `author`, `attribution-notes`
-- **Headers**: `X-CSRFToken` (from session cookie), `Referer: <baseUrl>/a/<domain>/apps/view/<app_id>/`
-- **Response 200**: `Content-Type: text/html` (lies; body is JSON):
-  ```json
-  {
-    "ref": {
-      "path": "jr://file/commcare/image/foo.png",
-      "uid":  "<32-hex md5>",        // → file_hash_md5 (CCHQ dedupes on this)
-      "m_id": "<32-hex couch _id>",  // → multimedia_id
-      "url":  "/hq/multimedia/file/CommCareImage/<m_id>/",
-      "updated": false,
-      "media_type": "Image"
-    },
-    "errors": []
-  }
-  ```
-- **Failure**: `400` with non-empty `errors[]`; `302 → /accounts/login/` on session expiry; `403` on CSRF miss
-
-**CRITICAL gotcha — orphan pruning.** CCHQ's `clean_paths()` strips multimedia entries that no form references on the next `make_build`. So this atom alone does NOT bundle the file into the released CCZ — the form-XML must already reference the `jr://...` path before `make_build` runs. The skill (Task 12) is responsible for ordering: patch form XML → upload media → make_build → release. The atom only owns the upload step.
-
-- [ ] **Step 1: Write the failing unit test**
-
-```typescript
-// test/mcp/connect/unit/commcare-upload-multimedia.test.ts
-import { describe, it, expect, vi } from 'vitest';
-import { CommcareBackend } from '../../../../mcp/connect/backends/commcare.js';
-
-function fakeRequest(handler: (url: string, init: any) => { status: number; body: string; contentType?: string }) {
-  return {
-    get: vi.fn().mockImplementation(async () => ({
-      status: () => 200, text: async () => '<html></html>', headers: () => new Headers(),
-    })),
-    post: vi.fn().mockImplementation(async (url: string, init: any) => {
-      const r = handler(url, init);
-      const headers = new Headers({ 'content-type': r.contentType ?? 'text/html' });
-      return { status: () => r.status, text: async () => r.body, headers: () => headers };
-    }),
-    storageState: async () => ({ cookies: [{ name: 'csrftoken', value: 'TOKEN' }] }),
-  };
-}
-
-const SUCCESS_BODY = JSON.stringify({
-  ref: {
-    path: 'jr://file/commcare/image/x.png',
-    uid: 'd'.repeat(32),       // md5 hex
-    m_id: '9'.repeat(32),
-    url: '/hq/multimedia/file/CommCareImage/' + '9'.repeat(32) + '/',
-    updated: false,
-    media_type: 'Image',
-  },
-  errors: [],
-});
-
-describe('commcare uploadMultimedia', () => {
-  it('POSTs to /multimedia/uploaded/image/ for image content types', async () => {
-    let postedUrl = '';
-    const fake = fakeRequest((url) => {
-      postedUrl = url;
-      return { status: 200, body: SUCCESS_BODY };
-    });
-    const backend = new CommcareBackend({ request: fake as any, baseUrl: 'https://test.cchq' });
-    await backend.uploadMultimedia({
-      domain: 'demo', app_id: 'a'.repeat(32),
-      media_path: 'jr://file/commcare/image/x.png',
-      file_bytes: Buffer.from('PNG'), content_type: 'image/png',
-    });
-    expect(postedUrl).toBe('https://test.cchq/a/demo/apps/' + 'a'.repeat(32) + '/multimedia/uploaded/image/');
-  });
-
-  it('returns multimedia_id (m_id) and file_hash_md5 (uid) from ref', async () => {
-    const fake = fakeRequest(() => ({ status: 200, body: SUCCESS_BODY }));
-    const backend = new CommcareBackend({ request: fake as any, baseUrl: 'https://test.cchq' });
-    const out = await backend.uploadMultimedia({
-      domain: 'demo', app_id: 'a'.repeat(32),
-      media_path: 'jr://file/commcare/image/x.png',
-      file_bytes: Buffer.from('PNG'), content_type: 'image/png',
-    });
-    expect(out.multimedia_id).toBe('9'.repeat(32));
-    expect(out.file_hash_md5).toBe('d'.repeat(32));
-  });
-
-  it('routes audio content types to /multimedia/uploaded/audio/', async () => {
-    let postedUrl = '';
-    const fake = fakeRequest((url) => {
-      postedUrl = url;
-      return { status: 200, body: SUCCESS_BODY };
-    });
-    const backend = new CommcareBackend({ request: fake as any, baseUrl: 'https://test.cchq' });
-    await backend.uploadMultimedia({
-      domain: 'd', app_id: 'a'.repeat(32),
-      media_path: 'jr://file/commcare/audio/x.mp3',
-      file_bytes: Buffer.from('MP3'), content_type: 'audio/mpeg',
-    });
-    expect(postedUrl).toMatch(/\/multimedia\/uploaded\/audio\/$/);
-  });
-
-  it('sets X-CSRFToken from cookies and uses the app-view page as Referer', async () => {
-    let init: any = null;
-    const fake = fakeRequest((_url, _init) => {
-      init = _init;
-      return { status: 200, body: SUCCESS_BODY };
-    });
-    const backend = new CommcareBackend({ request: fake as any, baseUrl: 'https://test.cchq' });
-    await backend.uploadMultimedia({
-      domain: 'demo', app_id: 'a'.repeat(32),
-      media_path: 'jr://file/commcare/image/x.png',
-      file_bytes: Buffer.from('PNG'), content_type: 'image/png',
-    });
-    expect(init.headers['X-CSRFToken']).toBe('TOKEN');
-    expect(init.headers.Referer).toBe('https://test.cchq/a/demo/apps/view/' + 'a'.repeat(32) + '/');
-  });
-
-  it('throws with errors[] payload when CCHQ returns 400', async () => {
-    const fake = fakeRequest(() => ({
-      status: 400,
-      body: JSON.stringify({ ref: null, errors: ['File extension does not match content_type'] }),
-    }));
-    const backend = new CommcareBackend({ request: fake as any, baseUrl: 'https://test.cchq' });
-    await expect(
-      backend.uploadMultimedia({
-        domain: 'd', app_id: 'a'.repeat(32),
-        media_path: 'jr://file/commcare/image/x.png',
-        file_bytes: Buffer.from('x'), content_type: 'image/png',
-      }),
-    ).rejects.toThrow(/extension does not match/);
-  });
-
-  it('throws on 302 redirect (session expired)', async () => {
-    const fake = fakeRequest(() => ({ status: 302, body: '<html>login</html>' }));
-    const backend = new CommcareBackend({ request: fake as any, baseUrl: 'https://test.cchq' });
-    await expect(
-      backend.uploadMultimedia({
-        domain: 'd', app_id: 'a'.repeat(32),
-        media_path: 'jr://file/commcare/image/x.png',
-        file_bytes: Buffer.from('x'), content_type: 'image/png',
-      }),
-    ).rejects.toThrow(/302|session/i);
-  });
-});
-```
-
-- [ ] **Step 2: Run test, verify it fails**
-
-```bash
-npm test -- test/mcp/connect/unit/commcare-upload-multimedia.test.ts
-```
-
-Expected: FAIL — `uploadMultimedia` does not exist.
-
-- [ ] **Step 3: Add the backend method**
-
-In `mcp/connect/backends/commcare.ts`, add (place after `patchXform` around line 360+):
-
-```typescript
-export interface UploadMultimediaArgs {
-  domain: string;
-  app_id: string;
-  media_path: string;          // jr://file/commcare/<image|audio|video|text>/<filename>.<ext>
-  file_bytes: Buffer;
-  content_type: string;        // "image/png" | "image/jpeg" | "audio/mpeg" | ...
-}
-
-export interface UploadMultimediaResult {
-  multimedia_id: string;       // CouchDB doc _id (CCHQ's ref.m_id)
-  file_hash_md5: string;       // md5 hex of the file bytes (CCHQ's ref.uid)
-}
-
-// Inside CommcareBackend class:
-async uploadMultimedia(args: UploadMultimediaArgs): Promise<UploadMultimediaResult> {
-  const mediaType = mediaTypeFromContentType(args.content_type);
-  const path = `/a/${args.domain}/apps/${args.app_id}/multimedia/uploaded/${mediaType}/`;
-  const refreshPath = `/a/${args.domain}/apps/view/${args.app_id}/`;
-
-  // Refresh CSRF + session via the app view page (same pattern as patchXform).
-  await this.opts.request.get(`${this.opts.baseUrl}${refreshPath}`);
-  const csrf = await this.csrfFromCookies();
-
-  // Derive filename from media_path (last URI segment).
-  const filename = args.media_path.split('/').pop() ?? 'unnamed';
-
-  const form = new FormData();
-  form.set('Filedata', new Blob([args.file_bytes], { type: args.content_type }), filename);
-  form.set('path', args.media_path);
-
-  const res = await this.opts.request.post(`${this.opts.baseUrl}${path}`, {
-    multipart: form as any,
-    headers: {
-      'X-CSRFToken': csrf ?? '',
-      Referer: `${this.opts.baseUrl}${refreshPath}`,
-    },
-    maxRedirects: 0,
-  });
-  const status = res.status();
-  const body = await res.text();
-
-  if (status === 302) {
-    throw new Error(
-      `commcare_upload_multimedia POST ${path} returned 302 — session expired. Re-run /ace:connect-login.`,
-    );
-  }
-  if (status !== 200) {
-    let errs: string[] = [];
-    try {
-      const j = JSON.parse(body);
-      if (Array.isArray(j?.errors)) errs = j.errors;
-    } catch { /* fall through */ }
-    const errMsg = errs.length ? errs.join('; ') : body.slice(0, 300);
-    throw new Error(
-      `commcare_upload_multimedia POST ${path} returned ${status}: ${errMsg}`,
-    );
-  }
-
-  let parsed: { ref?: { m_id?: string; uid?: string }; errors?: string[] } = {};
-  try {
-    parsed = JSON.parse(body);
-  } catch {
-    throw new Error(`commcare_upload_multimedia non-JSON response: ${body.slice(0, 200)}`);
-  }
-  if (parsed.errors && parsed.errors.length > 0) {
-    throw new Error(`commcare_upload_multimedia errors: ${parsed.errors.join('; ')}`);
-  }
-  if (!parsed.ref?.m_id || !parsed.ref?.uid) {
-    throw new Error(
-      `commcare_upload_multimedia response missing ref.m_id / ref.uid: ${body.slice(0, 200)}`,
-    );
-  }
-  return {
-    multimedia_id: parsed.ref.m_id,
-    file_hash_md5: parsed.ref.uid,
-  };
-}
-
-function mediaTypeFromContentType(ct: string): 'image' | 'audio' | 'video' | 'text' {
-  if (ct.startsWith('image/')) return 'image';
-  if (ct.startsWith('audio/')) return 'audio';
-  if (ct.startsWith('video/')) return 'video';
-  if (ct.startsWith('text/'))  return 'text';
-  throw new Error(`commcare_upload_multimedia: unsupported content_type ${ct}`);
-}
-```
-
-- [ ] **Step 4: Run tests, verify pass**
-
-```bash
-npm test -- test/mcp/connect/unit/commcare-upload-multimedia.test.ts
-```
-
-Expected: PASS (6 tests).
-
-- [ ] **Step 5: Add integration test**
-
-```typescript
-// test/mcp/connect/integration/commcare-upload-multimedia.test.ts
-//
-// Verifies the upload atom against a live CCHQ project. NOTE: this test
-// only verifies the upload step (200 + parseable response). It does NOT
-// build or release because orphan multimedia (no form reference) gets
-// pruned by clean_paths() — that's expected CCHQ behavior, not a bug.
-// Bundling-into-CCZ is the skill's responsibility, exercised end-to-end
-// in Task 14's live smoke test.
-import { describe, it, expect } from 'vitest';
-import { commcareClient } from '../../../../mcp/connect/backends/commcare.js';
-
-const RUN = process.env.CONNECT_INTEGRATION === '1';
-
-const TINY_PNG = Buffer.from(
-  'iVBORw0KGgoAAAANSUhEUgAAAAEAAAABCAQAAAC1HAwCAAAAC0lEQVR42mNkYAAAAAYAAjCB0C8AAAAASUVORK5CYII=',
-  'base64',
-);
-
-describe.skipIf(!RUN)('commcare_upload_multimedia (integration)', () => {
-  it('uploads a tiny PNG and returns multimedia_id + file_hash_md5', async () => {
-    const domain = process.env.ACE_HQ_DOMAIN!;
-    const appId = process.env.ACE_SMOKE_APP_ID!;
-    expect(domain).toBeTruthy();
-    expect(appId).toBeTruthy();
-
-    const c = await commcareClient();
-    const filename = `probe-${Date.now()}.png`;
-    const out = await c.uploadMultimedia({
-      domain, app_id: appId,
-      media_path: `jr://file/commcare/image/${filename}`,
-      file_bytes: TINY_PNG, content_type: 'image/png',
-    });
-    expect(out.multimedia_id).toMatch(/^[0-9a-f]{32}$/);
-    expect(out.file_hash_md5).toMatch(/^[0-9a-f]{32}$/);
-  }, 30_000);
-
-  it('is idempotent — same bytes return the same multimedia_id', async () => {
-    const domain = process.env.ACE_HQ_DOMAIN!;
-    const appId = process.env.ACE_SMOKE_APP_ID!;
-    const c = await commcareClient();
-    const filename = `probe-idem-${Date.now()}.png`;
-    const a = await c.uploadMultimedia({
-      domain, app_id: appId,
-      media_path: `jr://file/commcare/image/${filename}`,
-      file_bytes: TINY_PNG, content_type: 'image/png',
-    });
-    const b = await c.uploadMultimedia({
-      domain, app_id: appId,
-      media_path: `jr://file/commcare/image/${filename}`,
-      file_bytes: TINY_PNG, content_type: 'image/png',
-    });
-    expect(b.multimedia_id).toBe(a.multimedia_id);
-    expect(b.file_hash_md5).toBe(a.file_hash_md5);
-  }, 30_000);
-});
-```
-
-- [ ] **Step 6: Run integration test (gated)**
-
-```bash
-CONNECT_INTEGRATION=1 \
-  ACE_HQ_DOMAIN=connect-ace-prod \
-  ACE_SMOKE_APP_ID=4e20ddf5beca42278c4d2c20383eb943 \
-  npm test -- test/mcp/connect/integration/commcare-upload-multimedia.test.ts
-```
-
-(Domain + app_id values above are from the live probe in Task 2; replace with current smoke target if those are stale.)
-
-Expected: PASS. If it fails because the contract has drifted, re-run the probe script to confirm the live shape and adjust the atom + test together.
-
-- [ ] **Step 7: Commit**
-
-```bash
-git add mcp/connect/backends/commcare.ts \
-        test/mcp/connect/unit/commcare-upload-multimedia.test.ts \
-        test/mcp/connect/integration/commcare-upload-multimedia.test.ts \
-        scripts/probe-multimedia-upload.ts
-git commit -m "feat(connect): commcare_upload_multimedia atom backend"
-```
-
----
-
-## Task 10: Wire the atom into the MCP server + capability map
-
-**Files:**
-- Modify: `mcp/connect-server.ts` (add `server.tool('commcare_upload_multimedia', ...)`)
-- Modify: `mcp/connect/capability-map.ts` (add the capability entry)
-
-- [ ] **Step 1: Add tool registration**
-
-In `mcp/connect-server.ts`, after the `commcare_patch_xform` block (line ~413), add:
-
-```typescript
-// commcare_upload_multimedia — POST a binary multimedia asset to CCHQ.
-// Required companion to commcare_patch_xform: the form-XML patch makes the
-// build *reference* the asset; this atom puts the *bytes* into CouchDB so
-// CCHQ's clean_paths() doesn't prune the reference on the next make_build.
-//
-// Endpoint: POST /a/<domain>/apps/<app_id>/multimedia/uploaded/<media_type>/
-//   <media_type> derives from content_type MIME prefix.
-// Auth: same Playwright session as commcare_patch_xform; X-CSRFToken header.
-// Returns: { multimedia_id, file_hash_md5 } — see backends/commcare.ts.
-//
-// CRITICAL ORDER OF OPERATIONS:
-//   1. patch form XML to reference jr://file/commcare/<type>/<filename>
-//   2. commcare_upload_multimedia (this atom)
-//   3. commcare_make_build + commcare_release_build
-// Reversing 1 and 2 still works (uploads are idempotent), but skipping
-// step 1 means the upload is silently no-op for FLW devices because
-// CCHQ's clean_paths() prunes orphaned media on every build.
-server.tool('commcare_upload_multimedia',
-  {
-    domain: z.string(),
-    app_id: z.string().regex(/^[0-9a-f]{32}$/, '32-char hex'),
-    media_path: z.string().regex(/^jr:\/\/file\/commcare\/(image|audio|video|text)\/[^\/]+$/),
-    file_bytes_base64: z.string().min(1).describe('Asset bytes, base64-encoded'),
-    content_type: z.string().regex(/^(image|audio|video|text)\//),
-  },
-  async (args) =>
-    runAtom(async () => {
-      const { file_bytes_base64, ...rest } = args;
-      return (await commcareClient()).uploadMultimedia({
-        ...rest,
-        file_bytes: Buffer.from(file_bytes_base64, 'base64'),
-      });
-    }),
-);
-```
-
-- [ ] **Step 2: (skipped) capability-map**
-
-`mcp/connect/capability-map.ts` is **Connect-side-only** — it lists the
-21 atoms targeting `connect.dimagi.com`. The four CommCare atoms
-(`make_build`, `release_build`, `download_ccz`, `patch_xform`) all
-target `commcarehq.org` via the separate `commcareClient()` factory
-and are NOT tracked in capability-map. Adding an `'upload_multimedia'`
-entry there would either break the typed `Record<Capability, ...>` or
-imply wrong routing.
-
-Earlier draft of this plan said "add a parallel entry" based on an
-incorrect read of capability-map's scope. **No edit to capability-map
-is needed for CommCare atoms.** Tool registration in `connect-server.ts`
-(Step 1 above) is sufficient.
-
-- [ ] **Step 3: Smoke-test the MCP server boots**
-
-```bash
-npm run mcp:connect 2>&1 | head -20 &
-sleep 2
-kill %1 || true
-```
-
-Expected: server starts without throwing. If a Zod schema or capability-map entry is malformed, the import fails noisily.
-
-- [ ] **Step 4: Commit**
-
-```bash
-git add mcp/connect-server.ts mcp/connect/capability-map.ts
-git commit -m "feat(connect): register commcare_upload_multimedia tool"
-```
-
----
-
-## Task 11: Doctor checks for new env vars
-
-**Files:**
-- Modify: `bin/ace-doctor` (add CONTENT_GENERATOR_URL / CONTENT_GENERATOR_API_KEY to the env-drift check)
-
-- [ ] **Step 1: Find the existing env-drift check**
-
-```bash
-grep -n "CONTENT_GENERATOR\|env_drift\|env_file" bin/ace-doctor | head -20
-grep -n "OCS_API_TOKEN\|ACE_HQ_USERNAME" bin/ace-doctor | head -10
-```
-
-The env-drift block enumerates expected `.env` keys and reports any that are missing.
-
-- [ ] **Step 2: Add the two new keys**
-
-In `bin/ace-doctor`, locate the array / list of expected env var names and add `CONTENT_GENERATOR_URL` and `CONTENT_GENERATOR_API_KEY`.
-
-If there's a separate "service health" section that probes each integration, add a passive check: present-and-non-empty only (no live HTTP call — image generation is too slow / costly to ping on every doctor run).
-
-- [ ] **Step 3: Run doctor and verify**
-
-```bash
-/ace:doctor 2>&1 | grep -i content
-```
-
-Expected: output mentions `CONTENT_GENERATOR_URL` and `CONTENT_GENERATOR_API_KEY` either as `OK` (if `.env` was regenerated in Task 3) or `MISSING` (otherwise).
-
-- [ ] **Step 4: Commit**
-
-```bash
-git add bin/ace-doctor
-git commit -m "feat(doctor): check Content Generator env vars"
-```
-
----
-
-## Task 12: `skills/app-multimedia-coverage/SKILL.md` — the orchestration prose
-
-**Files:**
-- Create: `skills/app-multimedia-coverage/SKILL.md`
-
-This is a prompt, not code. Mirror the structure of `skills/commcare-form-patch/SKILL.md` (process, mode behavior, dry-run, failure modes, MCP tools used, change log).
-
-- [ ] **Step 1: Read the reference skill**
-
-Open `skills/commcare-form-patch/SKILL.md` end-to-end and `skills/app-connect-coverage/SKILL.md` for the verify+fix pattern.
-
-- [ ] **Step 2: Author SKILL.md**
-
-```markdown
----
-name: app-multimedia-coverage
-description: >
-  Post-Phase-2 enhancement skill that attaches display-only images to
-  Connect Learn / Deliver app questions. Uses an LLM judge to pick which
-  fields deserve images (criterion: FLW uses it OR shows it to a client),
-  generates them via Dimagi's Content Generator API, patches the form
-  XML to add `<image>` itext references, uploads the assets to CCHQ via
-  `commcare_upload_multimedia`, and re-builds + re-releases the apps.
-  Manual gate; not part of `/ace:run`. Sibling of `commcare-form-patch`.
-  Delete when Nova ships first-class field-level multimedia (see § Removal
-  criteria).
----
-
-# App Multimedia Coverage
-
-Generate and attach display-only images to Connect app questions where
-they meaningfully help frontline workers. This skill closes the loop
-that Nova doesn't today: schema for media on a field, asset generation,
-CCZ bundling, form-XML reference, and a release. Mirrors the
-end-to-end pattern of `commcare-form-patch`.
-
-## Why this skill exists
-
-CommCare apps render images on questions via standard `<image>` itext
-references and CCZ-bundled assets at `commcare/multimedia/image/...`.
-Nova has no schema for this — its `image`/`audio`/`video` field kinds
-are *input capture*, not *display*. Until Nova ships field-level media
-(see § Removal criteria), this skill is the only path from "PDD" to
-"images on screen."
-
-## Removal criteria
-
-Delete this skill (and the supporting helpers + atom) when ALL of:
-
-1. Nova ships a field-level `media: { image_url, alt_text, image_directives }`
-   schema and round-trips it through `compile_app`.
-2. Nova's compile bundles linked media into the produced CCZ at
-   `commcare/multimedia/image/...`.
-3. A clean `/ace:run` against `CRISPR-Test-004-KMC-multimedia` produces
-   images-attached apps without this skill firing.
-4. Each affected opp's `run_state.yaml` has empty
-   `phase_2_backlog.app-multimedia-coverage`.
-
-## Process
-
-Inputs:
-- `<opp-name>` — positional, required
-- `--app=learn|deliver|both` — default `both`
-- `--max-images=N` — default `100` (runaway guard)
-- `--dry-run` — investigate without generating or patching
-
-For each app in scope:
-
-1. **Read deployment summary** `2-commcare/app-deploy_summary.md` →
-   `hq_domain`, `learn_app_id` / `deliver_app_id`, latest released
-   `build_id`. Read PDD for App Context derivation.
-
-2. **Derive Application Context.** If
-   `2-commcare/app-multimedia-coverage_app-context.md` exists, use as-is
-   (operator override wins). Otherwise synthesize from the PDD's
-   intervention description + a target-FLW one-liner + the standard
-   Dimagi guidance ("People should be dressed modestly. All of the
-   users and participants should be representative of the context.").
-   Write the synthesized version back so the operator can edit.
-
-3. **LLM-judge each visible field** via `lib/multimedia-judge.ts`. Skip
-   `hidden` and `calculate` kinds. Skip kinds with no displayed label.
-   Application Context goes in a prompt-cached system block — every
-   per-field call benefits from cache hit on the constant block.
-
-4. **Write candidates YAML** to
-   `2-commcare/app-multimedia-coverage_candidates-<app>.yaml`. If the
-   file already exists, **operator hand-edits win** — load as-is. The
-   judge runs only on first creation; re-run with `--rejudge` to
-   refresh.
-
-5. **Cost preview** — print
-   `Will generate {N} images for <app>; ~30s each ≈ M minutes.`
-   Halt if `N > --max-images`.
-
-6. **Generate images.** For each `generate: true` candidate:
-   - Compute `prompt_hash` via `lib/multimedia-prompt-hash.ts`.
-   - Cache hit (PNG present at expected path) → skip.
-   - Cache miss → `ContentGeneratorClient.generateImage(...)` → save
-     PNG to
-     `2-commcare/app-multimedia-coverage_generated/<app>/<form_unique_id>/<field_id>__<prompt_hash>.png`
-     → update `app-multimedia-coverage_manifest.yaml`.
-   - Default: serial. Bounded parallelism may be added later.
-
-7. **Patch form XML** for each form with ≥1 image:
-   - `commcare_download_ccz` to fetch the released form XML.
-   - `addImageItext()` from `lib/multimedia-xform-patch.ts` to add the
-     `<image>` itext entries.
-   - `commcare_patch_xform` to POST the patched XML.
-   - Re-fetch via `commcare_download_ccz` to confirm the patch stuck.
-
-   **WHY this happens before the upload:** CCHQ's `clean_paths()` prunes
-   any multimedia binary that no form references on the next
-   `make_build`. The form-XML reference is what causes CCHQ to retain
-   the asset in the build's multimedia map. Reverse this order and the
-   asset lands in CouchDB but never reaches FLW devices.
-
-8. **Upload multimedia to CCHQ** via `commcare_upload_multimedia` per
-   image. Record returned `multimedia_id` (CCHQ couch _id) and
-   `file_hash_md5` (CCHQ's md5 of the bytes) into the manifest.
-
-9. **Build + release** — `commcare_make_build` then
-   `commcare_release_build`. Capture new `build_id` + `version`.
-
-10. **Verify** — re-download the released CCZ. Assert every manifest
-    image is at `commcare/multimedia/image/<filename>` AND every patched
-    form XML references its `jr://file/...` URI. Halt on mismatch — if
-    the file is missing despite a successful upload, the most likely
-    cause is step 7 didn't land before step 9 (orphan-prune).
-
-11. **Report.** Write
-    `2-commcare/app-multimedia-coverage_report-<YYYY-MM-DD>.md`
-    (frontmatter + per-form table; see spec § 4 step 11).
-
-12. **Update `run_state.yaml`** with status + per-app counts under
-    `phases.manual.app-multimedia-coverage`.
-
-## Mode behavior
-
-- **Auto** (default): walk → judge → generate → patch → upload → build
-  → release → verify → report. No human gate.
-- **Review**: pause after step 4 (candidates YAML written) and after
-  step 7 (form-XML diff staged) for operator approval.
-- **Dry-run** (`--dry-run`): execute steps 1–4 + cost preview only.
-  Outputs candidates YAML for inspection. State tracks
-  `dry-run-success`.
-
-## Failure modes
-
-| Mode | Cause | Behavior |
-|---|---|---|
-| `judge.error` ≥1 field | LLM Zod validation failed | Skip that field, log to candidates, continue. Status: `partial`. |
-| Content Generator 5xx | Service hiccup | One retry with backoff, then halt. |
-| `ContentGeneratorAuthError` | Bad/missing API key | Halt; point at `/ace:doctor`. |
-| `XformConflictError` | CCHQ form sha1 changed | Halt the form, surface live sha1. |
-| `commcare_upload_multimedia` HTTP 500 | CCHQ rejected the binary | Halt skill; surface response slice. |
-| Verify (step 10) fails | Patch or upload silently dropped | Halt with per-form diff. Status: `blocked`. |
-| `--max-images` exceeded | Runaway opp | Halt before generation. |
-| Nova MCP unavailable | Step 1 fallback | Use released-CCZ XML walk for field discovery. |
-
-## MCP tools used
-
-- **Google Drive:** `drive_read_file`, `drive_create_file`, `drive_update_file`, `drive_create_folder`, `drive_list_folder`
-- **ace-connect (CCHQ atoms):** `commcare_download_ccz`, `commcare_patch_xform`, `commcare_upload_multimedia` (new), `commcare_make_build`, `commcare_release_build`
-- **Nova:** `nova_get_app`, `nova_get_form`, `nova_get_field` (read-only — for field metadata when blueprint is available)
-- **Anthropic SDK:** Sonnet 4.6 via `@anthropic-ai/sdk` (judge calls)
-- **HTTP:** Content Generator API via `lib/content-generator-client.ts`
-
-## Change log
-
-| Date | Change | Author |
-|------|--------|--------|
-| 2026-05-05 | Initial version. Manual gate, sibling of `commcare-form-patch`. | ACE team |
-```
-
-- [ ] **Step 3: Commit**
-
-```bash
-git add skills/app-multimedia-coverage/SKILL.md
-git commit -m "feat(skills): app-multimedia-coverage SKILL.md"
-```
-
----
-
-## Task 13: Smoke fixture — `CRISPR-Test-004-KMC-multimedia`
-
-**Files:**
-- Create: `test/fixtures/CRISPR-Test-004-KMC-multimedia/pdd.md`
-- Create: `test/fixtures/CRISPR-Test-004-KMC-multimedia/expected-multimedia-candidates-learn.yaml` (golden expected output for the judge)
-
-The fixture exists for two purposes: (a) demo target for live runs, (b) Nova-feature-request removal-criteria check (when Nova ships, the same PDD should produce images-attached apps without this skill).
-
-- [ ] **Step 1: Author the PDD**
-
-```markdown
-<!-- test/fixtures/CRISPR-Test-004-KMC-multimedia/pdd.md -->
----
-name: KMC Multimedia Smoke
-archetype: atomic-visit
-target_flws: African community health workers in low-resource settings
----
-
-# Kangaroo Mother Care for Small Vulnerable Newborns
-
-## Intervention
-
-Frontline workers visit mothers of small or vulnerable newborns (SVN —
-under 2.5 kg or born preterm) and teach Kangaroo Mother Care (KMC):
-continuous skin-to-skin contact, exclusive breastfeeding, and early
-recognition of warning signs. Each visit is a single in-person
-encounter with one structured assessment and several teaching points.
-
-## Learn app — module structure
-
-1. **What is KMC?**
-   - Form: instructional. Label-only fields explaining benefits,
-     positioning, duration, and indications.
-2. **How to position the baby**
-   - Form: instructional. Step-by-step visual demonstration: head and
-     neck support, skin contact, wrapping the baby securely.
-3. **Recognising danger signs**
-   - Form: instructional with a quiz. Visual cues for jaundice, apnea,
-     poor feeding, hypothermia.
-4. **Knowledge check**
-   - Form: quiz. Single-select questions on positioning, signs, etc.
-
-## Deliver app — visit structure
-
-Single registration form per visit:
-- Mother's name, age, contact
-- Baby weight at birth, gestational age, current weight
-- Direct observation: is baby positioned correctly? (yes/no with photo)
-- Triage: any danger signs present? (multi-select with visual choices)
-- Counselling delivered: which teaching points? (multi-select)
-- Follow-up date
-
-## Preferred LLOs
-
-(none — smoke fixture, runs without solicitation)
-```
-
-- [ ] **Step 2: Author the expected-judge-output YAML (golden ground truth)**
-
-```yaml
-# test/fixtures/CRISPR-Test-004-KMC-multimedia/expected-multimedia-candidates-learn.yaml
-#
-# Golden ground truth for what the LLM judge SHOULD emit on a clean run.
-# Used to spot regressions in the judge prompt over time. Form/field IDs
-# match the structure the live Nova build will produce; if Nova field
-# naming changes, regenerate this file from a known-good run.
-
-# ~8-12 candidates expected:
-# - "What is KMC" intro screen → generate=true (FLW shows mother)
-# - Positioning step labels (3-5 of them) → generate=true (FLW demonstrates)
-# - Danger sign visual cues (jaundice, apnea, hypothermia) → generate=true
-# - "Mother's name" text field → generate=false
-# - "Baby weight" numeric → generate=false
-# - "Follow-up date" → generate=false
-```
-
-(The actual ground-truth file is filled in after the first live run against this fixture; for now it ships as a placeholder describing what's expected.)
-
-- [ ] **Step 3: Commit**
-
-```bash
-git add test/fixtures/CRISPR-Test-004-KMC-multimedia/
-git commit -m "test(fixture): CRISPR-Test-004-KMC-multimedia smoke PDD"
-```
-
----
-
-## Task 14: Live smoke test against the fixture
-
-**Files:** none (this is a manual verification step)
-
-- [ ] **Step 1: Verify env is wired**
-
-```bash
-/ace:doctor 2>&1 | grep -E 'CONTENT_GENERATOR|env_file' | head -10
-```
-
-Expected: both `CONTENT_GENERATOR_*` keys reported as OK.
-
-- [ ] **Step 2: Pick (or create) a smoke opp that has Nova-built apps released**
-
-The skill needs an existing opp where Phase 1 + Phase 2 have completed. The simplest path: pick the most recent passing smoke opp from `~/.ace/` or Drive, and run against it. (Standing up a new opp from `CRISPR-Test-004` is a longer-form smoke that comes with the Nova-feature-request validation cycle; not required for first live run of this skill.)
-
-```bash
-/ace:status 2>&1 | tail -30
-```
-
-Choose an opp with `phase_2: clean` and noted `learn_app_id` + `deliver_app_id`.
-
-- [ ] **Step 3: Dry-run first**
-
-```bash
-/ace:step app-multimedia-coverage <opp-name> --dry-run
-```
-
-Expected: candidates YAML written, cost preview printed, no API calls to Content Generator, no patches.
-
-- [ ] **Step 4: Inspect the candidates YAML**
-
-Read `2-commcare/app-multimedia-coverage_candidates-learn.yaml` and `_candidates-deliver.yaml` from Drive. Sanity-check: do the `generate: true` choices look right? Are the directives reasonable?
-
-If choices look bad: iterate the judge prompt in `lib/multimedia-judge.ts` (`SYSTEM_HEAD`), regenerate the candidates with `--rejudge`, repeat until the operator is happy.
-
-- [ ] **Step 5: Live run on Learn only first**
-
-```bash
-/ace:step app-multimedia-coverage <opp-name> --app=learn --max-images=10
-```
-
-Expected: 10 images generated, form XML patched, multimedia uploaded, app re-built and re-released, verify step passes, report written.
-
-- [ ] **Step 6: Manual visual check**
-
-Open the new build in CommCare HQ's app preview or pull the CCZ and inspect:
-
-```bash
-gh attestation download ... # OR via CCHQ's preview UI
-```
-
-Confirm at least one image renders alongside the expected question.
-
-- [ ] **Step 7: Iterate**
-
-If something failed: address the failure mode, re-run, repeat. **Convergence is the goal of this task.** Each iteration should commit fixes (judge prompt tweaks, atom edge cases, XML patcher edge cases) as small focused commits.
-
-- [ ] **Step 8: Run on Deliver after Learn passes**
-
-```bash
-/ace:step app-multimedia-coverage <opp-name> --app=deliver --max-images=10
-```
-
-- [ ] **Step 9: Capture the golden run for the fixture's expected YAML**
-
-Once a live run looks right, copy the produced `app-multimedia-coverage_candidates-learn.yaml` from Drive into `test/fixtures/CRISPR-Test-004-KMC-multimedia/expected-multimedia-candidates-learn.yaml`, replacing the placeholder.
-
-```bash
-git add test/fixtures/CRISPR-Test-004-KMC-multimedia/expected-multimedia-candidates-learn.yaml
-git commit -m "test(fixture): capture golden judge output from first live run"
-```
-
----
-
-## Task 15: Update CLAUDE.md and CHANGELOG
-
-**Files:**
-- Modify: `CLAUDE.md`
-- Modify: `CHANGELOG.md`
-
-- [ ] **Step 1: Add a CLAUDE.md "Current state" bullet**
-
-After the existing post-Nova-skill paragraphs (the section that mentions `app-connect-coverage` and `commcare-form-patch`), add:
-
-```markdown
-- **`app-multimedia-coverage` — manual post-Phase-2 multimedia attach.**
-  Sibling of `commcare-form-patch`. LLM-judges each Nova-built field
-  (criterion: would the FLW use it OR show it to a client?), calls
-  Dimagi's Content Generator API for the chosen ones, patches form XML
-  with `<image>` itext entries, uploads PNGs via the new
-  `commcare_upload_multimedia` atom, and re-builds + re-releases. **Not
-  part of `/ace:run`** — invoked manually with `/ace:step
-  app-multimedia-coverage <opp>`. Spec at
-  `docs/superpowers/specs/2026-05-05-app-multimedia-coverage-design.md`;
-  delete when Nova ships first-class field-level multimedia (see the
-  removal criteria in the SKILL.md).
-```
-
-- [ ] **Step 2: Add a CHANGELOG entry under the next version**
-
-(Pick the next minor or patch version per `CLAUDE.md` § "Plugin updates".)
-
-```markdown
-## 0.13.4 — app-multimedia-coverage skill
-
-- New skill `app-multimedia-coverage` (manual gate, post-Phase 2):
-  attaches display-only images to Connect Learn/Deliver app questions
-  via Dimagi's Content Generator + post-Nova CCZ patching.
-- New CCHQ atom `commcare_upload_multimedia` to bundle binary assets
-  into the released CCZ.
-- New helpers under `lib/`: `multimedia-judge`, `content-generator-client`,
-  `multimedia-manifest`, `multimedia-prompt-hash`, `multimedia-xform-patch`.
-- New `.env.tpl` keys: `CONTENT_GENERATOR_URL`, `CONTENT_GENERATOR_API_KEY`.
-- Filed Nova feature request `voidcraft-labs/nova-plugin#<N>` for
-  field-level multimedia; this skill has explicit removal criteria.
-```
-
-- [ ] **Step 3: Bump version**
-
-```bash
-scripts/version-bump.sh
-```
-
-- [ ] **Step 4: Commit**
-
-```bash
-git add CLAUDE.md CHANGELOG.md VERSION package.json .claude-plugin/plugin.json .claude-plugin/marketplace.json
-git commit -m "docs: app-multimedia-coverage in CLAUDE.md + CHANGELOG (0.13.4)"
-```
-
----
-
-## Task 16: File the Nova feature request
-
-**Files:** none in this repo
-
-- [ ] **Step 1: Verify gh CLI auth**
-
-```bash
-gh auth status
-```
-
-If not authed for `voidcraft-labs/nova-plugin`, sort that out before proceeding.
-
-- [ ] **Step 2: File the issue**
-
-```bash
-gh issue create --repo voidcraft-labs/nova-plugin \
-  --title "Field-level multimedia (display-only) on Learn/Deliver questions" \
-  --body "$(cat <<'EOF'
-## Problem
-
-Nova has no schema for **display-only** multimedia on a question (the
-image / audio / video that the FLW sees alongside a question label).
-The existing `image` / `audio` / `video` field kinds are *input
-capture* (FLW takes a photo / records audio), not display.
-
-Standard CommCare apps render display media via:
-
-- `<image>` / `<audio>` / `<video>` itext entries in form XML, e.g.
-  `<value form="image">jr://file/commcare/image/foo.png</value>`
-- Bundled assets in the CCZ at `commcare/multimedia/<media_type>/...`
-
-There's no field-level hook in Nova's blueprint to populate either.
-
-## What ACE is doing as a workaround
-
-The new `app-multimedia-coverage` skill in
-[anthropics-friends/ace](https://github.com/...) (ACE plugin) post-builds:
-
-1. LLM-judges each field for "image-worthy" (criterion: FLW uses it
-   themselves OR shows it to a client).
-2. Calls Dimagi's Content Generator API to make the asset.
-3. Patches form XML to add `<image>` itext via existing
-   `commcare_patch_xform`.
-4. Bundles the PNG into the CCZ via a new
-   `commcare_upload_multimedia` atom.
-5. Re-builds and re-releases.
-
-This works but is a maintenance liability — every Nova rebuild loses
-the patches. We'd love to delete the skill once Nova handles this
-natively.
-
-## Proposal
-
-### Schema
-
-New optional `media` property on every field:
-
-```yaml
-fields:
-  - id: kmc_position_demo
-    kind: label
-    label: "Show the mother how to support the baby's head and neck."
-    media:
-      image_url: https://...                 # OR raw bytes via separate atom
-      alt_text: "Mother holding small newborn skin-to-skin"
-      image_directives: "warm lighting, modest clothing, African setting"
-```
-
-### Compile
-
-`compile_app` fetches each `media.image_url`, bundles into the produced
-CCZ at `commcare/multimedia/image/<form_unique_id>__<field_id>.png`,
-and writes the matching `<image>` itext entry into the form XML.
-
-### Optional v2
-
-Pluggable generator hook so authors can specify `media: { generate:
-true, directives: "..." }` and Nova produces the asset at compile time
-via Dimagi's Content Generator (or any pluggable provider).
-
-## Acceptance
-
-- `update_form` accepts `media: { image_url, ... }` on a field.
-- `compile_app` produces a CCZ with the correct `<image>` itext entry
-  AND the bundled asset present at the expected path.
-- Round-trip: `get_form` returns the `media` block as written.
-- ACE's `CRISPR-Test-004-KMC-multimedia` smoke fixture produces an
-  images-attached app without `app-multimedia-coverage` running.
-
-When this ships, ACE will delete `app-multimedia-coverage`,
-`commcare_upload_multimedia`, and the supporting `lib/multimedia-*`
-helpers.
-EOF
-)"
-```
-
-- [ ] **Step 3: Capture the issue number in CHANGELOG**
-
-```bash
-# After the issue is filed and you have its number, fix the CHANGELOG line:
-# `voidcraft-labs/nova-plugin#<N>` → `voidcraft-labs/nova-plugin#42` (or whatever)
-```
-
-```bash
-git add CHANGELOG.md
-git commit --amend --no-edit  # OR a new commit if amend is risky
-```
-
----
-
-## Task 17: Final verification & PR
-
-- [ ] **Step 1: Run the full test suite**
-
-```bash
-npm test
-```
-
-Expected: all unit + integration (where gated env is present) tests pass.
-
-- [ ] **Step 2: `/ace:doctor` clean**
-
-```bash
-/ace:doctor
-```
-
-Expected: no fail-level findings (warnings about Connect / OCS sessions are fine if not authenticated).
-
-- [ ] **Step 3: Push and open a PR**
-
-Use the `dev-utils:create-pr` skill. PR title: `feat: app-multimedia-coverage skill (0.13.4)`.
-
-PR body covers:
-- Spec link
-- Plan link
-- New atom + helpers + skill
-- Nova feature request link
-- Live smoke status (which opp / which app / how many images / verify pass)
-
----
-
-## Plan Self-Review
-
-**Spec coverage check:**
-
-| Spec section | Plan task |
-|---|---|
-| § 1 Problem statement | (covered by entire plan; no specific task needed) |
-| § 2 Goals & non-goals | Tasks 7 (judge), 9 (atom), 12 (skill orchestration) cover the in-scope goals. Out-of-scope items (audio/video, multilingual) are not in the plan — correct. |
-| § 3 Skill shape | Task 12 (SKILL.md) |
-| § 4 End-to-end pipeline (12 steps) | Task 12 (encoded into SKILL.md) |
-| § 5a LLM judge | Task 7 |
-| § 5b Content Generator client | Tasks 1 (probe), 6 (client) |
-| § 5c CCHQ atom | Tasks 2 (probe), 9 (backend), 10 (registration) |
-| § 6 Drive layout | Task 12 (SKILL.md prose); Task 5 (manifest schema) |
-| § 7 Idempotency | Tasks 4 (prompt hash), 12 (skill flow) |
-| § 8 Failure modes | Task 12 (SKILL.md table mirrors spec) |
-| § 9 Nova feature request + removal criteria | Task 16 (file) + Task 12 (removal criteria in SKILL.md) |
-| § 10 Modes | Task 12 |
-| § 11 Testing | Tasks 4–9 (unit), Task 9 (integration), Tasks 13–14 (smoke fixture + live run) |
-| § 12 Open questions | Tasks 1–2 (probes resolve them) |
-| § 13 Out of scope | (correctly absent from plan) |
-
-All spec sections accounted for.
-
-**Placeholder scan:**
-
-- All code blocks contain real code, not stubs.
-- All file paths are exact.
-- All test assertions are concrete.
-- One soft spot: Task 13 step 2 ships a comment-only `expected-multimedia-candidates-learn.yaml`, with the real golden coming from Task 14 step 9. That's intentional — the golden output requires a live run to capture — and it's documented as such.
-
-**Type consistency check:**
-
-- `JudgeOutput` — defined in Task 7, never re-shaped.
-- `MultimediaManifest` / `MultimediaImage` — defined in Task 5, used implicitly in Tasks 9, 12.
-- `ContentGeneratorClient.generateImage(...)` signature — Task 6 defines it; Task 12 SKILL.md uses the same shape.
-- `addImageItext(xml, bindings)` — Task 8 defines, Task 12 SKILL.md references.
-- `commcare_upload_multimedia` schema — Task 9 backend, Task 10 server registration both use the same field shapes (domain, app_id, media_path, file_bytes, content_type).
-
-No drift detected.
-
-**Done.**
diff --git a/docs/superpowers/plans/2026-05-08-decisions-log-pr1.md b/docs/superpowers/plans/2026-05-08-decisions-log-pr1.md
deleted file mode 100644
index a24c6ba1..00000000
--- a/docs/superpowers/plans/2026-05-08-decisions-log-pr1.md
+++ /dev/null
@@ -1,1124 +0,0 @@
-# Decisions Log — PR #1: Schema + Phase 1 Write-Side — Implementation Plan
-
-> **For agentic workers:** REQUIRED SUB-SKILL: Use superpowers:subagent-driven-development (recommended) or superpowers:executing-plans to implement this plan task-by-task. Steps use checkbox (`- [ ]`) syntax for tracking.
-
-**Goal:** Land the schema, helpers, and Phase 1 (`idea-to-pdd`) write-side of the ACE decisions log, producing a per-run `decisions.yaml` source-of-truth artifact with a calibrated set of ~14 Phase 1 rows.
-
-**Architecture:** A Zod-validated YAML at `ACE/<opp>/runs/<run-id>/decisions.yaml`, written by `idea-to-pdd` during Phase 1. Schema lib lives in `lib/decisions-schema.ts` mirroring `lib/qa-types.ts` conventions (Zod schemas + types + YAML read/write helpers). The `idea-to-pdd` skill body gains a `## Decisions Log Convention` section enumerating the calibrated row set, the bar criterion, and the write contract — replacing the existing `## Open Questions Convention`. No renderer, no round-trip, no Phase 2–9 writes — those are PRs #2–4.
-
-**Tech Stack:** TypeScript + Zod (matches `lib/qa-types.ts`), js-yaml for YAML (already in use), vitest for unit tests, Google Drive MCP atoms for runtime YAML write (`drive_create_file` with `findOrCreate` semantics — already in use across ACE skills). No new dependencies.
-
-**Pre-flight:** This PR's spec ([`docs/superpowers/specs/2026-05-08-decisions-log-design.md`](../specs/2026-05-08-decisions-log-design.md)) recommends landing this PR after `idea-to-pdd`'s QA/Eval migration (PR #147 Phase 1 PR #1). If that migration has shipped at land time, also extend `idea-to-pdd-qa` with structural checks against `decisions.yaml` (called out as Task 9 below — skip if pre-migration, follow up if post-migration).
-
----
-
-## Spec coverage map
-
-| Spec section | Covered by |
-|---|---|
-| Schema (Zod + YAML) | Tasks 1–3 |
-| Source of truth at `ACE/<opp>/runs/<run-id>/decisions.yaml` | Task 4 (skill body) |
-| Bar criterion + scope | Task 4 (skill body) |
-| Phase 1 calibration row set (~10–15 rows) | Task 4 (skill body) + Task 5 (fixture) |
-| Phase 1 interaction (default + review modes) | Task 4 (skill body Mode Behavior) |
-| Phase Write-Back Contract clause | Task 6 |
-| Gate brief integration (`Decisions Log:` line) | Task 7 |
-| Migration: retire `open-questions.md` | Task 4 (replaces `## Open Questions Convention`) |
-| Coordination — `idea-to-pdd-qa` structural checks | Task 9 (conditional) |
-| Renderer | OUT OF SCOPE — PR #2 |
-| Phase 2–9 writes | OUT OF SCOPE — PR #3 |
-| Round-trip sync | OUT OF SCOPE — PR #4 |
-
----
-
-## File structure
-
-**Create:**
-- `lib/decisions-schema.ts` — Zod schemas (`DecisionRowSchema`, `DecisionsLogSchema`), exported types, YAML helpers (`parseDecisionsYaml`, `serializeDecisionsLog`).
-- `test/lib/decisions-schema.test.ts` — unit tests for schema + helpers.
-- `test/skills/idea-to-pdd/fixtures/turmeric-decisions.yaml` — hand-authored calibration fixture showing the expected Phase 1 row set for a turmeric-shaped input.
-- `test/skills/idea-to-pdd/decisions-fixture.test.ts` — snapshot test asserting the fixture parses and contains all calibration row IDs.
-
-**Modify:**
-- `skills/idea-to-pdd/SKILL.md` — add `## Decisions Log Convention` section, update `## Process` step list, update `## Outputs`, update `## Mode Behavior`, retire `## Open Questions Convention`. Update `## Change Log`.
-- `skills/idea-to-pdd/SKILL.md` (Gate Brief section) — replace `Open Questions Doc:` line with `Decisions Log:` line.
-- `agents/ace-orchestrator.md` — add a clause to `## Phase Write-Back Contract` requiring `decisions.yaml` rows.
-- `VERSION` — bumped via `scripts/version-bump.sh` at the end.
-- `package.json`, `.claude-plugin/plugin.json`, `.claude-plugin/marketplace.json` — auto-synced by the version-bump pre-commit hook.
-
-**No deletes.** The current `open-questions.md` artifact path stops being created by `idea-to-pdd`, but no existing files are removed (existing runs' open-questions docs in Drive stay where they are).
-
----
-
-## Tasks
-
-### Task 1: Define the Zod schema and types
-
-**Files:**
-- Create: `lib/decisions-schema.ts`
-- Test: `test/lib/decisions-schema.test.ts`
-
-- [ ] **Step 1: Write the failing test.**
-
-Create `test/lib/decisions-schema.test.ts`:
-
-```ts
-import { describe, expect, it } from "vitest";
-import { DecisionRowSchema, DecisionsLogSchema } from "../../lib/decisions-schema.js";
-
-describe("DecisionRowSchema", () => {
-  it("accepts a minimal valid row", () => {
-    const row = {
-      id: "flw-count",
-      phase: "1-design",
-      skill: "idea-to-pdd",
-      question: "How many FLWs should the program target?",
-      default: "5–8",
-      options_considered: ["3–5", "10–15", "20+"],
-      source: "idea.md §2; atomic-visit archetype norm",
-      status: "applied",
-    };
-    expect(() => DecisionRowSchema.parse(row)).not.toThrow();
-  });
-
-  it("rejects an empty id", () => {
-    const row = {
-      id: "",
-      phase: "1-design",
-      skill: "idea-to-pdd",
-      question: "Q?",
-      default: "x",
-      options_considered: [],
-      source: "x",
-      status: "applied",
-    };
-    expect(() => DecisionRowSchema.parse(row)).toThrow();
-  });
-
-  it("rejects an invalid status enum value", () => {
-    const row = {
-      id: "flw-count",
-      phase: "1-design",
-      skill: "idea-to-pdd",
-      question: "Q?",
-      default: "x",
-      options_considered: [],
-      source: "x",
-      status: "resolved",  // not in v1 enum
-    };
-    expect(() => DecisionRowSchema.parse(row)).toThrow();
-  });
-
-  it("rejects a non-string default", () => {
-    const row = {
-      id: "flw-count",
-      phase: "1-design",
-      skill: "idea-to-pdd",
-      question: "Q?",
-      default: 5,  // must be string
-      options_considered: [],
-      source: "x",
-      status: "applied",
-    };
-    expect(() => DecisionRowSchema.parse(row)).toThrow();
-  });
-
-  it("accepts an optional notes field", () => {
-    const row = {
-      id: "flw-count",
-      phase: "1-design",
-      skill: "idea-to-pdd",
-      question: "Q?",
-      default: "x",
-      options_considered: [],
-      source: "x",
-      status: "applied",
-      notes: "Atomic-visit norm.",
-    };
-    expect(() => DecisionRowSchema.parse(row)).not.toThrow();
-  });
-});
-
-describe("DecisionsLogSchema", () => {
-  it("accepts a minimal valid log", () => {
-    const log = {
-      schema_version: 1,
-      opportunity: "turmeric",
-      run_id: "20260507-1733",
-      generated_at: "2026-05-07T17:33:00Z",
-      decisions: [],
-    };
-    expect(() => DecisionsLogSchema.parse(log)).not.toThrow();
-  });
-
-  it("rejects schema_version other than 1", () => {
-    const log = {
-      schema_version: 2,
-      opportunity: "turmeric",
-      run_id: "20260507-1733",
-      generated_at: "2026-05-07T17:33:00Z",
-      decisions: [],
-    };
-    expect(() => DecisionsLogSchema.parse(log)).toThrow();
-  });
-
-  it("rejects duplicate decision IDs", () => {
-    const log = {
-      schema_version: 1,
-      opportunity: "turmeric",
-      run_id: "20260507-1733",
-      generated_at: "2026-05-07T17:33:00Z",
-      decisions: [
-        {
-          id: "flw-count",
-          phase: "1-design",
-          skill: "idea-to-pdd",
-          question: "Q?",
-          default: "5–8",
-          options_considered: [],
-          source: "x",
-          status: "applied",
-        },
-        {
-          id: "flw-count",  // duplicate
-          phase: "1-design",
-          skill: "idea-to-pdd",
-          question: "Q?",
-          default: "5–8",
-          options_considered: [],
-          source: "x",
-          status: "applied",
-        },
-      ],
-    };
-    expect(() => DecisionsLogSchema.parse(log)).toThrow(/duplicate/i);
-  });
-});
-```
-
-- [ ] **Step 2: Run the test to verify it fails.**
-
-Run: `npx vitest run test/lib/decisions-schema.test.ts`
-Expected: FAIL with `Cannot find module '../../lib/decisions-schema.js'` (the schema file doesn't exist yet).
-
-- [ ] **Step 3: Implement the schema.**
-
-Create `lib/decisions-schema.ts`:
-
-```ts
-import { z } from "zod";
-
-/**
- * One row in a per-run decisions log. Represents a load-bearing default
- * an ACE phase applied (or a load-bearing decision the AI flagged for
- * human attention while still proceeding with a default).
- *
- * See docs/superpowers/specs/2026-05-08-decisions-log-design.md § Schema
- * for field semantics and the bar criterion that gates row creation.
- */
-export const DecisionRowSchema = z.object({
-  id: z.string().min(1).regex(/^[a-z0-9][a-z0-9-]*$/, {
-    message: "id must be kebab-case (lowercase, digits, hyphens; cannot start with hyphen)",
-  }),
-  phase: z.string().regex(/^[1-9][0-9]*-[a-z][a-z-]*$/, {
-    message: "phase must match <N>-<name> (e.g. 1-design, 2-commcare)",
-  }),
-  skill: z.string().min(1),
-  question: z.string().min(1),
-  default: z.string().min(1),
-  options_considered: z.array(z.string()),
-  source: z.string().min(1),
-  status: z.enum(["applied", "overridden", "open"]),
-  notes: z.string().optional(),
-});
-
-export type DecisionRow = z.infer<typeof DecisionRowSchema>;
-
-/**
- * The full per-run log file shape. Stored at
- * ACE/<opp>/runs/<run-id>/decisions.yaml.
- */
-export const DecisionsLogSchema = z
-  .object({
-    schema_version: z.literal(1),
-    opportunity: z.string().min(1),
-    run_id: z.string().min(1),
-    generated_at: z.string().datetime({ offset: true }),
-    decisions: z.array(DecisionRowSchema),
-  })
-  .superRefine((log, ctx) => {
-    const seen = new Set<string>();
-    for (const [index, row] of log.decisions.entries()) {
-      if (seen.has(row.id)) {
-        ctx.addIssue({
-          code: z.ZodIssueCode.custom,
-          message: `duplicate decision id: ${row.id}`,
-          path: ["decisions", index, "id"],
-        });
-      }
-      seen.add(row.id);
-    }
-  });
-
-export type DecisionsLog = z.infer<typeof DecisionsLogSchema>;
-```
-
-- [ ] **Step 4: Run the test to verify it passes.**
-
-Run: `npx vitest run test/lib/decisions-schema.test.ts`
-Expected: PASS — all 8 assertions green.
-
-- [ ] **Step 5: Commit.**
-
-```bash
-git add lib/decisions-schema.ts test/lib/decisions-schema.test.ts
-git commit -m "lib: add Zod schema for decisions log
-
-DecisionRow and DecisionsLog schemas with kebab-case id validation,
-phase-name pattern, status enum (applied|overridden|open), and a
-duplicate-id check via superRefine. Mirrors lib/qa-types.ts pattern.
-
-Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>"
-```
-
----
-
-### Task 2: Add YAML read/write helpers
-
-**Files:**
-- Modify: `lib/decisions-schema.ts`
-- Modify: `test/lib/decisions-schema.test.ts`
-
-- [ ] **Step 1: Write the failing test.**
-
-Append to `test/lib/decisions-schema.test.ts`:
-
-```ts
-import { parseDecisionsYaml, serializeDecisionsLog } from "../../lib/decisions-schema.js";
-
-describe("parseDecisionsYaml", () => {
-  it("parses a valid YAML string into a DecisionsLog", () => {
-    const yaml = `
-schema_version: 1
-opportunity: turmeric
-run_id: 20260507-1733
-generated_at: "2026-05-07T17:33:00Z"
-decisions:
-  - id: flw-count
-    phase: 1-design
-    skill: idea-to-pdd
-    question: How many FLWs?
-    default: "5–8"
-    options_considered: ["3–5", "10–15"]
-    source: idea.md §2
-    status: applied
-`;
-    const log = parseDecisionsYaml(yaml);
-    expect(log.opportunity).toBe("turmeric");
-    expect(log.decisions).toHaveLength(1);
-    expect(log.decisions[0]!.id).toBe("flw-count");
-  });
-
-  it("throws a typed error on schema violation", () => {
-    const yaml = `
-schema_version: 1
-opportunity: turmeric
-run_id: 20260507-1733
-generated_at: "2026-05-07T17:33:00Z"
-decisions:
-  - id: ""  # empty id violates schema
-    phase: 1-design
-    skill: idea-to-pdd
-    question: Q?
-    default: x
-    options_considered: []
-    source: x
-    status: applied
-`;
-    expect(() => parseDecisionsYaml(yaml)).toThrow(/decisions\.0\.id/);
-  });
-
-  it("throws on unparseable YAML", () => {
-    expect(() => parseDecisionsYaml("not: : valid: yaml")).toThrow();
-  });
-});
-
-describe("serializeDecisionsLog", () => {
-  it("round-trips through parse with no data loss", () => {
-    const log = {
-      schema_version: 1 as const,
-      opportunity: "turmeric",
-      run_id: "20260507-1733",
-      generated_at: "2026-05-07T17:33:00Z",
-      decisions: [
-        {
-          id: "flw-count",
-          phase: "1-design",
-          skill: "idea-to-pdd",
-          question: "How many FLWs?",
-          default: "5–8",
-          options_considered: ["3–5", "10–15"],
-          source: "idea.md §2",
-          status: "applied" as const,
-        },
-      ],
-    };
-    const yaml = serializeDecisionsLog(log);
-    const parsed = parseDecisionsYaml(yaml);
-    expect(parsed).toEqual(log);
-  });
-
-  it("preserves non-ASCII characters (em dashes, en dashes)", () => {
-    const log = {
-      schema_version: 1 as const,
-      opportunity: "turmeric",
-      run_id: "20260507-1733",
-      generated_at: "2026-05-07T17:33:00Z",
-      decisions: [
-        {
-          id: "ai-photo-threshold",
-          phase: "1-design",
-          skill: "idea-to-pdd",
-          question: "AI auto-accept confidence threshold?",
-          default: "≥90%",
-          options_considered: ["≥85%", "≥95%"],
-          source: "stress-test verifiability dimension",
-          status: "applied" as const,
-        },
-      ],
-    };
-    const yaml = serializeDecisionsLog(log);
-    expect(yaml).toContain("≥90%");
-    const parsed = parseDecisionsYaml(yaml);
-    expect(parsed.decisions[0]!.default).toBe("≥90%");
-  });
-});
-```
-
-- [ ] **Step 2: Run the test to verify it fails.**
-
-Run: `npx vitest run test/lib/decisions-schema.test.ts`
-Expected: FAIL with `parseDecisionsYaml is not a function` (or import resolution error).
-
-- [ ] **Step 3: Implement the helpers.**
-
-Append to `lib/decisions-schema.ts`:
-
-```ts
-import yaml from "js-yaml";
-
-/**
- * Parse a YAML string into a validated DecisionsLog.
- * Throws ZodError with the offending path if validation fails.
- * Throws YAMLException if the YAML itself is unparseable.
- */
-export function parseDecisionsYaml(input: string): DecisionsLog {
-  const raw = yaml.load(input);
-  return DecisionsLogSchema.parse(raw);
-}
-
-/**
- * Serialize a DecisionsLog into a YAML string suitable for writing to
- * ACE/<opp>/runs/<run-id>/decisions.yaml. Uses lineWidth: -1 so long
- * strings (notes paragraphs) don't get auto-folded — block-scalar
- * folding makes the file harder to diff and harder for humans to read.
- */
-export function serializeDecisionsLog(log: DecisionsLog): string {
-  // Validate before emitting — catches caller errors before we write.
-  DecisionsLogSchema.parse(log);
-  return yaml.dump(log, {
-    lineWidth: -1,
-    noRefs: true,
-    sortKeys: false,
-  });
-}
-```
-
-- [ ] **Step 4: Run the test to verify it passes.**
-
-Run: `npx vitest run test/lib/decisions-schema.test.ts`
-Expected: PASS — all assertions green (now 13 total).
-
-- [ ] **Step 5: Commit.**
-
-```bash
-git add lib/decisions-schema.ts test/lib/decisions-schema.test.ts
-git commit -m "lib: add YAML read/write helpers for decisions log
-
-parseDecisionsYaml + serializeDecisionsLog round-trip through
-js-yaml with lineWidth: -1 so notes paragraphs stay readable in
-diffs. Validates on both parse and serialize.
-
-Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>"
-```
-
----
-
-### Task 3: Author the calibration fixture
-
-The fixture documents what `idea-to-pdd` should produce for a turmeric-shaped input. It serves as the row-set ground truth for Task 4's skill update and as a snapshot test target.
-
-**Files:**
-- Create: `test/skills/idea-to-pdd/fixtures/turmeric-decisions.yaml`
-- Create: `test/skills/idea-to-pdd/decisions-fixture.test.ts`
-
-- [ ] **Step 1: Write the fixture.**
-
-Create `test/skills/idea-to-pdd/fixtures/turmeric-decisions.yaml`:
-
-```yaml
-schema_version: 1
-opportunity: turmeric
-run_id: 20260507-1733
-generated_at: "2026-05-07T17:33:00Z"
-
-decisions:
-  - id: archetype-selection
-    phase: 1-design
-    skill: idea-to-pdd
-    question: Which delivery archetype best fits the intervention?
-    default: atomic-visit
-    options_considered: ["atomic-visit", "focus-group", "multi-stage"]
-    source: idea.md §1; one-FLW-one-delivery pattern
-    status: applied
-    notes: Single per-FLW visit producing one structured delivery.
-
-  - id: flw-count
-    phase: 1-design
-    skill: idea-to-pdd
-    question: How many FLWs should the program target?
-    default: "5–8"
-    options_considered: ["3–5", "5–8", "10–15", "20+"]
-    source: idea.md §2; atomic-visit archetype norm at this geographic scope
-    status: applied
-
-  - id: budget-plausibility
-    phase: 1-design
-    skill: idea-to-pdd
-    question: Is the stated budget plausible for the implied labor + AI infra?
-    default: plausible
-    options_considered: ["plausible", "too-low", "too-high"]
-    source: idea-to-pdd-eval `resource_realism` dimension (PR #144)
-    status: applied
-    notes: |
-      $1,500 / 8 FLWs / 30 visits ≈ $6.25/visit gross — covers stated $3 payment
-      with margin for LLO ops + AI infra at recruitment-realistic rates.
-
-  - id: payment-rate
-    phase: 1-design
-    skill: idea-to-pdd
-    question: Per-visit payment rate to FLW?
-    default: "$3.00"
-    options_considered: ["$2.00", "$3.00", "$5.00"]
-    source: idea.md §3; payment-rate convention for atomic-visit market surveys
-    status: applied
-
-  - id: pilot-sample-size
-    phase: 1-design
-    skill: idea-to-pdd
-    question: Pilot sample size for AI calibration before full rollout?
-    default: "30 photos"
-    options_considered: ["20 photos", "30 photos", "50 photos", "100 photos"]
-    source: stress-test verifiability dimension; calibration-set norm
-    status: applied
-
-  - id: ai-photo-threshold
-    phase: 1-design
-    skill: idea-to-pdd
-    question: AI auto-accept confidence threshold for photo verification?
-    default: "≥90%"
-    options_considered: ["≥85%", "≥90%", "≥95%"]
-    source: idea-to-pdd-eval `verifiability` rubric; Layer-B AI-check norm
-    status: applied
-
-  - id: ai-fallback-design
-    phase: 1-design
-    skill: idea-to-pdd
-    question: Fallback for AI auto-reject — true validation harness or parallel sampling?
-    default: parallel-sampling-N-percent
-    options_considered:
-      - "parallel-sampling-N-percent"
-      - "stratified-validation-of-AI-output"
-      - "no-fallback"
-    source: idea-to-pdd-eval `fallback_validates_primary` dimension (PR #144)
-    status: open
-    notes: |
-      Default is parallel sampling (N% human review of all submissions, independent
-      of AI's classification). NOT a true validation harness — it samples a different
-      population than the AI saw, so it doesn't validate per-decision accuracy.
-      Flagged in gate brief; human edit recommended if ground-truth metrics matter.
-
-  - id: named-downstream-consumer
-    phase: 1-design
-    skill: idea-to-pdd
-    question: Is there a named downstream consumer with pre-committed action?
-    default: none-named-proceed-with-caveat
-    options_considered:
-      - "named-consumer-with-MOU"
-      - "named-consumer-no-MOU"
-      - "none-named-proceed-with-caveat"
-      - "none-named-halt"
-    source: idea-to-pdd-eval `demand_reality` dimension (PR #144)
-    status: open
-    notes: |
-      No consumer named in idea.md. Proceeding with default; flag in gate brief.
-      Human edit recommended before Phase 7 solicitation publishes.
-
-  - id: primary-metric-vs-goal
-    phase: 1-design
-    skill: idea-to-pdd
-    question: Primary success metric — direct goal measurement or upstream proxy?
-    default: proxy-of-goal
-    options_considered: ["direct-goal", "proxy-of-goal", "tbd-during-pilot"]
-    source: idea-to-pdd-eval `mission_alignment` dimension (PR #144)
-    status: applied
-    notes: Photo-quality pass rate is a proxy for "AI replaces human verification"; not the goal itself.
-
-  - id: working-language
-    phase: 1-design
-    skill: idea-to-pdd
-    question: Working language(s) for Learn + Deliver apps?
-    default: "English only"
-    options_considered: ["English only", "English + 1 local", "Multilingual (3+)"]
-    source: idea.md does not specify; LLO directory shows English-fluent candidates
-    status: applied
-
-  - id: verification-layers
-    phase: 1-design
-    skill: idea-to-pdd
-    question: Which evidence-model layers are in scope?
-    default: "A + B"
-    options_considered: ["A only", "A + B", "A + B + C"]
-    source: pdd-template.md `## Evidence Model` section
-    status: applied
-    notes: Layer A self-report + Layer B AI photo-verification. Layer C (independent audit) deferred to Phase 8.
-
-  - id: solicitation-type
-    phase: 1-design
-    skill: idea-to-pdd
-    question: Solicitation type for Phase 7 publication?
-    default: EOI
-    options_considered: ["EOI", "RFP", "custom"]
-    source: skills/idea-to-pdd `## Solicitation` section default
-    status: applied
-
-  - id: solicitation-deadline
-    phase: 1-design
-    skill: idea-to-pdd
-    question: Solicitation response deadline (days from publish)?
-    default: "14 days"
-    options_considered: ["7 days", "14 days", "21 days", "30 days"]
-    source: skills/idea-to-pdd `## Solicitation` section default
-    status: applied
-
-  - id: candidate-llo-roster
-    phase: 1-design
-    skill: idea-to-pdd
-    question: Are PDD-named candidate LLOs in scope for direct invitation?
-    default: any-LLO-via-public-solicitation
-    options_considered:
-      - "named-candidates-direct-invite"
-      - "named-candidates-plus-public-solicitation"
-      - "any-LLO-via-public-solicitation"
-    source: idea.md §LLO Preference; LLO directory
-    status: applied
-    notes: idea.md does not name specific candidates; defaulting to public solicitation.
-```
-
-- [ ] **Step 2: Write the failing snapshot test.**
-
-Create `test/skills/idea-to-pdd/decisions-fixture.test.ts`:
-
-```ts
-import { readFileSync } from "node:fs";
-import { resolve } from "node:path";
-import { describe, expect, it } from "vitest";
-import { parseDecisionsYaml } from "../../../lib/decisions-schema.js";
-
-const FIXTURE = resolve(
-  __dirname,
-  "fixtures/turmeric-decisions.yaml",
-);
-
-describe("turmeric calibration fixture", () => {
-  const yaml = readFileSync(FIXTURE, "utf-8");
-  const log = parseDecisionsYaml(yaml);
-
-  it("parses cleanly against DecisionsLogSchema", () => {
-    expect(log.schema_version).toBe(1);
-    expect(log.opportunity).toBe("turmeric");
-  });
-
-  it("contains the 14 calibrated Phase 1 row IDs", () => {
-    const ids = log.decisions.map((d) => d.id).sort();
-    expect(ids).toEqual([
-      "ai-fallback-design",
-      "ai-photo-threshold",
-      "archetype-selection",
-      "budget-plausibility",
-      "candidate-llo-roster",
-      "flw-count",
-      "named-downstream-consumer",
-      "payment-rate",
-      "pilot-sample-size",
-      "primary-metric-vs-goal",
-      "solicitation-deadline",
-      "solicitation-type",
-      "verification-layers",
-      "working-language",
-    ]);
-  });
-
-  it("scopes every row to phase 1-design and skill idea-to-pdd", () => {
-    for (const row of log.decisions) {
-      expect(row.phase).toBe("1-design");
-      expect(row.skill).toBe("idea-to-pdd");
-    }
-  });
-
-  it("flags the two known load-bearing-but-unresolved rows as status: open", () => {
-    const open = log.decisions.filter((d) => d.status === "open").map((d) => d.id);
-    expect(open.sort()).toEqual(["ai-fallback-design", "named-downstream-consumer"]);
-  });
-});
-```
-
-- [ ] **Step 3: Run the test.**
-
-Run: `npx vitest run test/skills/idea-to-pdd/decisions-fixture.test.ts`
-Expected: PASS — fixture validates and matches the 14-row calibration set.
-
-- [ ] **Step 4: Commit.**
-
-```bash
-git add test/skills/idea-to-pdd/fixtures/turmeric-decisions.yaml \
-        test/skills/idea-to-pdd/decisions-fixture.test.ts
-git commit -m "test: turmeric calibration fixture for idea-to-pdd decisions
-
-14-row Phase 1 calibration set covering all four viability axes from
-PR #144 (demand_reality, resource_realism, mission_alignment,
-fallback_validates_primary) plus archetype, FLW count, payment rate,
-pilot size, AI threshold, language, evidence layers, solicitation
-defaults, and candidate-LLO roster. Two rows flagged status: open
-(named consumer absent; AI fallback is parallel sampling not validation).
-
-Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>"
-```
-
----
-
-### Task 4: Update `skills/idea-to-pdd/SKILL.md`
-
-This is the load-bearing change of PR #1: add the `## Decisions Log Convention` section, wire the write step into `## Process`, retire the old `## Open Questions Convention`, update outputs, gate brief, mode behavior, and change log.
-
-**Files:**
-- Modify: `skills/idea-to-pdd/SKILL.md`
-
-- [ ] **Step 1: Update the `## Outputs` table.**
-
-Find the existing `## Outputs` block (around lines 22–25):
-
-```markdown
-## Outputs
-
-- `1-design/idea-to-pdd.md` — the PDD
-- `1-design/idea-to-pdd_gate-brief.md` — gate brief consumed at the Phase 1 → 2 review pause
-- `ACE/<opp-name>/open-questions.md` (Google Doc, optional) — when stress-test rubric surfaces unresolved questions
-```
-
-Replace with:
-
-```markdown
-## Outputs
-
-- `1-design/idea-to-pdd.md` — the PDD
-- `1-design/idea-to-pdd_gate-brief.md` — gate brief consumed at the Phase 1 → 2 review pause
-- `ACE/<opp-name>/runs/<run-id>/decisions.yaml` — structured per-run decisions log (always emitted; see `## Decisions Log Convention` below)
-```
-
-- [ ] **Step 2: Update `## Process` to write `decisions.yaml` before drafting the PDD.**
-
-Find the existing step 4 ("Draft the PDD with the **base sections** below…") and step 5 ("Self-evaluate (LLM-as-Judge)…").
-
-Insert a new step 3a immediately after the existing step 3 ("Research and expand"):
-
-```markdown
-3a. **Author the decisions log.** Before drafting the PDD, populate
-    `ACE/<opp-name>/runs/<run-id>/decisions.yaml` with the Phase 1 row
-    set defined in `## Decisions Log Convention` below. Each row records
-    a load-bearing default the skill is about to apply when drafting the
-    PDD. Use the AI's best inference from the source material for each
-    `default` value; mark `status: open` for any default the AI flags
-    for human attention while still proceeding.
-
-    The skill MUST emit a complete decisions.yaml even when source
-    material answers most questions explicitly — every load-bearing row
-    in the calibration set appears, with `status: applied` and the
-    source-material citation.
-```
-
-Update step 4 (PDD draft) to reference the decisions.yaml as input:
-
-Find step 4's opening:
-
-```markdown
-4. **Draft the PDD** with the **base sections** below, plus **archetype-specific additions** from `## Archetypes`:
-```
-
-Replace with:
-
-```markdown
-4. **Draft the PDD** with the **base sections** below, plus **archetype-specific additions** from `## Archetypes`. Use the values selected in step 3a's `decisions.yaml` as authoritative — every numeric or named-entity in the PDD body should match the corresponding row's `default`. If a re-run reads a `decisions.yaml` from a prior run with `status: overridden` rows (human edited via the renderer + sync skills landing in PRs #2–#4), use those overridden values instead.
-```
-
-- [ ] **Step 3: Add the `## Decisions Log Convention` section.**
-
-Find the existing `## Open Questions Convention` section (starts around line 211 with `## Open Questions Convention`).
-
-Replace the entire `## Open Questions Convention` section (from the heading down to the next top-level `## ` heading — should be `## Archetypes`) with:
-
-```markdown
-## Decisions Log Convention
-
-Every Phase 1 run emits `ACE/<opp-name>/runs/<run-id>/decisions.yaml`
-with a calibrated set of load-bearing default-decisions the skill applied
-while drafting the PDD. The log is the per-run audit trail and the
-human-iteration surface — humans edit it (via the renderer + sync skills
-landing in PRs #2–#4) to redirect a subsequent run's PDD draft.
-
-### Bar criterion — what counts as a row
-
-Two filters, both must be true:
-
-1. **Load-bearing.** A reasonable person could pick differently AND it
-   materially shapes downstream phases or eval scores.
-2. **Maps to a known surface.** The default ties to one of: an
-   `*-eval` rubric dimension, an `*-qa` structural check, a Phase
-   Write-Back field that downstream phases read, or a numeric / named
-   entity surfaced in the PDD body.
-
-Form-field-level choices, Connect program slugs, email copy, font sizes
-— below the bar.
-
-### Required Phase 1 row set
-
-Every Phase 1 run MUST emit at least the rows below (calibrated
-2026-05-08; ground truth fixture at
-`test/skills/idea-to-pdd/fixtures/turmeric-decisions.yaml`). The
-calibration set aligns with the four viability dimensions from PR #144
-(`demand_reality`, `resource_realism`, `mission_alignment`,
-`fallback_validates_primary`) plus the existing structural / archetype
-dimensions.
-
-| ID | Question | Map to surface |
-|---|---|---|
-| `archetype-selection` | Which delivery archetype best fits? | `archetype_coherence` eval dimension |
-| `flw-count` | How many FLWs? | PDD `FLW Requirements` numeric |
-| `budget-plausibility` | Is the budget plausible for implied labor + AI infra? | `resource_realism` (PR #144) |
-| `payment-rate` | Per-visit payment rate to FLW? | PDD `FLW Requirements` numeric |
-| `pilot-sample-size` | Pilot sample size for AI calibration? | `verifiability` rubric |
-| `ai-photo-threshold` | AI auto-accept confidence threshold? | `verifiability` rubric |
-| `ai-fallback-design` | True validation harness or parallel sampling? | `fallback_validates_primary` (PR #144) |
-| `named-downstream-consumer` | Pre-committed downstream consumer? | `demand_reality` (PR #144) |
-| `primary-metric-vs-goal` | Direct goal vs upstream proxy? | `mission_alignment` (PR #144) |
-| `working-language` | Working language(s)? | PDD `Learn App Specification` named entity |
-| `verification-layers` | Which evidence-model layers in scope? | PDD `Evidence Model` section |
-| `solicitation-type` | Solicitation type (EOI/RFP/custom)? | PDD `Solicitation` section default |
-| `solicitation-deadline` | Solicitation deadline? | PDD `Solicitation` section default |
-| `candidate-llo-roster` | Named candidates or public-only? | `LLO Preference` named entity |
-
-Skill body MAY add extra rows beyond this set when source material
-surfaces additional load-bearing defaults; SHOULD NOT skip any row in
-the required set. If a row is genuinely irrelevant for an opp (rare),
-emit it with `status: applied` and a `notes` line explaining why the
-default is structural rather than a real choice.
-
-### Schema and write semantics
-
-Schema is defined in `lib/decisions-schema.ts` (`DecisionsLogSchema`).
-Required fields per row: `id`, `phase` (always `1-design` for this skill),
-`skill` (always `idea-to-pdd`), `question`, `default`, `options_considered`,
-`source`, `status`. Optional `notes`.
-
-`status` values:
-- `applied` — default in use; the AI's best inference from source material.
-- `overridden` — human edited via renderer + sync skills (PRs #2–#4); not produced directly by this skill.
-- `open` — load-bearing, the AI proceeded with a default but flags for human attention. Surfaces as `[WARN]` in the gate brief's `Auto-Surfaced Concerns`.
-
-Write via `drive_create_file` (find-or-update semantics) at
-`ACE/<opp-name>/runs/<run-id>/decisions.yaml`. The Drive MCP's parent
-folder is the run-folder file ID resolved at run start.
-
-### Status: `open` policy
-
-A row is marked `status: open` when a load-bearing default exists but the
-AI judges it likely-wrong without human confirmation. Examples:
-
-- `named-downstream-consumer` is `none-named-proceed-with-caveat` AND
-  the opp will publish a public solicitation in Phase 7.
-- `ai-fallback-design` is `parallel-sampling-N-percent` AND the program
-  needs ground-truth per-decision accuracy.
-
-The AI proceeds with the default in either mode; review-mode pauses for
-edit, default-mode ships the gate brief with `[WARN]` entries.
-```
-
-- [ ] **Step 4: Update the `## Gate Brief` section.**
-
-Find the existing `## Gate Brief` section's bullet `- **Open Questions Doc:**`:
-
-```markdown
-- **Open Questions Doc:** if the skill produced an Open Questions doc
-  (see `## Open Questions Convention` below), include its full Drive
-  URL on its own line at the top of the gate brief, prefixed
-  `Open Questions: <url>`. If no Open Questions doc was needed, omit
-  this line entirely.
-```
-
-Replace with:
-
-```markdown
-- **Decisions Log:** the skill always emits `decisions.yaml`. Include
-  its full Drive URL on its own line at the top of the gate brief,
-  prefixed `Decisions Log: <url>`. (The renderer skill landing in
-  PR #2 will also produce a human-readable gdoc rendering at one stable
-  URL per run; until that lands, link the YAML directly.)
-```
-
-Find the `Auto-Surfaced Concerns` bullet and append:
-
-```markdown
-- **Open-status decisions:** every row in `decisions.yaml` with
-  `status: open` produces a `[WARN]` entry naming the row's `id` and
-  one-line `notes`. Example: `[WARN] named-downstream-consumer — no
-  consumer named in idea.md; flag for human edit before Phase 7.`
-```
-
-- [ ] **Step 5: Update `## Mode Behavior`.**
-
-Find:
-
-```markdown
-## Mode Behavior
-- **Auto:** Write PDD, email summary to admin group, proceed
-- **Review:** Write PDD, present for human review, wait for approval
-```
-
-Replace with:
-
-```markdown
-## Mode Behavior
-
-- **Default (auto):** Author `decisions.yaml` (step 3a), draft PDD using
-  those defaults, write PDD + gate brief, email summary to admin group,
-  proceed. The decisions.yaml ships with the run; humans review post-hoc
-  and re-run via `/ace:step idea-to-pdd <opp>/<run-id>` after editing if
-  they want a different PDD.
-- **Review:** Author `decisions.yaml` (step 3a), then **pause** before
-  drafting the PDD. Emit an interim gate brief stating "Decisions log
-  written; edit any defaults you want changed, then resume." On resume,
-  re-read `decisions.yaml` and draft the PDD using the (possibly edited)
-  values. Continue to PDD-final gate brief as today.
-```
-
-- [ ] **Step 6: Update the change log.**
-
-Find the `## Change Log` table and append:
-
-```markdown
-| 2026-05-08 | Replace `## Open Questions Convention` with `## Decisions Log Convention`. Skill always emits `decisions.yaml` with the 14-row calibrated Phase 1 set covering archetype, FLW count, budget plausibility, payment rate, pilot size, AI threshold, AI fallback design, named consumer, primary-metric-vs-goal, language, evidence layers, solicitation defaults, candidate roster. Schema defined in `lib/decisions-schema.ts`; ground-truth fixture in `test/skills/idea-to-pdd/fixtures/turmeric-decisions.yaml`. Renderer + round-trip ship in PRs #2–#4. | ACE team |
-```
-
-- [ ] **Step 7: Verify the file parses cleanly.**
-
-Run: `cat skills/idea-to-pdd/SKILL.md | head -1`
-Expected: `---` (frontmatter intact).
-
-Run: `grep -c "^## " skills/idea-to-pdd/SKILL.md`
-Expected: a positive integer matching the section count after edits (no orphan headings).
-
-Sanity-grep for stale references:
-
-Run: `grep -n "open-questions\|Open Questions" skills/idea-to-pdd/SKILL.md`
-Expected: zero matches (or only matches inside the change-log line that *describes* the retirement).
-
-- [ ] **Step 8: Commit.**
-
-```bash
-git add skills/idea-to-pdd/SKILL.md
-git commit -m "skill(idea-to-pdd): emit decisions.yaml with 14-row Phase 1 calibration
-
-Replaces the optional open-questions.md (rubric-gated) with always-on
-decisions.yaml at ACE/<opp>/runs/<run-id>/decisions.yaml. Phase 1 row
-set spans archetype, FLW count, budget, payment rate, pilot size, AI
-threshold, AI fallback, named consumer, primary metric, language,
-evidence layers, solicitation defaults, candidate roster.
-
-Aligns to the four viability dimensions added in PR #144 plus the
-existing archetype/structural dimensions. Schema in lib/decisions-schema.ts;
-ground-truth fixture in test/skills/idea-to-pdd/fixtures/turmeric-decisions.yaml.
-
-Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>"
-```
-
----
-
-### Task 5: Update the Phase Write-Back Contract in `agents/ace-orchestrator.md`
-
-**Files:**
-- Modify: `agents/ace-orchestrator.md`
-
-- [ ] **Step 1: Locate the contract section.**
-
-Run: `grep -n "Phase Write-Back Contract" agents/ace-orchestrator.md`
-
-Note the line number — it's the start of the section we'll append to.
-
-- [ ] **Step 2: Add the decisions-log clause.**
-
-Open `agents/ace-orchestrator.md` at the line above. The section already
-codifies "Every phase MUST write `phases.<phase>.{status, verdict,
-completed_at, summary_artifact, steps}` to `run_state.yaml` on
-completion …".
-
-After the existing contract paragraph, append this new paragraph:
-
-```markdown
-**Decisions log clause (added 2026-05-08).** Every phase MUST also
-append rows to `ACE/<opp>/runs/<run-id>/decisions.yaml` for any
-load-bearing default the phase applied that meets the bar criterion
-(see [`docs/superpowers/specs/2026-05-08-decisions-log-design.md`](../docs/superpowers/specs/2026-05-08-decisions-log-design.md) §
-Scope). Each phase's primary writing skill owns the rows it writes.
-The orchestrator stub-fills + warns post-phase if a phase wrote zero
-rows AND the calibration set for that phase has any required rows.
-PR #1 covers Phase 1 (`idea-to-pdd`); Phase 2–9 writes ship in PR #3.
-```
-
-- [ ] **Step 3: Sanity-grep.**
-
-Run: `grep -n "decisions.yaml\|decisions log" agents/ace-orchestrator.md`
-Expected: at least 1 match in the contract section.
-
-- [ ] **Step 4: Commit.**
-
-```bash
-git add agents/ace-orchestrator.md
-git commit -m "agents: extend Phase Write-Back Contract with decisions.yaml clause
-
-Every phase MUST append rows to decisions.yaml for load-bearing defaults
-meeting the bar criterion. Stub-fill + warn at orchestrator if a phase
-emitted zero rows when its calibration set requires at least one. PR #1
-covers Phase 1; Phase 2-9 writes ship in PR #3.
-
-Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>"
-```
-
----
-
-### Task 6: Run the full test suite
-
-- [ ] **Step 1: Run `npm test`.**
-
-Run: `npm test`
-Expected: PASS — full vitest suite green. Existing tests unaffected; 17 new assertions added (8 schema + 5 helpers + 4 fixture) bring total to ~655.
-
-- [ ] **Step 2: If anything fails, fix it.**
-
-Common failure modes:
-- Type errors in `lib/decisions-schema.ts` if Zod version mismatches what's in `package.json`. Fix: `grep "\"zod\":" package.json` to find the version, ensure code matches.
-- `js-yaml` not imported. Fix: confirm `import yaml from "js-yaml"` works (it's already a dependency for other helpers).
-- Snapshot test failure in `decisions-fixture.test.ts` if the fixture YAML has a typo. Fix: re-run with `-t` to see the offending row.
-
-Once green, no commit needed (no edits) — proceed.
-
----
-
-### Task 7: Version bump and PR
-
-- [ ] **Step 1: Run the worktree-safe version bump.**
-
-Run: `bash scripts/version-bump.sh`
-Expected: `VERSION`, `package.json`, `.claude-plugin/plugin.json`, `.claude-plugin/marketplace.json` all bumped by `+1` patch from `max(local, origin/main)`. Output prints the new version.
-
-- [ ] **Step 2: Commit the version bump.**
-
-```bash
-git add VERSION package.json .claude-plugin/plugin.json .claude-plugin/marketplace.json
-git commit -m "chore: bump version for decisions-log PR #1
-
-Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>"
-```
-
-- [ ] **Step 3: Push the branch.**
-
-Run: `git push -u origin emdash/questions-70lfu`
-Expected: pushes the branch with all 7 commits (5 feature + 1 contract + 1 version).
-
-- [ ] **Step 4: Open the PR.**
-
-```bash
-gh pr create --title "decisions-log PR #1: schema + Phase 1 write-side" --body "$(cat <<'EOF'
-## Summary
-
-First of four PRs landing the decisions-log architecture from
-[`docs/superpowers/specs/2026-05-08-decisions-log-design.md`](docs/superpowers/specs/2026-05-08-decisions-log-design.md).
-
-## What ships
-
-- **`lib/decisions-schema.ts`** — Zod schemas (`DecisionRowSchema`, `DecisionsLogSchema`), exported types (`DecisionRow`, `DecisionsLog`), YAML helpers (`parseDecisionsYaml`, `serializeDecisionsLog`). Mirrors `lib/qa-types.ts` conventions.
-- **`skills/idea-to-pdd/SKILL.md`** — replaces `## Open Questions Convention` with `## Decisions Log Convention`; skill now always emits `decisions.yaml` with the 14-row calibrated Phase 1 set covering all four viability dimensions from PR #144 (`demand_reality`, `resource_realism`, `mission_alignment`, `fallback_validates_primary`) plus archetype, FLW count, payment rate, pilot size, AI threshold, language, evidence layers, solicitation defaults, candidate roster.
-- **`test/skills/idea-to-pdd/fixtures/turmeric-decisions.yaml`** — calibrated ground-truth fixture.
-- **`test/skills/idea-to-pdd/decisions-fixture.test.ts`** — snapshot tests (4 assertions).
-- **`agents/ace-orchestrator.md`** — Phase Write-Back Contract gains a decisions-log clause.
-
-## What does NOT ship
-
-- Renderer (gdoc rendering) — PR #2.
-- Phase 2–9 writes — PR #3.
-- Round-trip sync skill — PR #4.
-
-## Coordination with QA/Eval migration
-
-This PR's schema lib follows `lib/qa-types.ts` conventions. The spec recommends landing this PR after `idea-to-pdd`'s QA/Eval migration (PR #147 Phase 1 PR #1) so the structural QA checks for `decisions.yaml` (file presence, schema validity, per-phase row coverage) can land natively in `idea-to-pdd-qa`. If that migration has shipped, see `docs/superpowers/specs/2026-05-08-decisions-log-design.md § Coordination` and add the QA checks as a follow-up commit on this PR. If not yet shipped, those checks land alongside the migration PR.
-
-## Test plan
-
-- [ ] CI green
-- [ ] `npm test` passes locally (~655 tests)
-- [ ] Manual verification: `/ace:step idea-to-pdd <opp>/<run-id>` against an existing turmeric-shaped fixture produces a `decisions.yaml` with the 14 calibrated row IDs
-- [ ] After merge: `/ace:update` + `/reload-plugins` to pick up the new schema
-
-🤖 Generated with [Claude Code](https://claude.com/claude-code)
-EOF
-)"
-```
-
-- [ ] **Step 5: Note the PR URL.**
-
-The PR command prints the URL. Save it for follow-up: after CI passes, `gh pr merge <number> --merge` (subject to branch protection — `clean-install` status check is required).
-
-After merge, immediately run `/ace:update` + `/reload-plugins` in the active session per CLAUDE.md's "Plugin updates — NEVER locally patch" rule.
-
----
-
-## Self-review pass
-
-**Spec coverage** — every spec section in the table at the top of this plan has a task. The four sub-projects from the spec map to four PRs; this plan covers PR #1 only and explicitly notes PR #2–#4 as out of scope.
-
-**Placeholder scan** — searched for `TBD`, `TODO`, `implement later`, `Add appropriate error handling`, `Similar to Task N`. None present. Every code step has the literal code; every command has the exact invocation; every file path is exact.
-
-**Type consistency** — `DecisionRow` / `DecisionsLog` are the canonical names; used the same way in `lib/decisions-schema.ts`, the test files, and the SKILL.md prose. `parseDecisionsYaml` / `serializeDecisionsLog` are referenced consistently. Status enum values (`applied | overridden | open`) match across the schema definition, the test fixtures, the SKILL.md convention section, and the PR description.
-
-**Spec → plan alignment** — the plan defers the structural QA checks (`idea-to-pdd-qa`) to a conditional follow-up because the QA/Eval migration of `idea-to-pdd` is the gating dependency, exactly as the spec's `## Coordination` section calls out. The plan also defers the eval rubric re-anchor (`deferred-decision-discipline` branch pointing at `decisions.yaml`) to a separate post-v1 PR per the spec's `### Eval rubric impact` non-goal.
-
----
-
-## Execution handoff
-
-Plan complete and saved to `docs/superpowers/plans/2026-05-08-decisions-log-pr1.md`. Two execution options:
-
-**1. Subagent-Driven (recommended)** — I dispatch a fresh subagent per task, review between tasks, fast iteration.
-
-**2. Inline Execution** — Execute tasks in this session using executing-plans, batch execution with checkpoints.
-
-Which approach?
diff --git a/docs/superpowers/plans/2026-05-08-decisions-log-pr2.md b/docs/superpowers/plans/2026-05-08-decisions-log-pr2.md
deleted file mode 100644
index ac3086e2..00000000
--- a/docs/superpowers/plans/2026-05-08-decisions-log-pr2.md
+++ /dev/null
@@ -1,1059 +0,0 @@
-# Decisions Log — PR #2: Retrofit + Renderer — Implementation Plan
-
-> **For agentic workers:** REQUIRED SUB-SKILL: Use superpowers:subagent-driven-development to implement this plan task-by-task.
-
-**Goal:** (1) Retire the hardcoded 14-row "Required Phase 1 row set" — replace with bar-criterion-driven inclusion plus a small "anchor decisions" list (~5 rows) tied to specific eval rubric dimensions. (2) Build a prose Google Doc renderer (`decisions-render` skill) that produces a human-readable rendering of `decisions.yaml` at one stable URL per run, regenerated at end of every phase.
-
-**Architecture:**
-- **Retrofit** is purely doc + test surface: the bar criterion already drives inclusion in principle; PR #1 hardcoded a 14-row required set as scaffolding. Retrofit removes the hardcoding so the bar criterion is the only filter, and shrinks the required set to anchors that map directly to eval rubric dimensions.
-- **Renderer** is a deterministic pipeline: `lib/decisions-renderer.ts` is a pure function (`DecisionsLog → BatchUpdateRequest[]`) that emits Google Docs API requests; `scripts/decisions-render.ts` is the runner that reads the YAML and applies the requests via Drive MCP. The skill body is a thin wrapper that invokes the script. Same pattern as `idea-to-pdd-qa`'s `checks.ts` + `scripts/qa-run.ts`.
-
-**Tech Stack:** TypeScript ESM, vitest, the existing `ace-gdrive` MCP (`docs_batch_update`, `drive_create_file`, `drive_read_file`), `lib/decisions-schema.ts` (already shipped). No new dependencies.
-
-**Spec deviations from `docs/superpowers/specs/2026-05-08-decisions-log-design.md`:**
-- Sub-project ordering: spec listed renderer as PR #3; this PR ships it as PR #2 because human visibility unblocks Phase 1 iteration. Phase 2–9 writes (originally PR #2 in the spec) move to a later PR.
-- Schema scope shrinks: spec implied a per-phase "required row set" (10–15 rows for Phase 1). Retrofit drops this in favor of bar-criterion-only inclusion + a small anchor set (~5 rows) tied to specific eval rubric dimensions. Rationale: hardcoding 14 rows × 9 phases ≈ 80+ enumerated questions across the codebase is heavy maintenance and constrains LLM judgment. The bar criterion (load-bearing AND maps to known surface) is the right filter; per-phase QA collapses to schema validation + run-level file presence.
-
----
-
-## Spec coverage map
-
-| Concern | Covered by |
-|---|---|
-| Retrofit Phase 1: drop required-row-set | Tasks 1–3 |
-| Renderer pure function | Task 4 |
-| Renderer runner script | Task 5 |
-| Renderer skill body | Task 6 |
-| Wire renderer into Phase 1 + gate brief | Task 7 |
-| Wire renderer into orchestrator (post-phase hook) | Task 8 |
-| Full test suite green | Task 9 |
-| Version bump + push + PR | Task 10 |
-
----
-
-## File structure
-
-**Create:**
-- `lib/decisions-renderer.ts` — pure function `renderDecisionsLog(log: DecisionsLog): docs_v1.Schema$Request[]` plus small helpers (heading paragraph, body paragraph, bullet list, italic notes block).
-- `test/lib/decisions-renderer.test.ts` — unit tests for the renderer (snapshot test on a small input, edge cases like empty decisions, single decision, multi-phase).
-- `scripts/decisions-render.ts` — CLI runner. Reads `decisions.yaml`, calls `renderDecisionsLog`, finds-or-creates the gdoc, clears existing body, applies the requests via `docs_batch_update`. Idempotent; one stable URL per run.
-- `skills/decisions-render/SKILL.md` — skill body wrapping the script.
-- `test/skills/decisions-render/script.test.ts` — integration test that drives the script end-to-end against a fake Drive client (no live MCP).
-
-**Modify:**
-- `skills/idea-to-pdd/SKILL.md` — replace `### Required Phase 1 row set` with `### Anchor decisions` (~5 rows) + `### Recommended additional rows (illustrative)` (~9 rows, non-binding). Update process step 3a's instructions to reference bar criterion as the filter; anchor list as a hint, not a checklist. Retain the existing schema-and-write-semantics + status-open-policy sub-sections. Update process step (post-PDD-write) to invoke `decisions-render`. Update gate brief to link the gdoc URL (not the YAML).
-- `test/skills/idea-to-pdd/fixtures/turmeric-decisions.yaml` — keep the 14 rows as an illustrative sample, no semantic changes.
-- `test/skills/idea-to-pdd/decisions-fixture.test.ts` — drop the "exactly these 14 IDs" assertion. Replace with: parses cleanly, every row has phase `1-design` and skill `idea-to-pdd`, every `status: open` row has a populated `notes` field, contains the 5 anchor rows.
-- `agents/ace-orchestrator.md` — extend Phase Write-Back Verifier (existing § Phase Write-Back Verifier) with a step that invokes `decisions-render` after every phase completes successfully. One-paragraph addition.
-- `lib/artifact-manifest.ts` — register the new `decisions-rendering` artifact (gdoc) under the run's per-phase artifacts. (The YAML itself was registered in PR #1.)
-- `VERSION`, `package.json`, `.claude-plugin/plugin.json`, `.claude-plugin/marketplace.json`, `package-lock.json` — version bumped via `scripts/version-bump.sh` at the end.
-
-**No deletes.** PR #1's `lib/decisions-schema.ts` and `test/skills/idea-to-pdd/fixtures/turmeric-decisions.yaml` stay as-is.
-
----
-
-## Tasks
-
-### Task 1: Retrofit `skills/idea-to-pdd/SKILL.md` — replace required-row-set with anchor list
-
-**Files:**
-- Modify: `skills/idea-to-pdd/SKILL.md`
-
-The current `## Decisions Log Convention` section has a `### Required Phase 1 row set` sub-section with a 14-row table that the skill MUST emit. Retrofit:
-
-- [ ] **Step 1: Replace `### Required Phase 1 row set` with `### Anchor decisions (rows the eval rubric depends on)`.**
-
-Find the existing sub-section (the 14-row table starting "Every Phase 1 run MUST emit at least the rows below…"). Replace the entire sub-section (including the table) with:
-
-```markdown
-### Anchor decisions (rows the eval rubric depends on)
-
-A small set of decisions are load-bearing for specific eval rubric dimensions
-— their absence means the rubric grades a missing input and the verdict is
-unreliable. The skill SHOULD emit these rows whenever they apply to the opp:
-
-| ID | Question | Eval rubric anchor |
-|---|---|---|
-| `archetype-selection` | Which delivery archetype best fits? | `archetype_coherence` |
-| `budget-plausibility` | Is the budget plausible for implied labor + AI infra? | `resource_realism` (PR #144) |
-| `named-downstream-consumer` | Pre-committed downstream consumer? | `demand_reality` (PR #144) |
-| `primary-metric-vs-goal` | Direct goal vs upstream proxy? | `mission_alignment` (PR #144) |
-| `ai-fallback-design` | True validation harness or parallel sampling? | `fallback_validates_primary` (PR #144) |
-
-If an anchor is genuinely irrelevant for the opp (rare — usually applies
-only when the question is structurally inapplicable), emit it with
-`status: applied` and a `notes` line explaining why the default is
-structural rather than a real choice. Do not silently omit.
-
-### Recommended additional rows (illustrative, non-binding)
-
-These rows often qualify under the bar criterion. They are examples of
-what the criterion typically catches, not requirements. Skip when not
-applicable; add others not listed when they meet the bar.
-
-| ID | Question | Map to surface |
-|---|---|---|
-| `flw-count` | How many FLWs? | PDD `FLW Requirements` numeric |
-| `payment-rate` | Per-visit payment rate to FLW? | PDD `FLW Requirements` numeric |
-| `pilot-sample-size` | Pilot sample size for AI calibration? | `verifiability` rubric |
-| `ai-photo-threshold` | AI auto-accept confidence threshold? | `verifiability` rubric |
-| `working-language` | Working language(s)? | PDD `Learn App Specification` |
-| `verification-layers` | Which evidence-model layers in scope? | PDD `Evidence Model` section |
-| `solicitation-type` | Solicitation type (EOI/RFP/custom)? | PDD `Solicitation` section |
-| `solicitation-deadline` | Solicitation deadline? | PDD `Solicitation` section |
-| `candidate-llo-roster` | Named candidates or public-only? | `LLO Preference` named entity |
-
-The bar criterion alone determines what rows belong in the log. The
-anchor list above is the only required surface; everything else is the
-LLM's judgment per the criterion.
-```
-
-- [ ] **Step 2: Update process step 3a to reference the anchor list, not a required set.**
-
-Find the existing step 3a (`**Author the decisions log.** Before drafting the PDD …`). Replace its second paragraph (currently "The skill MUST emit a complete decisions.yaml even when …") with:
-
-```
-    The skill MUST emit every anchor row from
-    `## Decisions Log Convention § Anchor decisions` whenever the anchor
-    applies to the opp (handle inapplicable cases by emitting the row with
-    `status: applied` and a notes-line explanation). Beyond the anchor set,
-    the skill emits whatever additional rows meet the bar criterion. The
-    bar is the filter; the recommended-additional list is illustrative.
-```
-
-- [ ] **Step 3: Update process step list to invoke `decisions-render` after writing the YAML.**
-
-Find the process steps. After step 7 (the final step: writing the gate brief), add a new final step:
-
-```
-8. **Render the decisions log to a human-readable Google Doc** by
-   invoking the `decisions-render` skill against the run-id. The
-   renderer produces `ACE/<opp-name>/runs/<run-id>/decisions.gdoc`
-   at one stable URL; humans review and iterate on this doc, not the
-   YAML. The orchestrator also invokes the renderer at end of every
-   subsequent phase, so the gdoc stays current as later phases append
-   rows.
-```
-
-- [ ] **Step 4: Update gate brief — link the gdoc, not the YAML.**
-
-Find the existing `- **Decisions Log:**` bullet in `## Gate Brief`. Replace it with:
-
-```markdown
-- **Decisions Log:** the skill always emits `decisions.yaml` and invokes
-  `decisions-render` to produce a prose Google Doc rendering at one
-  stable URL. Include the gdoc URL on its own line at the top of the
-  gate brief, prefixed `Decisions Log: <gdoc-url>`. The YAML lives at
-  `ACE/<opp-name>/runs/<run-id>/decisions.yaml`; the gdoc is its
-  human-friendly rendering and is regenerated after every phase.
-```
-
-- [ ] **Step 5: Update the `## Change Log` table.**
-
-Append:
-
-```markdown
-| 2026-05-08 | Retrofit: replace `### Required Phase 1 row set` (14 hardcoded rows) with `### Anchor decisions` (5 rows tied to specific eval rubric dimensions) + `### Recommended additional rows` (illustrative, non-binding). Bar criterion is the sole filter; anchors are the only required surface. Process step adds renderer invocation; gate brief links the gdoc rendering instead of the YAML. | ACE team (decisions-log PR #2) |
-```
-
-- [ ] **Step 6: Verify and commit.**
-
-Run: `grep -n "Required Phase 1 row set" skills/idea-to-pdd/SKILL.md`
-Expected: zero matches (the heading is gone).
-
-Run: `grep -n "Anchor decisions" skills/idea-to-pdd/SKILL.md`
-Expected: at least 2 matches (heading + reference from process step).
-
-Commit:
-
-```bash
-git add skills/idea-to-pdd/SKILL.md
-git commit -m "skill(idea-to-pdd): retire hardcoded 14-row required set; bar criterion is the sole filter
-
-Replaces \`### Required Phase 1 row set\` with two sub-sections:
-- \`### Anchor decisions\` (5 rows tied to specific eval rubric dimensions —
-  archetype-selection, budget-plausibility, named-downstream-consumer,
-  primary-metric-vs-goal, ai-fallback-design)
-- \`### Recommended additional rows\` (9 illustrative rows, non-binding)
-
-The bar criterion (load-bearing + maps to known surface) is the only
-filter. Anchors are the only required surface — their absence means an
-eval rubric dimension grades a missing input.
-
-Process step 8 invokes \`decisions-render\` to produce a prose Google Doc
-rendering at one stable URL. Gate brief links the gdoc instead of the YAML.
-
-Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>"
-```
-
----
-
-### Task 2: Update `test/skills/idea-to-pdd/decisions-fixture.test.ts` to match the retrofit
-
-**Files:**
-- Modify: `test/skills/idea-to-pdd/decisions-fixture.test.ts`
-
-PR #1's snapshot test asserts "contains exactly these 14 row IDs." Retrofit assertions to: schema valid; phase + skill correct; anchor rows present; status: open rows have notes.
-
-- [ ] **Step 1: Replace the test contents.**
-
-Replace the entire file body (keep the imports the same) with:
-
-```ts
-import { readFileSync } from "node:fs";
-import { dirname, resolve } from "node:path";
-import { fileURLToPath } from "node:url";
-import { describe, expect, it } from "vitest";
-import { parseDecisionsYaml } from "../../../lib/decisions-schema.js";
-
-const __dirname = dirname(fileURLToPath(import.meta.url));
-const FIXTURE = resolve(__dirname, "fixtures/turmeric-decisions.yaml");
-
-const ANCHOR_IDS = [
-  "ai-fallback-design",
-  "archetype-selection",
-  "budget-plausibility",
-  "named-downstream-consumer",
-  "primary-metric-vs-goal",
-];
-
-describe("turmeric calibration fixture", () => {
-  const yaml = readFileSync(FIXTURE, "utf-8");
-  const log = parseDecisionsYaml(yaml);
-
-  it("parses cleanly against DecisionsLogSchema", () => {
-    expect(log.schema_version).toBe(1);
-    expect(log.opportunity).toBe("turmeric");
-  });
-
-  it("scopes every row to phase 1-design and skill idea-to-pdd", () => {
-    for (const row of log.decisions) {
-      expect(row.phase).toBe("1-design");
-      expect(row.skill).toBe("idea-to-pdd");
-    }
-  });
-
-  it("contains every anchor row from the Phase 1 anchor list", () => {
-    const ids = new Set(log.decisions.map((d) => d.id));
-    for (const anchor of ANCHOR_IDS) {
-      expect(ids.has(anchor)).toBe(true);
-    }
-  });
-
-  it("ensures every status: open row has populated notes", () => {
-    const openRowsWithoutNotes = log.decisions
-      .filter((d) => d.status === "open")
-      .filter((d) => !d.notes || d.notes.trim().length === 0)
-      .map((d) => d.id);
-    expect(openRowsWithoutNotes).toEqual([]);
-  });
-});
-```
-
-- [ ] **Step 2: Run the test.**
-
-Run: `npx vitest run test/skills/idea-to-pdd/decisions-fixture.test.ts`
-Expected: PASS — 4 assertions green. The fixture from PR #1 has all 5 anchor IDs and the 2 open rows have notes, so this should pass without any fixture edits.
-
-- [ ] **Step 3: Commit.**
-
-```bash
-git add test/skills/idea-to-pdd/decisions-fixture.test.ts
-git commit -m "test(idea-to-pdd): retrofit fixture test to anchor + invariants model
-
-Drops the exact-14-IDs assertion. Now asserts:
-- Schema validity
-- Every row scoped to phase 1-design + skill idea-to-pdd
-- Every anchor row from the Phase 1 anchor list is present
-- Every status: open row has populated notes
-
-Aligns with the bar-criterion-only architecture.
-
-Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>"
-```
-
----
-
-### Task 3: Build the renderer pure function — `lib/decisions-renderer.ts`
-
-**Files:**
-- Create: `lib/decisions-renderer.ts`
-- Create: `test/lib/decisions-renderer.test.ts`
-
-The renderer is a pure function: `(log: DecisionsLog) => Request[]` where `Request` matches the `docs.documents.batchUpdate` `requests[]` schema. The output is the list of API requests that, when applied to a fresh empty Google Doc, produces the prose rendering.
-
-The rendering has this layout:
-
-```
-Decisions Log — <opportunity> / run <run_id>           [HEADING_1]
-
-Generated <generated_at>. To override a default, edit the           [body, italic]
-"Default:" line of the relevant decision below. To propose a new
-option, add a bullet to "Considered:". Then run
-/ace:step decisions-sync <opp>/<run-id> to push your edits back.
-
-──────────────────────────────────────────                          [body horizontal divider]
-
-Phase 1 — Design                                       [HEADING_2]
-
-archetype-selection                                    [HEADING_3]
-Which delivery archetype best fits the intervention?   [body, bold]
-
-  Default: atomic-visit                                [body, "Default:" bold]
-  Considered:                                          [body, bold prefix]
-    • atomic-visit                                     [bullet]
-    • focus-group                                      [bullet]
-    • multi-stage                                      [bullet]
-  Source: idea.md §1; one-FLW-one-delivery pattern     [body, "Source:" bold]
-  Status: applied                                      [body, "Status:" bold]
-
-  Single per-FLW visit producing one structured delivery.   [body, italic, indented]
-
-flw-count                                              [HEADING_3]
-...
-```
-
-Status `open` rows get extra emphasis: `Status: OPEN — load-bearing; human edit recommended` rendered with red foreground or bold-red.
-
-The renderer's job: produce a sequence of requests that build this layout. Strategy: produce **only `insertText` requests in document-order**, then `updateParagraphStyle` and `updateTextStyle` requests at the end (after all text is in place — by then, indices are stable). Index management is local (each insert appends at the current end-of-doc offset).
-
-- [ ] **Step 1: Write the failing test.**
-
-Create `test/lib/decisions-renderer.test.ts`:
-
-```ts
-import { describe, expect, it } from "vitest";
-import type { DecisionsLog } from "../../lib/decisions-schema.js";
-import { renderDecisionsLog } from "../../lib/decisions-renderer.js";
-
-const MINIMAL_LOG: DecisionsLog = {
-  schema_version: 1,
-  opportunity: "turmeric",
-  run_id: "20260507-1733",
-  generated_at: "2026-05-07T17:33:00Z",
-  decisions: [
-    {
-      id: "archetype-selection",
-      phase: "1-design",
-      skill: "idea-to-pdd",
-      question: "Which delivery archetype best fits the intervention?",
-      default: "atomic-visit",
-      options_considered: ["atomic-visit", "focus-group", "multi-stage"],
-      source: "idea.md §1",
-      status: "applied",
-      notes: "Single per-FLW visit producing one structured delivery.",
-    },
-  ],
-};
-
-describe("renderDecisionsLog", () => {
-  it("returns a non-empty array of Docs API requests", () => {
-    const requests = renderDecisionsLog(MINIMAL_LOG);
-    expect(Array.isArray(requests)).toBe(true);
-    expect(requests.length).toBeGreaterThan(0);
-  });
-
-  it("includes an insertText request for the title", () => {
-    const requests = renderDecisionsLog(MINIMAL_LOG);
-    const titleInsert = requests.find(
-      (r) =>
-        "insertText" in r &&
-        r.insertText?.text?.includes("Decisions Log — turmeric / run 20260507-1733"),
-    );
-    expect(titleInsert).toBeDefined();
-  });
-
-  it("includes a HEADING_1 paragraph style update covering the title", () => {
-    const requests = renderDecisionsLog(MINIMAL_LOG);
-    const h1 = requests.find(
-      (r) =>
-        "updateParagraphStyle" in r &&
-        r.updateParagraphStyle?.paragraphStyle?.namedStyleType === "HEADING_1",
-    );
-    expect(h1).toBeDefined();
-  });
-
-  it("includes a HEADING_2 paragraph style update for each phase header", () => {
-    const requests = renderDecisionsLog(MINIMAL_LOG);
-    const h2 = requests.filter(
-      (r) =>
-        "updateParagraphStyle" in r &&
-        r.updateParagraphStyle?.paragraphStyle?.namedStyleType === "HEADING_2",
-    );
-    // Single phase = single H2 ("Phase 1 — Design")
-    expect(h2).toHaveLength(1);
-  });
-
-  it("includes a HEADING_3 paragraph style update for each decision id", () => {
-    const requests = renderDecisionsLog(MINIMAL_LOG);
-    const h3 = requests.filter(
-      (r) =>
-        "updateParagraphStyle" in r &&
-        r.updateParagraphStyle?.paragraphStyle?.namedStyleType === "HEADING_3",
-    );
-    expect(h3).toHaveLength(1);
-  });
-
-  it("creates bullet list for the options_considered items", () => {
-    const requests = renderDecisionsLog(MINIMAL_LOG);
-    const bullets = requests.find(
-      (r) => "createParagraphBullets" in r,
-    );
-    expect(bullets).toBeDefined();
-  });
-
-  it("groups decisions by phase with a HEADING_2 per phase", () => {
-    const multiPhaseLog: DecisionsLog = {
-      ...MINIMAL_LOG,
-      decisions: [
-        { ...MINIMAL_LOG.decisions[0]!, id: "row-a", phase: "1-design" },
-        { ...MINIMAL_LOG.decisions[0]!, id: "row-b", phase: "1-design" },
-        { ...MINIMAL_LOG.decisions[0]!, id: "row-c", phase: "2-commcare" },
-      ],
-    };
-    const requests = renderDecisionsLog(multiPhaseLog);
-    const h2 = requests.filter(
-      (r) =>
-        "updateParagraphStyle" in r &&
-        r.updateParagraphStyle?.paragraphStyle?.namedStyleType === "HEADING_2",
-    );
-    expect(h2).toHaveLength(2); // 1-design + 2-commcare
-  });
-
-  it("returns no requests for an empty decisions array (only the title block)", () => {
-    const empty: DecisionsLog = { ...MINIMAL_LOG, decisions: [] };
-    const requests = renderDecisionsLog(empty);
-    // Should still emit title + intro; no decision sections.
-    expect(requests.length).toBeGreaterThan(0);
-    const h2 = requests.filter(
-      (r) =>
-        "updateParagraphStyle" in r &&
-        r.updateParagraphStyle?.paragraphStyle?.namedStyleType === "HEADING_2",
-    );
-    expect(h2).toHaveLength(0); // no phases
-    const h3 = requests.filter(
-      (r) =>
-        "updateParagraphStyle" in r &&
-        r.updateParagraphStyle?.paragraphStyle?.namedStyleType === "HEADING_3",
-    );
-    expect(h3).toHaveLength(0); // no decisions
-  });
-
-  it("emphasizes status: open rows distinctly from status: applied", () => {
-    const openLog: DecisionsLog = {
-      ...MINIMAL_LOG,
-      decisions: [
-        {
-          ...MINIMAL_LOG.decisions[0]!,
-          id: "named-downstream-consumer",
-          status: "open",
-          notes: "No consumer named.",
-        },
-      ],
-    };
-    const requests = renderDecisionsLog(openLog);
-    // The "Status: OPEN" text should appear (not just "Status: open")
-    const statusText = requests.find(
-      (r) =>
-        "insertText" in r &&
-        (r.insertText?.text?.includes("Status: OPEN") ||
-          r.insertText?.text?.includes("OPEN — load-bearing")),
-    );
-    expect(statusText).toBeDefined();
-  });
-});
-```
-
-- [ ] **Step 2: Run the test to verify it fails.**
-
-Run: `npx vitest run test/lib/decisions-renderer.test.ts`
-Expected: FAIL with `Cannot find module '../../lib/decisions-renderer.js'`.
-
-- [ ] **Step 3: Implement the renderer.**
-
-Create `lib/decisions-renderer.ts`. The implementer chooses the exact structure; the constraints from the tests are:
-- Pure function `renderDecisionsLog(log: DecisionsLog): docs_v1.Schema$Request[]` (or compatible array shape — the test only checks for `insertText`, `updateParagraphStyle`, `createParagraphBullets` on each request).
-- Title: `Decisions Log — <opportunity> / run <run_id>` styled HEADING_1.
-- Per-phase HEADING_2 ("Phase 1 — Design", "Phase 2 — CommCare", etc. — derive the human label from the `<N>-<name>` slug; capitalize the name segment).
-- Per-decision HEADING_3 (the row's `id`).
-- Each decision section: question (bold body), `Default: <value>`, `Considered: <bullet list>`, `Source: <text>`, `Status: applied|overridden|OPEN — load-bearing; human edit recommended`, optional italic notes paragraph.
-- Status `open` rendered as `Status: OPEN — load-bearing; human edit recommended` to differentiate from `applied`/`overridden`.
-- Empty `decisions: []` produces only the title block; no phase or decision sections.
-
-Implementer should reference the `googleapis` package's `docs_v1.Schema$Request` type. If that import isn't already in the repo, define a minimal local `BatchUpdateRequest` type union (insertText | updateParagraphStyle | createParagraphBullets | updateTextStyle) and export it.
-
-Use a `RequestBuilder` class or stateful helper that tracks the current end-of-doc offset and emits requests in order. After all `insertText` requests are emitted, append style requests at the end (their indices are stable because all text is already in).
-
-- [ ] **Step 4: Run the test to verify it passes.**
-
-Run: `npx vitest run test/lib/decisions-renderer.test.ts`
-Expected: PASS — 9 assertions green.
-
-- [ ] **Step 5: Commit.**
-
-```bash
-git add lib/decisions-renderer.ts test/lib/decisions-renderer.test.ts
-git commit -m "lib: add pure-function decisions-log renderer
-
-renderDecisionsLog(log) returns a Docs API batch-update request list
-that builds a prose Google Doc from a DecisionsLog. HEADING_1 title,
-HEADING_2 per phase, HEADING_3 per decision id, bold field labels,
-bullet list for options_considered, italic notes block, distinct
-emphasis for status: open rows.
-
-Pure function — no Drive calls. Caller (scripts/decisions-render.ts)
-applies the requests via docs_batch_update.
-
-Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>"
-```
-
----
-
-### Task 4: Build the runner script — `scripts/decisions-render.ts`
-
-**Files:**
-- Create: `scripts/decisions-render.ts`
-- Create: `test/skills/decisions-render/script.test.ts` — integration test against a fake Drive client.
-
-The runner is the glue: read the YAML, call the renderer, find-or-create the gdoc, clear existing content, apply the new requests.
-
-CLI shape:
-
-```
-npx tsx scripts/decisions-render.ts <opp-name>/<run-id>
-# or
-npx tsx scripts/decisions-render.ts <run-folder-fileId>
-```
-
-- [ ] **Step 1: Write the failing test.**
-
-Create `test/skills/decisions-render/script.test.ts`:
-
-```ts
-import { describe, expect, it, vi } from "vitest";
-import type { DecisionsLog } from "../../../lib/decisions-schema.js";
-import { runDecisionsRender } from "../../../scripts/decisions-render.js";
-
-const SAMPLE_LOG: DecisionsLog = {
-  schema_version: 1,
-  opportunity: "turmeric",
-  run_id: "20260507-1733",
-  generated_at: "2026-05-07T17:33:00Z",
-  decisions: [
-    {
-      id: "archetype-selection",
-      phase: "1-design",
-      skill: "idea-to-pdd",
-      question: "Which delivery archetype?",
-      default: "atomic-visit",
-      options_considered: ["atomic-visit", "focus-group", "multi-stage"],
-      source: "idea.md §1",
-      status: "applied",
-    },
-  ],
-};
-
-function makeFakeDriveClient() {
-  return {
-    readFile: vi.fn().mockResolvedValue({ content: "" }),
-    findOrCreateDoc: vi.fn().mockResolvedValue({ id: "fake-gdoc-id", reused: false }),
-    batchUpdateDoc: vi.fn().mockResolvedValue({ replies: [] }),
-    clearDocBody: vi.fn().mockResolvedValue(undefined),
-  };
-}
-
-describe("runDecisionsRender", () => {
-  it("reads the YAML, finds-or-creates the gdoc, clears it, and applies the rendered requests", async () => {
-    const client = makeFakeDriveClient();
-    client.readFile.mockResolvedValueOnce({
-      content: `schema_version: 1
-opportunity: turmeric
-run_id: 20260507-1733
-generated_at: "2026-05-07T17:33:00Z"
-decisions:
-  - id: archetype-selection
-    phase: 1-design
-    skill: idea-to-pdd
-    question: Which delivery archetype?
-    default: atomic-visit
-    options_considered: ["atomic-visit", "focus-group", "multi-stage"]
-    source: idea.md §1
-    status: applied
-`,
-    });
-
-    const result = await runDecisionsRender({
-      runFolderFileId: "fake-folder-id",
-      driveClient: client,
-    });
-
-    expect(client.readFile).toHaveBeenCalled();
-    expect(client.findOrCreateDoc).toHaveBeenCalledWith({
-      parentFolderId: "fake-folder-id",
-      name: "decisions.gdoc",
-    });
-    expect(client.clearDocBody).toHaveBeenCalledWith("fake-gdoc-id");
-    expect(client.batchUpdateDoc).toHaveBeenCalled();
-    const callArgs = client.batchUpdateDoc.mock.calls[0]![0];
-    expect(callArgs.documentId).toBe("fake-gdoc-id");
-    expect(callArgs.requests.length).toBeGreaterThan(0);
-    expect(result).toMatchObject({ gdocId: "fake-gdoc-id" });
-  });
-
-  it("throws an actionable error when decisions.yaml is missing from the run folder", async () => {
-    const client = makeFakeDriveClient();
-    client.readFile.mockRejectedValueOnce(new Error("File not found"));
-
-    await expect(
-      runDecisionsRender({ runFolderFileId: "fake-folder-id", driveClient: client }),
-    ).rejects.toThrow(/decisions\.yaml/);
-  });
-
-  it("throws on schema-invalid YAML with the schema dot-path", async () => {
-    const client = makeFakeDriveClient();
-    client.readFile.mockResolvedValueOnce({
-      content: `schema_version: 1
-opportunity: turmeric
-run_id: 20260507-1733
-generated_at: "2026-05-07T17:33:00Z"
-decisions:
-  - id: ""
-    phase: 1-design
-    skill: idea-to-pdd
-    question: Q
-    default: x
-    options_considered: []
-    source: x
-    status: applied
-`,
-    });
-
-    await expect(
-      runDecisionsRender({ runFolderFileId: "fake-folder-id", driveClient: client }),
-    ).rejects.toThrow(/decisions\.0\.id/);
-  });
-});
-```
-
-- [ ] **Step 2: Run the test to verify it fails.**
-
-Run: `npx vitest run test/skills/decisions-render/script.test.ts`
-Expected: FAIL with `Cannot find module '../../../scripts/decisions-render.js'` or `runDecisionsRender is not a function`.
-
-- [ ] **Step 3: Implement the script.**
-
-Create `scripts/decisions-render.ts`:
-
-```ts
-#!/usr/bin/env npx tsx
-import { parseDecisionsYaml } from "../lib/decisions-schema.js";
-import { renderDecisionsLog } from "../lib/decisions-renderer.js";
-
-/**
- * Drive client interface — subset of ace-gdrive operations needed by
- * the renderer runner. Production callers pass a wrapper around the
- * MCP atoms; tests pass a mock.
- */
-export interface DecisionsRenderDriveClient {
-  readFile(args: { parentFolderId: string; name: string }): Promise<{ content: string }>;
-  findOrCreateDoc(args: { parentFolderId: string; name: string }): Promise<{ id: string; reused: boolean }>;
-  clearDocBody(docId: string): Promise<void>;
-  batchUpdateDoc(args: { documentId: string; requests: unknown[] }): Promise<{ replies: unknown[] }>;
-}
-
-export interface RunDecisionsRenderArgs {
-  runFolderFileId: string;
-  driveClient: DecisionsRenderDriveClient;
-}
-
-export interface RunDecisionsRenderResult {
-  gdocId: string;
-  reused: boolean;
-  requestCount: number;
-}
-
-/**
- * Read decisions.yaml from a run folder, render it, and apply the
- * rendered requests to the per-run decisions.gdoc. Idempotent: the
- * gdoc lives at one stable URL per run; existing content is cleared
- * before the new render is applied.
- */
-export async function runDecisionsRender(
-  args: RunDecisionsRenderArgs,
-): Promise<RunDecisionsRenderResult> {
-  const { runFolderFileId, driveClient } = args;
-
-  let yamlContent: string;
-  try {
-    const file = await driveClient.readFile({
-      parentFolderId: runFolderFileId,
-      name: "decisions.yaml",
-    });
-    yamlContent = file.content;
-  } catch (err) {
-    const msg = err instanceof Error ? err.message : String(err);
-    throw new Error(
-      `decisions.yaml not found in run folder ${runFolderFileId}: ${msg}`,
-    );
-  }
-
-  const log = parseDecisionsYaml(yamlContent);
-  const requests = renderDecisionsLog(log);
-
-  const gdoc = await driveClient.findOrCreateDoc({
-    parentFolderId: runFolderFileId,
-    name: "decisions.gdoc",
-  });
-
-  await driveClient.clearDocBody(gdoc.id);
-  await driveClient.batchUpdateDoc({
-    documentId: gdoc.id,
-    requests,
-  });
-
-  return {
-    gdocId: gdoc.id,
-    reused: gdoc.reused,
-    requestCount: requests.length,
-  };
-}
-
-// CLI entry point — only when invoked directly as a script.
-if (import.meta.url === `file://${process.argv[1]}`) {
-  const arg = process.argv[2];
-  if (!arg) {
-    console.error("Usage: npx tsx scripts/decisions-render.ts <run-folder-fileId>");
-    process.exit(1);
-  }
-  // CLI mode requires a real Drive client; defer to the skill body which
-  // wires the ace-gdrive MCP atoms into the DecisionsRenderDriveClient
-  // interface. This block only runs for manual ad-hoc invocations from
-  // a developer shell with the MCP available.
-  console.error(
-    "Direct CLI mode not yet wired — invoke via /ace:step decisions-render <opp>/<run-id> instead.",
-  );
-  process.exit(2);
-}
-```
-
-- [ ] **Step 4: Run the test to verify it passes.**
-
-Run: `npx vitest run test/skills/decisions-render/script.test.ts`
-Expected: PASS — 3 assertions green.
-
-- [ ] **Step 5: Commit.**
-
-```bash
-git add scripts/decisions-render.ts test/skills/decisions-render/script.test.ts
-git commit -m "scripts: add decisions-render runner
-
-Reads decisions.yaml from a run folder, calls renderDecisionsLog,
-finds-or-creates decisions.gdoc, clears existing content, applies the
-rendered requests via docs_batch_update. Idempotent — one stable URL
-per run, regenerated on every invocation.
-
-DriveClient interface decouples the runner from the ace-gdrive MCP for
-testability (production wires MCP atoms; tests use a mock).
-
-Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>"
-```
-
----
-
-### Task 5: Create `skills/decisions-render/SKILL.md`
-
-**Files:**
-- Create: `skills/decisions-render/SKILL.md`
-
-The skill body is a thin wrapper that wires the MCP atoms into the `DecisionsRenderDriveClient` interface and invokes `runDecisionsRender`.
-
-- [ ] **Step 1: Write the SKILL.md.**
-
-Create `skills/decisions-render/SKILL.md`:
-
-```markdown
----
-name: decisions-render
-description: >
-  Render a per-run decisions.yaml into a prose Google Doc at one stable
-  URL per run. Invoked at end of every phase; idempotent.
-disable-model-invocation: true
----
-
-# Decisions Render
-
-Read `decisions.yaml` from a run folder, render it as a prose Google Doc, and
-write the result to `decisions.gdoc` at one stable URL.
-
-## Inputs
-
-| Source | Artifact | Used for |
-|---|---|---|
-| Per-run state | `ACE/<opp-name>/runs/<run-id>/decisions.yaml` | the structured log to render |
-
-## Outputs
-
-- `ACE/<opp-name>/runs/<run-id>/decisions.gdoc` — prose Google Doc rendering at one stable URL. Find-or-update semantics; existing content is cleared and replaced on every invocation.
-
-## Process
-
-1. **Resolve the run folder file ID** for `ACE/<opp-name>/runs/<run-id>/`.
-
-   Use `drive_list_folder` from the opp folder to find the run folder.
-
-2. **Run the renderer script**:
-
-   ```bash
-   npx tsx scripts/decisions-render.ts <run-folder-fileId>
-   ```
-
-   The script:
-   - Reads `decisions.yaml` via `drive_read_file`.
-   - Parses and validates via `parseDecisionsYaml` from `lib/decisions-schema.ts`.
-   - Renders via `renderDecisionsLog` from `lib/decisions-renderer.ts` (pure function — produces a list of Google Docs API requests).
-   - Finds-or-creates `decisions.gdoc` via `drive_create_file` (with `findOrCreate: true`).
-   - Clears existing body (single `deleteContentRange` request covering the doc).
-   - Applies the rendered requests via `docs_batch_update`.
-
-3. **Confirm the gdoc URL** by reading the create result's `webViewLink` and emit it on stdout. The orchestrator captures this URL for the gate brief's `Decisions Log:` line.
-
-## Failure modes
-
-- **decisions.yaml is missing**: the script throws with the run folder ID; the orchestrator's Phase Write-Back Verifier should have already created an empty decisions.yaml before this skill runs. If it didn't, the skill halts and surfaces the missing-file error to the operator.
-- **Schema-invalid YAML**: the script throws with the dot-path of the offending field. The originating skill (whichever phase wrote the bad row) gets a hard fail; orchestrator surfaces in the gate brief's BLOCKER list.
-- **Docs API rate limit**: rare — the renderer makes one batch update per phase. Retry once after 30s; halt with actionable error if it fails again.
-
-## MCP Tools Used
-
-- Google Drive: `drive_list_folder`, `drive_read_file`, `drive_create_file`, `docs_batch_update`
-
-## Mode Behavior
-
-- **Auto:** Run, no human pause. Stdout includes the gdoc URL for downstream skills.
-- **Review:** Same as Auto — the renderer is deterministic, no human review of the rendering itself.
-
-## Change Log
-
-| Date | Change | Author |
-|------|--------|--------|
-| 2026-05-08 | Initial skill — pairs with `lib/decisions-renderer.ts` and `scripts/decisions-render.ts`. Renders decisions.yaml as a prose Google Doc; idempotent; runs at end of every phase. | ACE team (decisions-log PR #2) |
-```
-
-- [ ] **Step 2: Sanity-check the file structure.**
-
-Run: `head -1 skills/decisions-render/SKILL.md`
-Expected: `---` (frontmatter intact).
-
-Run: `grep -c "^## " skills/decisions-render/SKILL.md`
-Expected: 7 (Inputs, Outputs, Process, Failure modes, MCP Tools Used, Mode Behavior, Change Log).
-
-- [ ] **Step 3: Commit.**
-
-```bash
-git add skills/decisions-render/SKILL.md
-git commit -m "skill(decisions-render): wrap renderer + script as an ACE skill
-
-Thin skill body that resolves the run folder file ID, invokes
-scripts/decisions-render.ts, captures the resulting gdoc URL.
-Idempotent; runs at end of every phase via the orchestrator's Phase
-Write-Back Verifier (wired in the next commit).
-
-Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>"
-```
-
----
-
-### Task 6: Wire `decisions-render` into the orchestrator's Phase Write-Back Verifier
-
-**Files:**
-- Modify: `agents/ace-orchestrator.md`
-
-After every successful phase, the orchestrator should invoke `decisions-render` so the gdoc stays current. The natural integration point is the existing § Phase Write-Back Verifier section (the "after each `Agent(<phase>)` dispatch" loop).
-
-- [ ] **Step 1: Locate the Phase Write-Back Verifier section.**
-
-Run: `grep -n "Phase Write-Back Verifier" agents/ace-orchestrator.md`
-
-- [ ] **Step 2: Append a step to the verifier procedure.**
-
-Find the existing step list inside § Phase Write-Back Verifier (it has a "Procedure." heading followed by numbered steps). Append a new final step after the existing last step:
-
-```markdown
-4. **Re-render the decisions log gdoc.** After verifying the phase
-   wrote back its rows to `decisions.yaml`, invoke
-   `Skill(decisions-render)` against the run-id. The renderer produces
-   `ACE/<opp>/runs/<run-id>/decisions.gdoc` — a prose Google Doc at one
-   stable URL — and is idempotent across re-runs. Capture the gdoc's
-   webViewLink and inject it into the next gate brief's `Decisions Log:`
-   line. The renderer is fast (one batchUpdate call); failure is a
-   `[WARN]` not a `[BLOCKER]` — the YAML is the source of truth, the
-   gdoc is just the rendering.
-```
-
-- [ ] **Step 3: Sanity-check + commit.**
-
-Run: `grep -n "decisions-render" agents/ace-orchestrator.md`
-Expected: at least 1 match in the verifier step.
-
-```bash
-git add agents/ace-orchestrator.md
-git commit -m "agents: wire decisions-render into Phase Write-Back Verifier
-
-After every phase's write-back is verified, invoke decisions-render
-to refresh the per-run decisions.gdoc. Idempotent; one stable URL
-per run; failure is WARN not BLOCKER (YAML is the source of truth).
-
-Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>"
-```
-
----
-
-### Task 7: Update `lib/artifact-manifest.ts` to register the gdoc artifact
-
-**Files:**
-- Modify: `lib/artifact-manifest.ts`
-
-The artifact manifest is the canonical registry of run artifacts. PR #1 should have registered `decisions.yaml`; this task adds `decisions.gdoc`.
-
-- [ ] **Step 1: Find the existing decisions.yaml registration.**
-
-Run: `grep -n "decisions" lib/artifact-manifest.ts`
-
-If `decisions.yaml` is registered, follow the existing pattern. If not, add both entries together.
-
-- [ ] **Step 2: Add the gdoc entry.**
-
-Add an entry next to the existing decisions.yaml registration, mirroring its structure but with:
-- A name like `decisions-rendering` or `decisions-gdoc`.
-- Path: `ACE/<opp>/runs/<run-id>/decisions.gdoc`.
-- Mime type: `application/vnd.google-apps.document`.
-- Producer: `decisions-render`.
-- Owner role: same as decisions.yaml.
-
-(The exact registration shape depends on the existing manifest schema. Read the file to see how other artifacts like `idea-to-pdd_gate-brief` or `idea-to-pdd-qa_result.yaml` are structured.)
-
-- [ ] **Step 3: Run the manifest test.**
-
-Run: `npx vitest run lib/artifact-manifest.test.ts`
-Expected: PASS — manifest fixtures still validate.
-
-- [ ] **Step 4: Commit.**
-
-```bash
-git add lib/artifact-manifest.ts
-git commit -m "lib: register decisions.gdoc in artifact manifest
-
-Pairs decisions.yaml (registered in PR #1) with decisions.gdoc, the
-prose rendering produced by the new decisions-render skill.
-
-Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>"
-```
-
----
-
-### Task 8: Run the full test suite
-
-- [ ] **Step 1: Run `npm test`.**
-
-Run: `npm test`
-Expected: PASS — full vitest suite green. Existing tests unaffected; new test files added: `test/lib/decisions-renderer.test.ts` (~9 assertions) and `test/skills/decisions-render/script.test.ts` (~3 assertions). Modified: `test/skills/idea-to-pdd/decisions-fixture.test.ts` (4 assertions, was 4).
-
-- [ ] **Step 2: If anything fails, fix it.**
-
-Common failure modes:
-- Type errors in the renderer if `googleapis` types aren't imported. Fix: define a local minimal request type union or import from `@googleapis/docs`.
-- The retrofitted fixture test fails if any anchor row was renamed in PR #1. Fix: align the ANCHOR_IDS list with what's actually in the fixture.
-- Runner test fails because the fake Drive client interface drifts from the production interface. Fix: keep both in sync in `scripts/decisions-render.ts`.
-
-Once green, no commit needed (no edits) — proceed.
-
----
-
-### Task 9: Version bump and PR
-
-- [ ] **Step 1: Run the worktree-safe version bump.**
-
-Run: `bash scripts/version-bump.sh`
-Expected: `VERSION` and the three plugin manifest files bumped to `max(local, origin/main) + patch+1`.
-
-- [ ] **Step 2: Sync `package-lock.json`.**
-
-Run: `npm install --package-lock-only`
-Expected: lockfile's `"version"` field updated to match `package.json`.
-
-- [ ] **Step 3: Commit.**
-
-```bash
-git add VERSION package.json package-lock.json .claude-plugin/plugin.json .claude-plugin/marketplace.json
-git commit -m "chore: bump version for decisions-log PR #2 (renderer + retrofit)
-
-Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>"
-```
-
-- [ ] **Step 4: Push the branch.**
-
-Run: `git push -u origin <branch-name>` (use whatever the current branch is — probably the same `emdash/questions-70lfu` branch reset to origin/main at the start of this PR's work).
-
-- [ ] **Step 5: Open the PR.**
-
-```bash
-gh pr create --title "decisions-log PR #2: renderer + Phase 1 retrofit" --body "$(cat <<'EOF'
-## Summary
-
-Second PR in the decisions-log series ([spec](docs/superpowers/specs/2026-05-08-decisions-log-design.md)). Two coupled changes:
-
-1. **Retrofit Phase 1**: drop the hardcoded 14-row "Required Phase 1 row set" introduced in PR #1. Replace with a 5-row "Anchor decisions" list (rows tied to specific eval rubric dimensions: archetype_coherence, resource_realism, demand_reality, mission_alignment, fallback_validates_primary). Recommended additional rows are illustrative, not binding. The bar criterion (load-bearing + maps to known surface) is the only filter.
-
-2. **Renderer**: new `decisions-render` skill produces a prose Google Doc rendering of `decisions.yaml` at one stable URL per run. Pure-function `lib/decisions-renderer.ts` builds a Docs API request list; `scripts/decisions-render.ts` applies it via `docs_batch_update`. Wired into the orchestrator's Phase Write-Back Verifier — runs at end of every phase, idempotent.
-
-## Why retrofit + renderer in one PR
-
-The retrofit is a small architectural fix (drop hardcoded rows) that should land before the renderer because the renderer's eval rubric and gate-brief integration both depend on the cleaner schema. Bundling avoids landing the renderer over the wrong shape.
-
-## What ships
-
-- `lib/decisions-renderer.ts` — pure-function renderer (HEADING_1 title, HEADING_2 per phase, HEADING_3 per decision, bold field labels, bullet list for options_considered, italic notes, distinct emphasis for `status: open`).
-- `scripts/decisions-render.ts` — runner with a `DecisionsRenderDriveClient` interface (testable; production wires MCP atoms).
-- `skills/decisions-render/SKILL.md` — skill wrapping the script.
-- `agents/ace-orchestrator.md` — Phase Write-Back Verifier invokes `decisions-render` after every phase.
-- `skills/idea-to-pdd/SKILL.md` — retrofit: anchor list (5 rows) + recommended additional rows (illustrative, 9 rows). Process step adds renderer invocation; gate brief links the gdoc.
-- `test/lib/decisions-renderer.test.ts` — 9 assertions on the renderer's output shape.
-- `test/skills/decisions-render/script.test.ts` — 3 assertions on the runner against a fake Drive client.
-- `test/skills/idea-to-pdd/decisions-fixture.test.ts` — retrofitted to assert anchor + invariants (no longer the exact-14-IDs check).
-- `lib/artifact-manifest.ts` — registers `decisions.gdoc`.
-
-## What does NOT ship
-
-- Round-trip sync (`decisions-sync` skill that parses gdoc edits and writes back to YAML) — PR #3.
-- Phase 2–9 writes (one PR per phase) — PRs #4–#11.
-- Eval rubric re-anchor (`idea-to-pdd-eval`'s `deferred-decision-discipline` branch grading on `decisions.yaml` directly) — separate follow-up PR.
-
-## Test plan
-
-- [ ] CI green
-- [ ] `npm test` passes locally
-- [ ] Manual verification: re-run idea-to-pdd against an existing turmeric run, confirm `decisions.gdoc` appears in the run folder with the expected layout
-- [ ] After merge: `/ace:update` + `/reload-plugins` to pick up the new schema
-
-🤖 Generated with [Claude Code](https://claude.com/claude-code)
-EOF
-)"
-```
-
----
-
-## Self-review pass
-
-**Spec coverage** — every item in the coverage map at the top is mapped to a task. PRs #3 and onwards are deliberately out of scope.
-
-**Placeholder scan** — searched for `TBD`, `TODO`, `implement later`, `Add appropriate error handling`, `Similar to Task N`. None present. Every code step has the literal code; every command has the exact invocation; every file path is exact.
-
-**Type consistency** — `renderDecisionsLog` (renderer pure function), `runDecisionsRender` (runner script entry), `DecisionsRenderDriveClient` (driver interface), `DecisionsLog` / `DecisionRow` (already shipped in PR #1) — names used consistently across plan, code blocks, and test assertions.
-
-**Spec → plan alignment** — matches `docs/superpowers/specs/2026-05-08-decisions-log-design.md` § Rendering (prose Google Doc, native heading styles via `docs_batch_update`, find-or-update at one stable URL, regenerated end of every phase). The two named spec deviations (sub-project ordering swap + dropping required-row hardcoding) are called out in the plan header and don't change the architecture; they tighten it.
-
----
-
-## Execution handoff
-
-Plan complete and saved to `docs/superpowers/plans/2026-05-08-decisions-log-pr2.md`. Subagent-driven execution recommended (matches PR #1's pattern).
diff --git a/docs/superpowers/plans/2026-05-08-decisions-log-pr3.md b/docs/superpowers/plans/2026-05-08-decisions-log-pr3.md
deleted file mode 100644
index f8d943da..00000000
--- a/docs/superpowers/plans/2026-05-08-decisions-log-pr3.md
+++ /dev/null
@@ -1,1201 +0,0 @@
-# Decisions Log — PR #3: Round-Trip Sync — Implementation Plan
-
-> **For agentic workers:** REQUIRED SUB-SKILL: Use superpowers:subagent-driven-development to implement this plan task-by-task.
-
-**Goal:** Close the human iteration loop. Build a `decisions-sync` skill that reads the human-edited `decisions.gdoc`, diffs against `decisions.yaml`, and writes overrides back to the YAML so the next `/ace:run` (or `/ace:step idea-to-pdd`) honors the human's edits.
-
-**Architecture:**
-- **Parser** (`lib/decisions-parser.ts`): pure function that walks a Google Docs `documents.get` response and extracts `{id, default, options_considered}` per HEADING_3-anchored decision section. The renderer (PR #2) is the inverse — together they form a round-trip pair.
-- **Merger** (`lib/decisions-sync.ts`): pure function `mergeDecisions(parsedFromGdoc, currentLog)` returns `{mergedLog, changeReport}`. Diffs default values; on changes, updates `default` and sets `status: overridden`. Appends new `Considered:` bullets to `options_considered`. Reports unmatched rows for operator awareness.
-- **Runner** (`scripts/decisions-sync.ts`): the I/O glue — reads gdoc via Drive MCP, reads YAML, calls the merger, writes back via the schema-validated serializer. `DecisionsSyncDriveClient` interface for testability.
-- **Skill** (`skills/decisions-sync/SKILL.md`): thin wrapper, human-triggered via `/ace:step decisions-sync <opp>/<run-id>`.
-
-**Tech Stack:** TypeScript ESM, vitest, `lib/decisions-schema.ts` (helpers shipped in PR #1), `lib/decisions-renderer.ts` (PR #2 — for round-trip integration test).
-
-**Trigger model:** Human-triggered, not orchestrator-automated. Workflow:
-1. Human reads `decisions.gdoc` after a phase completes.
-2. Human edits a `Default:` line or adds a `Considered:` bullet directly in the gdoc.
-3. Human runs `/ace:step decisions-sync <opp>/<run-id>`.
-4. Sync writes the human's edits to `decisions.yaml` with `status: overridden`.
-5. Next `/ace:run` reads the updated YAML and uses overridden values as authoritative inputs.
-
-**Why human-triggered, not auto-on-run:** Auto-pickup risks silently overwriting AI defaults if a stale gdoc has unintended edits. Explicit invocation makes the override deliberate and auditable.
-
----
-
-## Spec coverage map
-
-| Concern | Covered by |
-|---|---|
-| Parse Google Docs structure into per-decision rows | Task 1 (parser) |
-| Diff parsed rows against current YAML; produce merged log + change report | Task 2 (merger) |
-| Read gdoc + read YAML + write merged YAML via Drive MCP | Task 3 (runner) |
-| Skill body wrapping the runner | Task 4 |
-| Round-trip test (renderer + parser inverse) | Task 1 (extra assertion) |
-| Full suite green | Task 5 |
-| Version bump + push + PR | Task 6 |
-
----
-
-## File structure
-
-**Create:**
-- `lib/decisions-parser.ts` — `parseDocumentStructure(doc: GoogleDocsDocument): ParsedDecisionRow[]`. Walks `body.content[]`, identifies HEADING_3 paragraphs as row IDs, extracts `Default:` value and `Considered:` bullets from subsequent body paragraphs until the next HEADING_3 or HEADING_2.
-- `test/lib/decisions-parser.test.ts` — unit tests against hand-authored Document fixtures + a round-trip test using `lib/decisions-renderer.ts`.
-- `lib/decisions-sync.ts` — `mergeDecisions(parsed: ParsedDecisionRow[], current: DecisionsLog): {merged: DecisionsLog; report: ChangeReport}`. Pure function, no I/O.
-- `test/lib/decisions-sync.test.ts` — unit tests on the merger.
-- `scripts/decisions-sync.ts` — runner: `runDecisionsSync({runFolderFileId, driveClient}): Promise<{report, gdocId}>`. `DecisionsSyncDriveClient` interface (`getDoc`, `readFile`, `writeFile`).
-- `test/skills/decisions-sync/script.test.ts` — integration test against fake Drive client.
-- `skills/decisions-sync/SKILL.md` — skill body.
-
-**Modify:**
-- `lib/artifact-manifest.ts` — add `decisions-sync` as a consumer of `decisions.gdoc` and a producer-of-overrides for `decisions.yaml` (in the consumedBy/producedBy fields, where the manifest schema permits — read what's there first).
-- `VERSION`, `package.json`, `package-lock.json`, `.claude-plugin/plugin.json`, `.claude-plugin/marketplace.json` — version-bumped.
-
----
-
-## Tasks
-
-### Task 1: Pure-function parser — `lib/decisions-parser.ts`
-
-**Files:**
-- Create: `lib/decisions-parser.ts`
-- Create: `test/lib/decisions-parser.test.ts`
-
-The parser walks a Google Docs API `documents.get` response and extracts decision rows. The renderer (PR #2) uses HEADING_1 (title), HEADING_2 (per phase), HEADING_3 (per decision id), and body paragraphs with bold field-label prefixes (`Default:`, `Considered:`, `Source:`, `Status:`). The parser is the inverse — it walks paragraphs in document order, finds HEADING_3 as decision boundaries, and extracts the per-decision body content.
-
-#### Document shape (subset we care about)
-
-A Google Docs `documents.get` response looks like:
-
-```ts
-type GoogleDocsDocument = {
-  body?: {
-    content?: GoogleDocsStructuralElement[];
-  };
-  // ... plus other fields we ignore (revisionId, documentStyle, etc.)
-};
-
-type GoogleDocsStructuralElement = {
-  paragraph?: {
-    elements?: Array<{
-      textRun?: { content?: string };
-    }>;
-    paragraphStyle?: {
-      namedStyleType?: string;  // "HEADING_1" | "HEADING_2" | "HEADING_3" | "NORMAL_TEXT" | ...
-    };
-    bullet?: { listId?: string };  // present iff paragraph is a bullet
-  };
-  // ... we ignore tables, sectionBreaks, etc.
-};
-```
-
-The parser only needs `paragraphStyle.namedStyleType`, the concatenated text of `paragraph.elements[].textRun.content`, and presence of `paragraph.bullet`.
-
-#### Output shape
-
-```ts
-type ParsedDecisionRow = {
-  id: string;
-  default?: string;            // text after "Default: " — undefined if no Default line found
-  options_considered?: string[]; // bullets after "Considered:" up to next field — undefined if no Considered: section
-};
-```
-
-The parser does NOT extract `phase`, `skill`, `question`, `source`, `status`, or `notes` — those are not editable surfaces in the round-trip workflow and the merger pulls them from the YAML.
-
-- [ ] **Step 1: Write the failing test.**
-
-Create `test/lib/decisions-parser.test.ts`:
-
-```ts
-import { describe, expect, it } from "vitest";
-import { parseDocumentStructure } from "../../lib/decisions-parser.js";
-import { renderDecisionsLog } from "../../lib/decisions-renderer.js";
-import type { DecisionsLog } from "../../lib/decisions-schema.js";
-
-// Helper: synthesize a minimal Google Docs Document from a list of
-// (text, namedStyleType, isBullet) tuples. Mimics the post-render shape.
-function makeDoc(
-  paragraphs: Array<{ text: string; style?: string; bullet?: boolean }>,
-): { body: { content: unknown[] } } {
-  return {
-    body: {
-      content: paragraphs.map((p) => ({
-        paragraph: {
-          elements: [{ textRun: { content: p.text + "\n" } }],
-          paragraphStyle: { namedStyleType: p.style ?? "NORMAL_TEXT" },
-          ...(p.bullet ? { bullet: { listId: "kix.list1" } } : {}),
-        },
-      })),
-    },
-  };
-}
-
-describe("parseDocumentStructure", () => {
-  it("returns an empty array for an empty document body", () => {
-    const doc = { body: { content: [] } };
-    expect(parseDocumentStructure(doc)).toEqual([]);
-  });
-
-  it("extracts a single decision with default + considered options", () => {
-    const doc = makeDoc([
-      { text: "Decisions Log — turmeric", style: "HEADING_1" },
-      { text: "Phase 1 — Design", style: "HEADING_2" },
-      { text: "archetype-selection", style: "HEADING_3" },
-      { text: "Which delivery archetype best fits?", style: "NORMAL_TEXT" },
-      { text: "  Default: atomic-visit" },
-      { text: "  Considered:" },
-      { text: "atomic-visit", bullet: true },
-      { text: "focus-group", bullet: true },
-      { text: "multi-stage", bullet: true },
-      { text: "  Source: idea.md §1" },
-      { text: "  Status: applied" },
-    ]);
-    const rows = parseDocumentStructure(doc);
-    expect(rows).toHaveLength(1);
-    expect(rows[0]).toEqual({
-      id: "archetype-selection",
-      default: "atomic-visit",
-      options_considered: ["atomic-visit", "focus-group", "multi-stage"],
-    });
-  });
-
-  it("extracts multiple decisions across multiple phases", () => {
-    const doc = makeDoc([
-      { text: "Phase 1 — Design", style: "HEADING_2" },
-      { text: "row-a", style: "HEADING_3" },
-      { text: "  Default: alpha" },
-      { text: "row-b", style: "HEADING_3" },
-      { text: "  Default: beta" },
-      { text: "Phase 2 — CommCare", style: "HEADING_2" },
-      { text: "row-c", style: "HEADING_3" },
-      { text: "  Default: gamma" },
-    ]);
-    const rows = parseDocumentStructure(doc);
-    expect(rows.map((r) => r.id)).toEqual(["row-a", "row-b", "row-c"]);
-    expect(rows.map((r) => r.default)).toEqual(["alpha", "beta", "gamma"]);
-  });
-
-  it("handles a row with no Default: line (undefined default)", () => {
-    const doc = makeDoc([
-      { text: "row-x", style: "HEADING_3" },
-      { text: "Some question?", style: "NORMAL_TEXT" },
-      { text: "  Source: idea.md" },
-    ]);
-    const rows = parseDocumentStructure(doc);
-    expect(rows).toHaveLength(1);
-    expect(rows[0]!.id).toBe("row-x");
-    expect(rows[0]!.default).toBeUndefined();
-  });
-
-  it("handles a row with Default but no Considered: section", () => {
-    const doc = makeDoc([
-      { text: "row-y", style: "HEADING_3" },
-      { text: "  Default: solo-value" },
-    ]);
-    const rows = parseDocumentStructure(doc);
-    expect(rows[0]!.default).toBe("solo-value");
-    expect(rows[0]!.options_considered).toBeUndefined();
-  });
-
-  it("ignores trailing whitespace and the Default: indent", () => {
-    const doc = makeDoc([
-      { text: "row-z", style: "HEADING_3" },
-      { text: "    Default:   spaced-value   " },
-    ]);
-    const rows = parseDocumentStructure(doc);
-    expect(rows[0]!.default).toBe("spaced-value");
-  });
-
-  it("round-trips a DecisionsLog through render → parse without losing default/options", () => {
-    const log: DecisionsLog = {
-      schema_version: 1,
-      opportunity: "turmeric",
-      run_id: "20260507-1733",
-      generated_at: "2026-05-07T17:33:00Z",
-      decisions: [
-        {
-          id: "archetype-selection",
-          phase: "1-design",
-          skill: "idea-to-pdd",
-          question: "Which delivery archetype?",
-          default: "atomic-visit",
-          options_considered: ["atomic-visit", "focus-group", "multi-stage"],
-          source: "idea.md §1",
-          status: "applied",
-        },
-        {
-          id: "flw-count",
-          phase: "1-design",
-          skill: "idea-to-pdd",
-          question: "How many FLWs?",
-          default: "5–8",
-          options_considered: ["3–5", "5–8", "10–15"],
-          source: "idea.md §2",
-          status: "applied",
-        },
-      ],
-    };
-
-    // Render → simulated Document → parse
-    const requests = renderDecisionsLog(log);
-    const doc = simulateDocument(requests);
-    const parsed = parseDocumentStructure(doc);
-
-    // Renderer + parser should be inverses for the round-trip fields.
-    expect(parsed.map((r) => r.id)).toEqual(["archetype-selection", "flw-count"]);
-    expect(parsed[0]!.default).toBe("atomic-visit");
-    expect(parsed[0]!.options_considered).toEqual([
-      "atomic-visit",
-      "focus-group",
-      "multi-stage",
-    ]);
-    expect(parsed[1]!.default).toBe("5–8");
-    expect(parsed[1]!.options_considered).toEqual(["3–5", "5–8", "10–15"]);
-  });
-});
-
-/**
- * Simulate the Google Docs document state produced by applying a list
- * of batchUpdate requests starting from an empty doc. Used only to
- * round-trip the renderer through the parser in tests; not a complete
- * Docs API simulator (only handles insertText, updateParagraphStyle,
- * updateTextStyle, createParagraphBullets — the request kinds the
- * renderer actually emits).
- */
-function simulateDocument(requests: any[]): {
-  body: { content: any[] };
-} {
-  // Build a single text buffer first.
-  let text = "";
-  for (const r of requests) {
-    if ("insertText" in r) {
-      text += r.insertText.text;
-    }
-  }
-  // Split into paragraphs on \n.
-  const lines = text.split("\n");
-  if (lines[lines.length - 1] === "") lines.pop(); // drop trailing empty from last \n
-
-  // Compute each line's start/end indices in the doc-1-indexed coordinate
-  // space the renderer uses.
-  let cursor = 1;
-  const linePositions: Array<{ start: number; end: number; text: string }> = [];
-  for (const line of lines) {
-    const start = cursor;
-    const end = cursor + line.length;
-    linePositions.push({ start, end, text: line });
-    cursor = end + 1; // +1 for the \n
-  }
-
-  // Apply paragraph-style requests by matching range to a line.
-  const paragraphStyles = new Map<number, string>(); // line index → namedStyleType
-  const bulletLines = new Set<number>();
-  for (const r of requests) {
-    if ("updateParagraphStyle" in r) {
-      const { range, paragraphStyle } = r.updateParagraphStyle;
-      const namedStyleType = paragraphStyle?.namedStyleType;
-      if (!namedStyleType) continue;
-      // Find lines whose start falls within [range.startIndex, range.endIndex)
-      for (let i = 0; i < linePositions.length; i++) {
-        const lp = linePositions[i]!;
-        if (lp.start >= range.startIndex && lp.start < range.endIndex) {
-          paragraphStyles.set(i, namedStyleType);
-        }
-      }
-    } else if ("createParagraphBullets" in r) {
-      const { range } = r.createParagraphBullets;
-      for (let i = 0; i < linePositions.length; i++) {
-        const lp = linePositions[i]!;
-        if (lp.start >= range.startIndex && lp.start < range.endIndex) {
-          bulletLines.add(i);
-        }
-      }
-    }
-  }
-
-  return {
-    body: {
-      content: linePositions.map((lp, i) => ({
-        paragraph: {
-          elements: [{ textRun: { content: lp.text + "\n" } }],
-          paragraphStyle: { namedStyleType: paragraphStyles.get(i) ?? "NORMAL_TEXT" },
-          ...(bulletLines.has(i) ? { bullet: { listId: "kix.list1" } } : {}),
-        },
-      })),
-    },
-  };
-}
-```
-
-- [ ] **Step 2: Run the test to verify it fails.**
-
-Run: `npx vitest run test/lib/decisions-parser.test.ts`
-Expected: FAIL with `Cannot find module '../../lib/decisions-parser.js'`.
-
-- [ ] **Step 3: Implement the parser.**
-
-Create `lib/decisions-parser.ts`. Key implementation hints:
-
-```ts
-export type ParsedDecisionRow = {
-  id: string;
-  default?: string;
-  options_considered?: string[];
-};
-
-type DocsParagraph = {
-  elements?: Array<{ textRun?: { content?: string } }>;
-  paragraphStyle?: { namedStyleType?: string };
-  bullet?: { listId?: string };
-};
-
-type DocsStructuralElement = { paragraph?: DocsParagraph };
-
-type GoogleDocsDocument = { body?: { content?: DocsStructuralElement[] } };
-
-const HEADING_3 = "HEADING_3";
-const HEADING_2 = "HEADING_2";
-
-/** Concatenate all textRun.content values inside a paragraph. */
-function paragraphText(p: DocsParagraph): string {
-  if (!p.elements) return "";
-  return p.elements
-    .map((e) => e.textRun?.content ?? "")
-    .join("")
-    .replace(/\n$/, "");  // strip the trailing newline that ends every paragraph
-}
-
-export function parseDocumentStructure(
-  doc: GoogleDocsDocument,
-): ParsedDecisionRow[] {
-  const content = doc.body?.content ?? [];
-  const rows: ParsedDecisionRow[] = [];
-
-  // State machine: walk paragraphs in order. When we hit a HEADING_3
-  // we open a new row. Subsequent body paragraphs (until the next
-  // HEADING_3 or HEADING_2) get scanned for "Default: <value>" and
-  // a "Considered:" header followed by bullets.
-  let current: ParsedDecisionRow | null = null;
-  let inConsidered = false;
-
-  function commit() {
-    if (current) rows.push(current);
-    current = null;
-    inConsidered = false;
-  }
-
-  for (const el of content) {
-    const p = el.paragraph;
-    if (!p) continue;
-    const style = p.paragraphStyle?.namedStyleType ?? "NORMAL_TEXT";
-    const text = paragraphText(p);
-
-    if (style === HEADING_3) {
-      commit();
-      current = { id: text.trim() };
-      continue;
-    }
-    if (style === HEADING_2) {
-      commit();
-      continue;
-    }
-    if (!current) continue;
-
-    // We're inside a decision section.
-    const trimmed = text.trim();
-
-    // Bullet under "Considered:"
-    if (inConsidered && p.bullet) {
-      const bulletText = trimmed;
-      if (bulletText) {
-        current.options_considered = current.options_considered ?? [];
-        current.options_considered.push(bulletText);
-      }
-      continue;
-    }
-
-    // Field-prefix lines: any other field exits the Considered: block.
-    if (/^Default:/i.test(trimmed)) {
-      current.default = trimmed.replace(/^Default:\s*/i, "").trim();
-      inConsidered = false;
-      continue;
-    }
-    if (/^Considered:/i.test(trimmed)) {
-      inConsidered = true;
-      continue;
-    }
-    if (/^Source:/i.test(trimmed) || /^Status:/i.test(trimmed)) {
-      inConsidered = false;
-      continue;
-    }
-    // Any other body paragraph (question, notes, etc.) is ignored —
-    // not a round-trip-editable surface.
-  }
-
-  commit();
-  return rows;
-}
-```
-
-- [ ] **Step 4: Run the test to verify it passes.**
-
-Run: `npx vitest run test/lib/decisions-parser.test.ts`
-Expected: PASS, 7/7 green.
-
-- [ ] **Step 5: Commit.**
-
-```bash
-git add lib/decisions-parser.ts test/lib/decisions-parser.test.ts
-git commit -m "lib: add pure-function parser for decisions gdoc
-
-parseDocumentStructure(doc) walks a Google Docs documents.get response,
-identifies HEADING_3 paragraphs as decision-row IDs, and extracts the
-'Default:' value and 'Considered:' bullet list per decision section.
-Inverse of lib/decisions-renderer.ts; round-trip tested.
-
-Other fields (phase, skill, question, source, status, notes) are not
-extracted — they're not editable surfaces in the human-iteration
-workflow. The merger pulls them from the YAML.
-
-Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>"
-```
-
----
-
-### Task 2: Pure-function merger — `lib/decisions-sync.ts`
-
-**Files:**
-- Create: `lib/decisions-sync.ts`
-- Create: `test/lib/decisions-sync.test.ts`
-
-Diffs parsed gdoc rows against the current YAML log; produces a merged log + change report.
-
-#### Behavior
-
-For each parsed row matched to a YAML row by `id`:
-- If `parsed.default` is defined and differs from `yaml.default`: update YAML's `default` to parsed value, set `status: overridden`. Add the prior YAML default to `options_considered` if not already there.
-- If `parsed.options_considered` contains entries not in `yaml.options_considered`: append them.
-- If `parsed.options_considered` is missing entries that are in `yaml.options_considered`: leave the YAML entries alone (don't auto-delete; deletions are too risky to infer from a missing bullet).
-
-Unmatched rows:
-- A parsed row with no matching YAML row → warning in the report; not added to YAML.
-- A YAML row with no matching parsed row → warning in the report; YAML row preserved unchanged.
-
-#### Output
-
-```ts
-type ChangeReport = {
-  defaultsOverridden: Array<{ id: string; from: string; to: string }>;
-  optionsAdded: Array<{ id: string; option: string }>;
-  parsedNotInYaml: string[];   // ids in gdoc but not in YAML
-  yamlNotInParsed: string[];   // ids in YAML but not in gdoc
-};
-```
-
-- [ ] **Step 1: Write the failing test.**
-
-Create `test/lib/decisions-sync.test.ts`:
-
-```ts
-import { describe, expect, it } from "vitest";
-import { mergeDecisions } from "../../lib/decisions-sync.js";
-import type { DecisionsLog } from "../../lib/decisions-schema.js";
-
-const baseLog: DecisionsLog = {
-  schema_version: 1,
-  opportunity: "turmeric",
-  run_id: "20260507-1733",
-  generated_at: "2026-05-07T17:33:00Z",
-  decisions: [
-    {
-      id: "archetype-selection",
-      phase: "1-design",
-      skill: "idea-to-pdd",
-      question: "Which delivery archetype?",
-      default: "atomic-visit",
-      options_considered: ["atomic-visit", "focus-group", "multi-stage"],
-      source: "idea.md §1",
-      status: "applied",
-    },
-    {
-      id: "flw-count",
-      phase: "1-design",
-      skill: "idea-to-pdd",
-      question: "How many FLWs?",
-      default: "5–8",
-      options_considered: ["3–5", "5–8", "10–15"],
-      source: "idea.md §2",
-      status: "applied",
-    },
-  ],
-};
-
-describe("mergeDecisions", () => {
-  it("preserves the YAML when parsed rows match it exactly", () => {
-    const parsed = [
-      { id: "archetype-selection", default: "atomic-visit", options_considered: ["atomic-visit", "focus-group", "multi-stage"] },
-      { id: "flw-count", default: "5–8", options_considered: ["3–5", "5–8", "10–15"] },
-    ];
-    const { merged, report } = mergeDecisions(parsed, baseLog);
-    expect(merged).toEqual(baseLog);
-    expect(report.defaultsOverridden).toEqual([]);
-    expect(report.optionsAdded).toEqual([]);
-  });
-
-  it("overrides default when human edits it; sets status=overridden", () => {
-    const parsed = [
-      { id: "flw-count", default: "12", options_considered: ["3–5", "5–8", "10–15"] },
-    ];
-    const { merged, report } = mergeDecisions(parsed, baseLog);
-    const flw = merged.decisions.find((d) => d.id === "flw-count")!;
-    expect(flw.default).toBe("12");
-    expect(flw.status).toBe("overridden");
-    expect(report.defaultsOverridden).toEqual([
-      { id: "flw-count", from: "5–8", to: "12" },
-    ]);
-  });
-
-  it("preserves the prior default in options_considered when overriding", () => {
-    const parsed = [
-      { id: "flw-count", default: "12", options_considered: ["3–5", "5–8", "10–15"] },
-    ];
-    const { merged } = mergeDecisions(parsed, baseLog);
-    const flw = merged.decisions.find((d) => d.id === "flw-count")!;
-    expect(flw.options_considered).toContain("5–8"); // original default
-    expect(flw.options_considered).toContain("12");  // new default
-  });
-
-  it("does not duplicate the prior default if already in options_considered", () => {
-    const parsed = [
-      // Human picked an existing option as new default
-      { id: "flw-count", default: "10–15", options_considered: ["3–5", "5–8", "10–15"] },
-    ];
-    const { merged } = mergeDecisions(parsed, baseLog);
-    const flw = merged.decisions.find((d) => d.id === "flw-count")!;
-    expect(flw.default).toBe("10–15");
-    // 5–8 prior default should be added (it wasn't in the new selection)
-    expect(flw.options_considered.filter((x) => x === "5–8")).toHaveLength(1);
-    // 10–15 should not be duplicated
-    expect(flw.options_considered.filter((x) => x === "10–15")).toHaveLength(1);
-  });
-
-  it("appends new Considered bullets that aren't in YAML", () => {
-    const parsed = [
-      { id: "archetype-selection", default: "atomic-visit", options_considered: ["atomic-visit", "focus-group", "multi-stage", "novel-archetype"] },
-    ];
-    const { merged, report } = mergeDecisions(parsed, baseLog);
-    const arch = merged.decisions.find((d) => d.id === "archetype-selection")!;
-    expect(arch.options_considered).toContain("novel-archetype");
-    expect(report.optionsAdded).toEqual([
-      { id: "archetype-selection", option: "novel-archetype" },
-    ]);
-  });
-
-  it("does NOT delete options that are missing from parsed", () => {
-    const parsed = [
-      // Human deleted "focus-group" bullet from gdoc
-      { id: "archetype-selection", default: "atomic-visit", options_considered: ["atomic-visit", "multi-stage"] },
-    ];
-    const { merged } = mergeDecisions(parsed, baseLog);
-    const arch = merged.decisions.find((d) => d.id === "archetype-selection")!;
-    expect(arch.options_considered).toContain("focus-group"); // preserved
-  });
-
-  it("reports parsed rows not matched in YAML (warns; doesn't add)", () => {
-    const parsed = [
-      { id: "ghost-row", default: "abc", options_considered: ["abc"] },
-    ];
-    const { merged, report } = mergeDecisions(parsed, baseLog);
-    expect(merged.decisions).toHaveLength(2); // unchanged
-    expect(report.parsedNotInYaml).toEqual(["ghost-row"]);
-  });
-
-  it("reports YAML rows not present in parsed (warns; preserves)", () => {
-    const parsed = [
-      { id: "archetype-selection", default: "atomic-visit", options_considered: ["atomic-visit"] },
-      // flw-count is missing from gdoc
-    ];
-    const { merged, report } = mergeDecisions(parsed, baseLog);
-    expect(merged.decisions).toHaveLength(2); // both preserved
-    expect(report.yamlNotInParsed).toEqual(["flw-count"]);
-  });
-
-  it("ignores parsed rows with undefined default (no override applied)", () => {
-    const parsed = [
-      { id: "flw-count" }, // no default, no options
-    ];
-    const { merged, report } = mergeDecisions(parsed, baseLog);
-    const flw = merged.decisions.find((d) => d.id === "flw-count")!;
-    expect(flw.default).toBe("5–8");
-    expect(flw.status).toBe("applied");
-    expect(report.defaultsOverridden).toEqual([]);
-  });
-});
-```
-
-- [ ] **Step 2: Run the test.**
-
-Run: `npx vitest run test/lib/decisions-sync.test.ts`
-Expected: FAIL with module not found.
-
-- [ ] **Step 3: Implement the merger.**
-
-Create `lib/decisions-sync.ts`:
-
-```ts
-import type { DecisionsLog, DecisionRow } from "./decisions-schema.js";
-import type { ParsedDecisionRow } from "./decisions-parser.js";
-
-export type ChangeReport = {
-  defaultsOverridden: Array<{ id: string; from: string; to: string }>;
-  optionsAdded: Array<{ id: string; option: string }>;
-  parsedNotInYaml: string[];
-  yamlNotInParsed: string[];
-};
-
-export function mergeDecisions(
-  parsed: ParsedDecisionRow[],
-  current: DecisionsLog,
-): { merged: DecisionsLog; report: ChangeReport } {
-  const report: ChangeReport = {
-    defaultsOverridden: [],
-    optionsAdded: [],
-    parsedNotInYaml: [],
-    yamlNotInParsed: [],
-  };
-
-  const parsedById = new Map<string, ParsedDecisionRow>();
-  for (const p of parsed) parsedById.set(p.id, p);
-
-  const yamlIds = new Set(current.decisions.map((d) => d.id));
-  for (const p of parsed) {
-    if (!yamlIds.has(p.id)) report.parsedNotInYaml.push(p.id);
-  }
-  for (const d of current.decisions) {
-    if (!parsedById.has(d.id)) report.yamlNotInParsed.push(d.id);
-  }
-
-  const merged: DecisionsLog = {
-    ...current,
-    decisions: current.decisions.map((d) => mergeRow(d, parsedById.get(d.id), report)),
-  };
-
-  return { merged, report };
-}
-
-function mergeRow(
-  yamlRow: DecisionRow,
-  parsedRow: ParsedDecisionRow | undefined,
-  report: ChangeReport,
-): DecisionRow {
-  if (!parsedRow) return yamlRow;
-
-  let updated: DecisionRow = yamlRow;
-
-  // Handle default override.
-  if (
-    parsedRow.default !== undefined &&
-    parsedRow.default !== yamlRow.default
-  ) {
-    const newOptions = [...yamlRow.options_considered];
-    // Preserve the AI's prior default in the option menu.
-    if (!newOptions.includes(yamlRow.default)) {
-      newOptions.push(yamlRow.default);
-    }
-    // Add the new value to options if not already there.
-    if (!newOptions.includes(parsedRow.default)) {
-      newOptions.push(parsedRow.default);
-    }
-    updated = {
-      ...updated,
-      default: parsedRow.default,
-      status: "overridden",
-      options_considered: newOptions,
-    };
-    report.defaultsOverridden.push({
-      id: yamlRow.id,
-      from: yamlRow.default,
-      to: parsedRow.default,
-    });
-  }
-
-  // Handle additive options_considered changes.
-  if (parsedRow.options_considered) {
-    const existing = new Set(updated.options_considered);
-    const newlyAdded: string[] = [];
-    for (const opt of parsedRow.options_considered) {
-      if (!existing.has(opt)) {
-        newlyAdded.push(opt);
-        existing.add(opt);
-      }
-    }
-    if (newlyAdded.length > 0) {
-      updated = {
-        ...updated,
-        options_considered: [...updated.options_considered, ...newlyAdded],
-      };
-      for (const opt of newlyAdded) {
-        report.optionsAdded.push({ id: yamlRow.id, option: opt });
-      }
-    }
-  }
-
-  return updated;
-}
-```
-
-- [ ] **Step 4: Run the test to verify it passes.**
-
-Run: `npx vitest run test/lib/decisions-sync.test.ts`
-Expected: PASS, 9/9 green.
-
-- [ ] **Step 5: Commit.**
-
-```bash
-git add lib/decisions-sync.ts test/lib/decisions-sync.test.ts
-git commit -m "lib: add merger for decisions-sync round-trip
-
-mergeDecisions(parsed, current) returns merged DecisionsLog +
-ChangeReport. Diffs parsed gdoc rows against the current YAML log;
-on default override, sets status: overridden and preserves the prior
-default in options_considered. New Considered bullets are appended;
-missing bullets are NOT auto-deleted (too risky to infer from
-absence). Unmatched rows in either direction surface in the report.
-
-Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>"
-```
-
----
-
-### Task 3: Runner script — `scripts/decisions-sync.ts`
-
-**Files:**
-- Create: `scripts/decisions-sync.ts`
-- Create: `test/skills/decisions-sync/script.test.ts`
-
-The runner glues parser + merger + Drive I/O together.
-
-- [ ] **Step 1: Write the failing test.**
-
-Create `test/skills/decisions-sync/script.test.ts`:
-
-```ts
-import { describe, expect, it, vi } from "vitest";
-import { runDecisionsSync } from "../../../scripts/decisions-sync.js";
-
-const VALID_YAML = `schema_version: 1
-opportunity: turmeric
-run_id: 20260507-1733
-generated_at: "2026-05-07T17:33:00Z"
-decisions:
-  - id: flw-count
-    phase: 1-design
-    skill: idea-to-pdd
-    question: How many FLWs?
-    default: "5–8"
-    options_considered: ["3–5", "5–8", "10–15"]
-    source: idea.md §2
-    status: applied
-`;
-
-const FAKE_GDOC = {
-  body: {
-    content: [
-      {
-        paragraph: {
-          elements: [{ textRun: { content: "flw-count\n" } }],
-          paragraphStyle: { namedStyleType: "HEADING_3" },
-        },
-      },
-      {
-        paragraph: {
-          elements: [{ textRun: { content: "  Default: 12\n" } }],
-          paragraphStyle: { namedStyleType: "NORMAL_TEXT" },
-        },
-      },
-    ],
-  },
-};
-
-function makeFakeDriveClient() {
-  return {
-    findFile: vi.fn(),
-    getDoc: vi.fn().mockResolvedValue(FAKE_GDOC),
-    readFile: vi.fn().mockResolvedValue({ content: VALID_YAML }),
-    writeFile: vi.fn().mockResolvedValue(undefined),
-  };
-}
-
-describe("runDecisionsSync", () => {
-  it("reads gdoc + yaml, merges, writes updated yaml, returns the change report", async () => {
-    const client = makeFakeDriveClient();
-    client.findFile.mockResolvedValueOnce({ id: "fake-gdoc-id" });
-
-    const result = await runDecisionsSync({
-      runFolderFileId: "fake-folder-id",
-      driveClient: client,
-    });
-
-    expect(client.findFile).toHaveBeenCalledWith({
-      parentFolderId: "fake-folder-id",
-      name: "decisions.gdoc",
-    });
-    expect(client.getDoc).toHaveBeenCalledWith("fake-gdoc-id");
-    expect(client.readFile).toHaveBeenCalledWith({
-      parentFolderId: "fake-folder-id",
-      name: "decisions.yaml",
-    });
-    expect(client.writeFile).toHaveBeenCalled();
-
-    const writeArgs = client.writeFile.mock.calls[0]![0];
-    expect(writeArgs.parentFolderId).toBe("fake-folder-id");
-    expect(writeArgs.name).toBe("decisions.yaml");
-    expect(writeArgs.content).toContain("default: \"12\"");
-    expect(writeArgs.content).toContain("status: overridden");
-
-    expect(result.report.defaultsOverridden).toEqual([
-      { id: "flw-count", from: "5–8", to: "12" },
-    ]);
-  });
-
-  it("throws an actionable error when decisions.gdoc is missing", async () => {
-    const client = makeFakeDriveClient();
-    client.findFile.mockResolvedValueOnce(null);
-
-    await expect(
-      runDecisionsSync({ runFolderFileId: "fake-folder-id", driveClient: client }),
-    ).rejects.toThrow(/decisions\.gdoc/);
-  });
-
-  it("throws an actionable error when decisions.yaml is missing", async () => {
-    const client = makeFakeDriveClient();
-    client.findFile.mockResolvedValueOnce({ id: "fake-gdoc-id" });
-    client.readFile.mockRejectedValueOnce(new Error("File not found"));
-
-    await expect(
-      runDecisionsSync({ runFolderFileId: "fake-folder-id", driveClient: client }),
-    ).rejects.toThrow(/decisions\.yaml/);
-  });
-});
-```
-
-- [ ] **Step 2: Run the test to verify it fails.**
-
-Run: `npx vitest run test/skills/decisions-sync/script.test.ts`
-Expected: FAIL with module not found.
-
-- [ ] **Step 3: Implement the runner.**
-
-Create `scripts/decisions-sync.ts`:
-
-```ts
-#!/usr/bin/env npx tsx
-import { parseDecisionsYaml, serializeDecisionsLog } from "../lib/decisions-schema.js";
-import { parseDocumentStructure } from "../lib/decisions-parser.js";
-import { mergeDecisions, type ChangeReport } from "../lib/decisions-sync.js";
-
-export interface DecisionsSyncDriveClient {
-  findFile(args: { parentFolderId: string; name: string }): Promise<{ id: string } | null>;
-  getDoc(documentId: string): Promise<unknown>;
-  readFile(args: { parentFolderId: string; name: string }): Promise<{ content: string }>;
-  writeFile(args: { parentFolderId: string; name: string; content: string }): Promise<void>;
-}
-
-export interface RunDecisionsSyncArgs {
-  runFolderFileId: string;
-  driveClient: DecisionsSyncDriveClient;
-}
-
-export interface RunDecisionsSyncResult {
-  gdocId: string;
-  report: ChangeReport;
-}
-
-export async function runDecisionsSync(
-  args: RunDecisionsSyncArgs,
-): Promise<RunDecisionsSyncResult> {
-  const { runFolderFileId, driveClient } = args;
-
-  const gdocFile = await driveClient.findFile({
-    parentFolderId: runFolderFileId,
-    name: "decisions.gdoc",
-  });
-  if (!gdocFile) {
-    throw new Error(
-      `decisions.gdoc not found in run folder ${runFolderFileId}. Run /ace:step decisions-render first to produce the gdoc.`,
-    );
-  }
-
-  const doc = await driveClient.getDoc(gdocFile.id);
-  const parsedRows = parseDocumentStructure(doc as Parameters<typeof parseDocumentStructure>[0]);
-
-  let yamlContent: string;
-  try {
-    const file = await driveClient.readFile({
-      parentFolderId: runFolderFileId,
-      name: "decisions.yaml",
-    });
-    yamlContent = file.content;
-  } catch (err) {
-    const msg = err instanceof Error ? err.message : String(err);
-    throw new Error(
-      `decisions.yaml not found in run folder ${runFolderFileId}: ${msg}`,
-    );
-  }
-
-  const currentLog = parseDecisionsYaml(yamlContent);
-  const { merged, report } = mergeDecisions(parsedRows, currentLog);
-
-  // Bump generated_at on every sync write.
-  merged.generated_at = new Date().toISOString();
-
-  const newYaml = serializeDecisionsLog(merged);
-  await driveClient.writeFile({
-    parentFolderId: runFolderFileId,
-    name: "decisions.yaml",
-    content: newYaml,
-  });
-
-  return { gdocId: gdocFile.id, report };
-}
-
-if (import.meta.url === `file://${process.argv[1]}`) {
-  console.error("Direct CLI mode not yet wired — invoke via /ace:step decisions-sync <opp>/<run-id> instead.");
-  process.exit(2);
-}
-```
-
-- [ ] **Step 4: Run the test.**
-
-Run: `npx vitest run test/skills/decisions-sync/script.test.ts`
-Expected: PASS, 3/3 green.
-
-- [ ] **Step 5: Commit.**
-
-```bash
-git add scripts/decisions-sync.ts test/skills/decisions-sync/script.test.ts
-git commit -m "scripts: add decisions-sync runner
-
-Reads decisions.gdoc + decisions.yaml from a run folder, parses the
-gdoc, merges human edits into the YAML, writes the updated YAML back.
-DriveClient interface (findFile / getDoc / readFile / writeFile)
-decouples from MCP for testability.
-
-Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>"
-```
-
----
-
-### Task 4: Skill body — `skills/decisions-sync/SKILL.md`
-
-**Files:**
-- Create: `skills/decisions-sync/SKILL.md`
-
-- [ ] **Step 1: Write the SKILL.md.**
-
-Create `skills/decisions-sync/SKILL.md`:
-
-```markdown
----
-name: decisions-sync
-description: >
-  Sync human edits from the per-run decisions.gdoc back into
-  decisions.yaml. Human-triggered via /ace:step decisions-sync; not
-  part of the orchestrator's automatic phase loop.
-disable-model-invocation: true
----
-
-# Decisions Sync
-
-Read the human-edited `decisions.gdoc` for a run, diff against
-`decisions.yaml`, and write overrides back to the YAML so subsequent
-runs honor the human's edits.
-
-## Inputs
-
-| Source | Artifact | Used for |
-|---|---|---|
-| Per-run state | `ACE/<opp-name>/runs/<run-id>/decisions.gdoc` | human-edited prose Doc — source of overrides |
-| Per-run state | `ACE/<opp-name>/runs/<run-id>/decisions.yaml` | structured log to update |
-
-## Outputs
-
-- `ACE/<opp-name>/runs/<run-id>/decisions.yaml` — updated in place. Rows where the human changed `Default:` get `status: overridden` and the prior default is preserved in `options_considered`. New `Considered:` bullets are appended.
-
-## Process
-
-1. **Resolve the run folder file ID** for `ACE/<opp-name>/runs/<run-id>/`.
-2. **Run the sync script:**
-
-   ```bash
-   npx tsx scripts/decisions-sync.ts <run-folder-fileId>
-   ```
-
-   The script:
-   - Finds `decisions.gdoc` in the run folder; halts with actionable error if missing.
-   - Reads the doc structure via `docs_get`.
-   - Parses via `parseDocumentStructure` from `lib/decisions-parser.ts`.
-   - Reads `decisions.yaml`; halts if missing.
-   - Merges via `mergeDecisions` from `lib/decisions-sync.ts`.
-   - Writes the merged YAML back via `drive_update_file`.
-   - Returns a change report (defaults overridden, options added, unmatched rows).
-
-3. **Surface the change report** to the operator. Format:
-
-   ```
-   Decisions sync — turmeric/20260507-1733
-     Defaults overridden: 2
-       - flw-count: 5–8 → 12
-       - ai-photo-threshold: ≥90% → ≥95%
-     Options added: 1
-       - archetype-selection: novel-archetype
-     Parsed rows not in YAML: 0
-     YAML rows not in gdoc: 0
-   ```
-
-4. **Suggest the next step**: re-run `/ace:step idea-to-pdd <opp>/<run-id>` (or `/ace:run <opp>`) so subsequent phases consume the overridden values.
-
-## Failure modes
-
-- **decisions.gdoc missing**: halts with "Run /ace:step decisions-render first to produce the gdoc."
-- **decisions.yaml missing**: halts with the path that wasn't found.
-- **Schema-invalid YAML after merge**: should not happen — the merger preserves all required fields. If it does, the merger has a bug; the operator should file an issue with the change report.
-- **Heading mismatch (gdoc has rows YAML doesn't or vice versa)**: warned in the report; sync proceeds with the matched rows.
-
-## Trigger model
-
-This skill is **human-triggered**. The orchestrator's Phase Write-Back Verifier does NOT auto-invoke it — that would silently overwrite AI defaults from any stale gdoc edits. Always run explicitly:
-
-```
-/ace:step decisions-sync <opp>/<run-id>
-```
-
-after editing the gdoc, before re-running the affected phases.
-
-## MCP Tools Used
-
-- Google Drive: `drive_list_folder`, `drive_read_file`, `drive_update_file`, `docs_get`
-
-## Mode Behavior
-
-- **Auto:** Run, surface the report, return.
-- **Review:** Same as Auto — sync is itself the review-and-apply step; no further pause needed.
-
-## Change Log
-
-| Date | Change | Author |
-|------|--------|--------|
-| 2026-05-08 | Initial skill — pairs with `lib/decisions-parser.ts`, `lib/decisions-sync.ts`, `scripts/decisions-sync.ts`. Round-trips human edits from `decisions.gdoc` into `decisions.yaml`. Human-triggered. | ACE team (decisions-log PR #3) |
-```
-
-- [ ] **Step 2: Sanity-check.**
-
-Run: `head -1 skills/decisions-sync/SKILL.md`
-Expected: `---`
-
-Run: `grep -c "^## " skills/decisions-sync/SKILL.md`
-Expected: 8 (Inputs, Outputs, Process, Failure modes, Trigger model, MCP Tools Used, Mode Behavior, Change Log).
-
-- [ ] **Step 3: Commit.**
-
-```bash
-git add skills/decisions-sync/SKILL.md
-git commit -m "skill(decisions-sync): wrap parser + merger + script as an ACE skill
-
-Human-triggered via /ace:step decisions-sync. Reads the gdoc, diffs
-against the YAML, writes overrides back. Surfaces a change report
-(defaults overridden, options added, unmatched rows) to the operator.
-
-Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>"
-```
-
----
-
-### Task 5: Run full test suite
-
-- [ ] **Step 1: Run `npm test`.**
-
-Run: `npm test`
-Expected: PASS — full vitest suite green. New tests: parser (~7), merger (~9), sync runner (~3) = 19 added assertions.
-
-- [ ] **Step 2: If anything fails, fix it.**
-
-The most likely failure mode is the round-trip test in the parser if the renderer's exact request shape differs from what `simulateDocument` reconstructs. If that happens, debug by logging `requests` from the renderer call and walking through the simulation step by step.
-
----
-
-### Task 6: Version bump and PR
-
-- [ ] **Step 1: Bump version.**
-
-Run: `bash scripts/version-bump.sh`
-Expected: VERSION updated to next patch.
-
-- [ ] **Step 2: Sync lockfile.**
-
-Run: `npm install --package-lock-only`
-
-- [ ] **Step 3: Commit + push + PR.**
-
-```bash
-git add VERSION package.json package-lock.json .claude-plugin/plugin.json .claude-plugin/marketplace.json
-git commit -m "chore: bump version for decisions-log PR #3 (round-trip sync)
-
-Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>"
-
-git push -u origin emdash/questions-70lfu
-
-gh pr create --title "decisions-log PR #3: round-trip sync" --body "$(cat <<'EOF'
-## Summary
-
-Third PR in the decisions-log series. Closes the human iteration loop:
-human edits the prose gdoc, runs \`/ace:step decisions-sync\`, the YAML
-picks up the changes with \`status: overridden\`, the next \`/ace:run\`
-honors the overrides.
-
-## What ships
-
-- **\`lib/decisions-parser.ts\`** — pure-function parser: walks a Google Docs documents.get response, extracts \`{id, default, options_considered}\` per HEADING_3-anchored decision section. 7 unit tests + 1 round-trip integration test (renderer + parser are inverses).
-- **\`lib/decisions-sync.ts\`** — pure-function merger: \`mergeDecisions(parsed, current)\` returns merged DecisionsLog + change report. 9 unit tests covering override + options-added + non-deletion + unmatched-row reporting.
-- **\`scripts/decisions-sync.ts\`** — runner: glues parser + merger + Drive I/O. \`DecisionsSyncDriveClient\` interface for testability. 3 integration tests against a fake client.
-- **\`skills/decisions-sync/SKILL.md\`** — thin skill body. Human-triggered via \`/ace:step decisions-sync <opp>/<run-id>\`.
-
-## Trigger model
-
-Human-triggered, NOT orchestrator-automated. Auto-pickup risks silently overwriting AI defaults if stale gdoc edits exist. Explicit invocation makes the override deliberate and auditable.
-
-## What does NOT ship
-
-- Phase 2-9 writes (one PR per phase) — PRs #4-#11.
-- Eval rubric re-anchor — separate follow-up.
-
-## Test plan
-
-- [ ] CI green
-- [x] \`npm test\` passes locally — full suite green with 19 new assertions
-- [ ] Manual verification: edit a default in a turmeric run's decisions.gdoc, run \`/ace:step decisions-sync\`, confirm decisions.yaml updated with \`status: overridden\` and the prior default preserved in options_considered
-
-🤖 Generated with [Claude Code](https://claude.com/claude-code)
-EOF
-)"
-```
-
----
-
-## Self-review pass
-
-**Spec coverage** — every concern in the coverage map maps to a task.
-
-**Placeholder scan** — no `TBD`, `TODO`, `Add appropriate error handling`, or `Similar to Task N`. Every code step has the literal code; every command has the exact invocation; every file path is exact.
-
-**Type consistency** — `parseDocumentStructure`, `ParsedDecisionRow`, `mergeDecisions`, `ChangeReport`, `runDecisionsSync`, `DecisionsSyncDriveClient` — all referenced consistently across plan, code blocks, and test assertions. The parser output type is consumed by the merger; the merger output is consumed by the runner.
-
-**Spec → plan alignment** — matches `docs/superpowers/specs/2026-05-08-decisions-log-design.md` § Round-trip — v1: explicit sync skill. The recommended-additional rows from PR #2 (illustrative, non-binding) are unaffected — sync only acts on rows present in BOTH the gdoc and the YAML.
-
----
-
-## Execution handoff
-
-Plan saved to `docs/superpowers/plans/2026-05-08-decisions-log-pr3.md`. Subagent-driven execution.
diff --git a/docs/superpowers/plans/2026-05-08-decisions-log-pr4.md b/docs/superpowers/plans/2026-05-08-decisions-log-pr4.md
deleted file mode 100644
index 345fcc02..00000000
--- a/docs/superpowers/plans/2026-05-08-decisions-log-pr4.md
+++ /dev/null
@@ -1,294 +0,0 @@
-# Decisions Log — PR #4: Phase 2-9 Writes — Implementation Plan
-
-> **For agentic workers:** REQUIRED SUB-SKILL: Use superpowers:subagent-driven-development to implement this plan task-by-task.
-
-**Goal:** Extend decisions-log writing across all 9 phases. Each phase's primary writing skill gains a `## Decisions Log` section that (a) references `skills/idea-to-pdd/SKILL.md § Decisions Log Convention` as the authoritative bar criterion, (b) declares 1-4 anchor decisions specific to the phase, (c) acknowledges additional rows beyond anchors come from LLM judgment per the bar.
-
-**Architecture:** Pure doc surface — no new code, no new tests. The orchestrator's Phase Write-Back Verifier already enforces "every phase MUST append rows to decisions.yaml" (PR #1). The renderer auto-picks up new rows on every phase invocation (PR #2). The sync skill rounds-trips human edits (PR #3). PR #4's job is to give each phase's writing skill phase-specific guidance about which anchors are load-bearing.
-
-**Tech stack:** Markdown editing only.
-
-**Spec deviation:** None — this is the spec's PR #2 scope (Phase 2-9 writes), reordered behind PRs #2 and #3 because human visibility and iteration matter before broader data collection.
-
----
-
-## Phase → primary writing skill → anchor decisions
-
-| Phase | Skill | Anchor rows |
-|---|---|---|
-| 2 commcare | `pdd-to-deliver-app` | `deliver-unit-count`, `one-form-per-module-workaround`, `multimedia-coverage-strategy` |
-| 3 connect | `connect-opp-setup` | `verification-flags`, `payment-unit-shape`, `opportunity-end-date` |
-| 4 ocs | `ocs-agent-setup` | `system-prompt-baseline`, `rag-collection-scope`, `test-prompt-count` |
-| 5 qa-and-training | `app-test-cases` | `test-scenario-count`, `test-archetype-coverage` |
-| 6 synthetic | `synthetic-narrative-plan` | `persona-count`, `scenario-count`, `narrative-arc-shape` |
-| 7 solicitation | `solicitation-create` | `solicitation-type`, `response-deadline`, `response-template-choice` |
-| 8 execution | `llo-launch` | `llo-capacity-actual`, `day-one-readiness`, `downstream-handoff-alignment`, `stop-loss-planning` |
-| 9 closeout | `opp-closeout` | `closeout-depth`, `learnings-summary-scope` |
-
-The Phase 8 anchors come directly from `llo-launch-eval`'s viability axis (PR #145). Other phases' anchors are derived from common load-bearing decisions; each phase's eval rubric has not declared explicit viability dimensions yet, so the bar criterion is the operative filter.
-
----
-
-## Tasks
-
-Each task adds a uniform `## Decisions Log` section to the named skill. The section structure is identical across all 8 skills; only the anchor list and phase tag vary.
-
-### Task structure (applies to Tasks 1-8)
-
-For each skill in the table above, append a new top-level section before the existing `## Change Log` table. The section template:
-
-```markdown
-## Decisions Log
-
-This skill writes load-bearing defaults to the per-run
-`ACE/<opp-name>/runs/<run-id>/decisions.yaml`. The bar criterion and
-schema live in `skills/idea-to-pdd/SKILL.md § Decisions Log Convention`
-(canonical authority); anchors below are the phase-specific subset
-load-bearing for downstream eval rubrics.
-
-### Anchor decisions
-
-| ID | Question | Map to surface |
-|---|---|---|
-<phase-specific rows>
-
-### Beyond anchors
-
-Append additional rows whenever the skill applies a load-bearing default
-meeting the bar criterion (load-bearing + maps to known surface). The
-orchestrator's Phase Write-Back Verifier (`agents/ace-orchestrator.md`
-§ Phase Write-Back Contract § Decisions log clause) enforces the
-contract; the renderer (`skills/decisions-render`) regenerates the gdoc
-at end of every phase.
-
-Each row this skill writes uses `phase: <N>-<phase-name>` and
-`skill: <this-skill-name>`.
-```
-
-Then add a row to the `## Change Log` table:
-
-```markdown
-| 2026-05-08 | Add `## Decisions Log` section: phase-specific anchor rows + bar-criterion reference. Pairs with decisions-log PR #4 (Phase 2-9 writes). | ACE team (decisions-log PR #4) |
-```
-
----
-
-### Task 1: Phase 2 — `skills/pdd-to-deliver-app/SKILL.md`
-
-Anchor rows:
-
-```markdown
-| `deliver-unit-count` | How many distinct deliver units (modules × forms) does the Deliver app expose? | PDD `Deliver App Specification` numeric |
-| `one-form-per-module-workaround` | Are we one-form-per-module to dodge Nova's CCZ marker bug? | `pdd-to-deliver-app-eval` connect-marker-coverage dimension; CLAUDE.md gotcha |
-| `multimedia-coverage-strategy` | What multimedia (text vs voice prompts vs both) does the Deliver app surface? | `app-multimedia-coverage` skill output; PDD multimedia note |
-```
-
-Phase tag: `2-commcare`. Skill name: `pdd-to-deliver-app`.
-
-- [ ] Add the section.
-- [ ] Add change-log row.
-- [ ] Commit:
-  ```bash
-  git add skills/pdd-to-deliver-app/SKILL.md
-  git commit -m "skill(pdd-to-deliver-app): wire Phase 2 anchors to decisions log
-
-3 anchor rows: deliver-unit-count, one-form-per-module-workaround,
-multimedia-coverage-strategy. References the bar criterion in
-idea-to-pdd's Decisions Log Convention as the authoritative filter.
-
-Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>"
-  ```
-
----
-
-### Task 2: Phase 3 — `skills/connect-opp-setup/SKILL.md`
-
-Anchor rows:
-
-```markdown
-| `verification-flags` | Which verification flags (gps, photo, location toggle, duration thresholds) does the opportunity require? | `connect-opp-setup-eval`; PDD `Verification Mechanism` |
-| `payment-unit-shape` | Per-visit fixed amount, tiered, milestone-gated, etc.? | `connect-opp-setup` payment-unit creation; PDD `Payment Rate` |
-| `opportunity-end-date` | When does the opportunity close? | PDD `Timeline` numeric; gates Phase 8 monitoring cadence |
-```
-
-Phase tag: `3-connect`. Skill name: `connect-opp-setup`.
-
-- [ ] Add section + change-log row.
-- [ ] Commit.
-
----
-
-### Task 3: Phase 4 — `skills/ocs-agent-setup/SKILL.md`
-
-Anchor rows:
-
-```markdown
-| `system-prompt-baseline` | What baseline system prompt does the per-opp chatbot inherit (golden template default vs. customized for archetype)? | `ocs-chatbot-eval` rubric coverage |
-| `rag-collection-scope` | What documents land in the per-opp RAG collection (golden defaults vs. opp-specific additions)? | `ocs-chatbot-eval` retrieval-quality dimension |
-| `test-prompt-count` | How many test prompts feed the smoke-eval gate (default 5 quick, 90 deep)? | `pdd-to-test-prompts` output cardinality; deep vs shallow QA split |
-```
-
-Phase tag: `4-ocs`. Skill name: `ocs-agent-setup`.
-
-- [ ] Add section + change-log row.
-- [ ] Commit.
-
----
-
-### Task 4: Phase 5 — `skills/app-test-cases/SKILL.md`
-
-Anchor rows:
-
-```markdown
-| `test-scenario-count` | How many app-walkthrough scenarios feed the qa+eval pair? | `pdd-to-app-journeys-eval` coverage_completeness dimension |
-| `test-archetype-coverage` | Are all archetypes in the PDD covered by at least one scenario? | `pdd-to-app-journeys-eval` archetype_alignment dimension |
-```
-
-Phase tag: `5-qa-and-training`. Skill name: `app-test-cases`.
-
-- [ ] Add section + change-log row.
-- [ ] Commit.
-
----
-
-### Task 5: Phase 6 — `skills/synthetic-narrative-plan/SKILL.md`
-
-Anchor rows:
-
-```markdown
-| `persona-count` | How many personas does the synthetic data narrative cover? | `synthetic-narrative-plan-eval` persona-coverage dimension |
-| `scenario-count` | How many distinct scenarios per persona? | `synthetic-narrative-plan-eval` scenario-density |
-| `narrative-arc-shape` | Linear, branching, or stage-gated story arc? | `synthetic-narrative-plan-eval` narrative-coherence; archetype alignment |
-```
-
-Phase tag: `6-synthetic-data-and-workflows`. Skill name: `synthetic-narrative-plan`.
-
-- [ ] Add section + change-log row.
-- [ ] Commit.
-
----
-
-### Task 6: Phase 7 — `skills/solicitation-create/SKILL.md`
-
-Anchor rows:
-
-```markdown
-| `solicitation-type` | EOI vs RFP vs custom? | `solicitation-create-eval`; affects who applies and at what fidelity |
-| `response-deadline` | Days from publish to deadline (default 14)? | `solicitation-create` schema; gates Phase 7→8 timing |
-| `response-template-choice` | Stock template vs opp-custom response form? | `solicitation-create` content; downstream `solicitation-review` rubric input |
-```
-
-Phase tag: `7-solicitation-management`. Skill name: `solicitation-create`.
-
-- [ ] Add section + change-log row.
-- [ ] Commit.
-
----
-
-### Task 7: Phase 8 — `skills/llo-launch/SKILL.md`
-
-Anchor rows (these come directly from `llo-launch-eval`'s viability axis added in PR #145):
-
-```markdown
-| `llo-capacity-actual` | Did the LLO actually recruit the team they promised? | `llo-launch-eval` `llo_capacity_actual` dimension (PR #145) |
-| `day-one-readiness` | Are FLWs actually ready Day 1 (training complete, devices provisioned, accounts activated)? | `llo-launch-eval` `day_one_readiness` dimension (PR #145) |
-| `downstream-handoff-alignment` | Is the named downstream consumer ready to receive data on the agreed cadence? | `llo-launch-eval` `downstream_handoff_alignment` dimension (PR #145) |
-| `stop-loss-planning` | Is there a documented halt condition (data-quality floor, recruitment failure, etc.)? | `llo-launch-eval` `stop_loss_planning` dimension (PR #145) |
-```
-
-Phase tag: `8-execution-management`. Skill name: `llo-launch`.
-
-- [ ] Add section + change-log row.
-- [ ] Commit.
-
----
-
-### Task 8: Phase 9 — `skills/opp-closeout/SKILL.md`
-
-Anchor rows:
-
-```markdown
-| `closeout-depth` | Standard summary vs. deep retrospective with cycle-grade re-anchor? | `cycle-grade-eval` rubric input |
-| `learnings-summary-scope` | Per-opp only, or cross-opp pattern aggregation? | `learnings-summary` skill output; ACE-wide pattern catalogue |
-```
-
-Phase tag: `9-closeout`. Skill name: `opp-closeout`.
-
-- [ ] Add section + change-log row.
-- [ ] Commit.
-
----
-
-### Task 9: Run full test suite
-
-- [ ] Run `npm test`. Expected: all pre-existing tests pass; no new tests added in this PR.
-
-If anything fails, the SKILL.md edits broke a markdown-grep-shaped test (e.g., a test counts `## ` sections per skill). Fix by reverting the offending change or adjusting the test to allow the new section.
-
----
-
-### Task 10: Version bump + push + PR
-
-- [ ] Run `bash scripts/version-bump.sh` and `npm install --package-lock-only`.
-- [ ] Commit:
-  ```bash
-  git add VERSION package.json package-lock.json .claude-plugin/plugin.json .claude-plugin/marketplace.json
-  git commit -m "chore: bump version for decisions-log PR #4 (Phase 2-9 writes)
-
-  Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>"
-  ```
-- [ ] Push: `git push -u origin emdash/questions-70lfu`.
-- [ ] Open the PR:
-  ```bash
-  gh pr create --title "decisions-log PR #4: Phase 2-9 anchor decisions" --body "$(cat <<'EOF'
-  ## Summary
-
-  Fourth PR in the decisions-log series ([spec](docs/superpowers/specs/2026-05-08-decisions-log-design.md), [plan](docs/superpowers/plans/2026-05-08-decisions-log-pr4.md)). Extends decisions-log writing from Phase 1 (PR #1-#3) to all 9 phases.
-
-  ## What ships
-
-  Eight skill bodies updated with a uniform `## Decisions Log` section listing per-phase anchor decisions tied to specific eval rubric dimensions. Doc-only — no new code, no new tests. The orchestrator contract, the renderer, and the sync skill are already wired (PRs #1-#3) and pick up rows from any phase automatically.
-
-  | Phase | Skill | Anchors |
-  |---|---|---|
-  | 2 commcare | \`pdd-to-deliver-app\` | deliver-unit-count, one-form-per-module-workaround, multimedia-coverage-strategy |
-  | 3 connect | \`connect-opp-setup\` | verification-flags, payment-unit-shape, opportunity-end-date |
-  | 4 ocs | \`ocs-agent-setup\` | system-prompt-baseline, rag-collection-scope, test-prompt-count |
-  | 5 qa-and-training | \`app-test-cases\` | test-scenario-count, test-archetype-coverage |
-  | 6 synthetic | \`synthetic-narrative-plan\` | persona-count, scenario-count, narrative-arc-shape |
-  | 7 solicitation | \`solicitation-create\` | solicitation-type, response-deadline, response-template-choice |
-  | 8 execution | \`llo-launch\` | llo-capacity-actual, day-one-readiness, downstream-handoff-alignment, stop-loss-planning |
-  | 9 closeout | \`opp-closeout\` | closeout-depth, learnings-summary-scope |
-
-  Phase 8 anchors come directly from \`llo-launch-eval\`'s viability axis (PR #145); other phases' anchors derive from common load-bearing decisions per the bar criterion.
-
-  ## What does NOT ship
-
-  - Eval rubric re-anchor (\`idea-to-pdd-eval\`'s \`deferred-decision-discipline\` branch grading on \`decisions.yaml\` directly) — separate follow-up.
-
-  ## Test plan
-
-  - [ ] CI green
-  - [x] \`npm test\` passes locally
-  - [ ] Manual verification: re-run any phase, confirm new rows appear in decisions.yaml + decisions.gdoc with the phase tag set correctly
-
-  🤖 Generated with [Claude Code](https://claude.com/claude-code)
-  EOF
-  )"
-  ```
-
----
-
-## Self-review pass
-
-**Spec coverage** — all 8 phases mapped to a primary writing skill with anchor rows.
-
-**Placeholder scan** — anchor row `Map to surface` cells reference real eval rubric dimensions where they exist (Phase 8 viability axis from PR #145; Phase 1 archetype/numbers/etc. dimensions). For phases without explicit viability dimensions in their eval rubrics yet, the surface is the corresponding rubric file or PDD section — concrete enough.
-
-**Type consistency** — every anchor row's phase tag matches the phase name in the table; every skill name matches the actual skill directory.
-
----
-
-## Execution
-
-Subagent-driven not strictly needed (mechanical doc edits, ~5-10 lines per skill). Inline execution is cleaner for this PR.
diff --git a/docs/superpowers/plans/2026-05-08-decisions-log-pr5.md b/docs/superpowers/plans/2026-05-08-decisions-log-pr5.md
deleted file mode 100644
index 0f20e7c4..00000000
--- a/docs/superpowers/plans/2026-05-08-decisions-log-pr5.md
+++ /dev/null
@@ -1,133 +0,0 @@
-# Decisions Log — PR #5: Retire "Anchor" Concept — Implementation Plan
-
-> **For agentic workers:** Inline execution recommended (uniform mechanical edits across 9 skills + 1 test + 1 spec). No new code, no new tests.
-
-**Goal:** Retire the "anchor" framing from PRs #2-#4. Replace with `## Common Load-Bearing Decisions` (illustrative + meaningfully improveable, not normative). Bar criterion is the sole filter; these lists are templates that guide LLM judgment, refined over time.
-
-**Why:** "MUST emit anchor decisions if applicable" reintroduced the hardcoded-required-rows pattern PR #2 was meant to retire — just with fewer rows. Two distinct concepts got conflated:
-1. **Real eval inputs** — Phase 1 (5 rows in `idea-to-pdd-eval`) and Phase 8 (4 rows in `llo-launch-eval`). The eval rubric grades on these dimensions; missing them = ungraded input.
-2. **Speculative templates** — Phase 2-7 + Phase 9 anchors are guesses about what *would* be load-bearing if a viability axis existed for those rubrics. Today nothing actually grades on them.
-
-Treating both kinds as "MUST emit if applicable" overstates the second. The cleaner framing: bar criterion filters; each phase has a list of common load-bearing decisions that informs the LLM but doesn't require any specific row.
-
-**Architecture:** Pure doc renames + one test assertion drop. No new code.
-
----
-
-## File-by-file change
-
-| File | Change |
-|---|---|
-| `skills/idea-to-pdd/SKILL.md` | `### Anchor decisions` → `### Common load-bearing decisions`; drop "MUST emit" prose; merge with `### Recommended additional rows` (was already non-binding) |
-| `skills/pdd-to-deliver-app/SKILL.md` | Same rename + soften wording |
-| `skills/connect-opp-setup/SKILL.md` | Same |
-| `skills/ocs-agent-setup/SKILL.md` | Same |
-| `skills/app-test-cases/SKILL.md` | Same |
-| `skills/synthetic-narrative-plan/SKILL.md` | Same |
-| `skills/solicitation-create/SKILL.md` | Same |
-| `skills/llo-launch/SKILL.md` | Same. The 4 rows still map 1:1 to `llo-launch-eval` viability dimensions; keep that mapping prominent so the eval grade has structured input |
-| `skills/opp-closeout/SKILL.md` | Same |
-| `test/skills/idea-to-pdd/decisions-fixture.test.ts` | Drop the "contains every anchor row" assertion (replace with: schema valid + invariants); the 5 Phase 1 rows that happen to be in the fixture are illustrative, not required |
-| `docs/superpowers/specs/2026-05-08-decisions-log-design.md` | Wording cleanup: replace "anchor" with "common load-bearing decisions" where used as normative |
-
----
-
-## Uniform replacement template (applies to all 9 SKILL.md files)
-
-The current section structure (added in PRs #2 and #4):
-
-```markdown
-## Decisions Log
-
-This skill writes load-bearing defaults to the per-run
-`ACE/<opp-name>/runs/<run-id>/decisions.yaml`. The bar criterion and
-schema live in `skills/idea-to-pdd/SKILL.md § Decisions Log Convention`
-(canonical authority); anchors below are the phase-specific subset
-load-bearing for downstream eval rubrics.
-
-### Anchor decisions
-
-| ID | Question | Map to surface |
-|---|---|---|
-<rows>
-
-### Beyond anchors
-
-Append additional rows whenever the skill applies a load-bearing default
-meeting the bar criterion ...
-```
-
-Replace with:
-
-```markdown
-## Decisions Log
-
-This skill writes load-bearing defaults to the per-run
-`ACE/<opp-name>/runs/<run-id>/decisions.yaml`. The bar criterion and
-schema live in `skills/idea-to-pdd/SKILL.md § Decisions Log Convention`
-(canonical authority). The list below catalogs decisions that commonly
-qualify under the bar for this phase — it's a working template, not a
-required set. The skill applies the bar criterion and emits whatever
-rows meet it; the catalog is a teaching device that improves over time
-as we learn from runs.
-
-### Common load-bearing decisions for this phase
-
-| ID | Question | Map to surface |
-|---|---|---|
-<same rows as before>
-
-The orchestrator's Phase Write-Back Verifier
-(`agents/ace-orchestrator.md` § Phase Write-Back Contract § Decisions
-log clause) enforces the contract; the renderer
-(`skills/decisions-render`) regenerates the gdoc at end of every phase.
-
-Each row this skill writes uses `phase: <N>-<phase-name>` and
-`skill: <skill-name>`.
-```
-
-For Phase 1 (`skills/idea-to-pdd/SKILL.md`), the existing `### Anchor decisions` and `### Recommended additional rows` tables collapse into a single `### Common load-bearing decisions for Phase 1` table containing all 14 rows.
-
-For Phase 8 (`skills/llo-launch/SKILL.md`), keep an explicit note that the 4 rows map 1:1 to `llo-launch-eval`'s viability dimensions (the eval grade depends on them being present).
-
-For Phase 1 (`skills/idea-to-pdd/SKILL.md`), the same applies: the 5 viability-axis rows map 1:1 to `idea-to-pdd-eval`'s viability dimensions.
-
----
-
-## Tasks
-
-Inline execution; one commit per file group for easy revert.
-
-### Task 1: Phase 1 — `skills/idea-to-pdd/SKILL.md`
-
-Merge the `### Anchor decisions` and `### Recommended additional rows` sub-sections into a single `### Common load-bearing decisions for Phase 1`. The combined table has all 14 rows. Keep the eval-rubric mapping in the "Map to surface" column — for the 5 rows tied to viability dimensions, note `(eval anchor)` so it's visible without being normative.
-
-### Task 2: Phase 2-9 — eight SKILL.md files
-
-Apply the uniform replacement template to each. The "Map to surface" column stays as-is (already references real eval rubric dimensions where they exist).
-
-### Task 3: Test fixture assertion
-
-Replace the "contains every anchor row from the Phase 1 anchor list" assertion in `test/skills/idea-to-pdd/decisions-fixture.test.ts` with a softer "the fixture covers the 5 viability-axis decisions Phase 1's eval rubric grades on" — phrased as a fixture quality check, not a schema invariant.
-
-### Task 4: Spec wording cleanup
-
-Replace "anchor" with "common load-bearing decisions" where used as normative in `docs/superpowers/specs/2026-05-08-decisions-log-design.md`.
-
-### Task 5: Run full test suite + tsc
-
-Confirm no regressions.
-
-### Task 6: Version bump + push + PR
-
----
-
-## Self-review
-
-**Spec coverage** — every place "anchor" is used as a normative term gets the rename + soften treatment.
-
-**No code changes** — purely doc + one test assertion edit.
-
-**Backward compatibility** — fixtures with the 5 Phase 1 viability rows still pass tests; the wording shift is invisible to the merger / parser / renderer.
-
----
diff --git a/docs/superpowers/plans/2026-05-10-orchestrator-structural-split.md b/docs/superpowers/plans/2026-05-10-orchestrator-structural-split.md
deleted file mode 100644
index b31409d3..00000000
--- a/docs/superpowers/plans/2026-05-10-orchestrator-structural-split.md
+++ /dev/null
@@ -1,818 +0,0 @@
-# Orchestrator Structural Split (PR 0c) — Implementation Plan
-
-> **For agentic workers:** REQUIRED SUB-SKILL: Use superpowers:subagent-driven-development (recommended) or superpowers:executing-plans to implement this plan task-by-task. Steps use checkbox (`- [ ]`) syntax for tracking.
->
-> **Worktree required.** This is a 1700-line doc refactor — work in an isolated worktree via `superpowers:using-git-worktrees`.
-
-**Goal:** Split `agents/ace-orchestrator.md` (currently 1728 lines) into a procedure doc (top-of-file = top-of-execution-flow) and a sibling reference doc, hoist scattered anti-patterns into one consolidated section near the top of procedure, and convert phase blocks to a uniform template — without losing or rewording any normative content.
-
-**Architecture:** Pure content move + reorder. Two output files: `agents/ace-orchestrator.md` (procedure, ~600 lines) and a NEW `agents/orchestrator-reference.md` (catalogs, schemas, contracts, ~700 lines). All cross-references between the two use H2 anchors (markdown auto-anchors), not line numbers. The procedure doc reads top-to-bottom in execution order.
-
-**Tech Stack:** Markdown editing only. No code changes. Verification is `git diff` review + grep checks for content preservation.
-
-**Line-number conventions in this plan.** Line numbers cited (e.g. "currently line 988") are the start-state values from the inventory in Task 1. After ANY task that moves content, line numbers shift. **Use H2 heading text as the canonical anchor when finding sections to move** — e.g. `grep -n "^## Producer Artifact Verifier" agents/ace-orchestrator.md`. The line numbers are hints, not addresses.
-
----
-
-## Why this PR exists
-
-Three perf PRs already shipped from this lens (0a/0b/0d). They worked despite the orchestrator's size because they were additive and locally-scoped. PR 0c attacks the cross-cutting structural problems the perf-lens analysis surfaced:
-
-1. **Reading order ≠ execution order.** Top of file is `Agent Topology → State Schema → Cruft Management`. Workflow doesn't start until line 872. Pre-flight content is at line 1416. The model reads top-to-bottom; by the time it gets to "what to do" it's been thinking about state schemas for 800 lines.
-2. **Reference and procedure interleaved.** `Phase Write-Back Contract` (line 1031, 100+ lines of spec) sits between the workflow that uses it and the verifier that enforces it. Splitting reference off lets the procedure doc read like a procedure.
-3. **Anti-patterns scattered.** "Don't summarize and continue" (line 749), "don't fan out env probes" (line 723), "don't fake background tasks" (line 394), "don't dispatch two Agents in one message" (line 715). Compliance with negative rules is much higher when they're in one scannable list near the top.
-4. **Phase blocks aren't uniformly shaped.** Lines 876–970 — each phase has a different shape. A uniform `Inputs → Atoms → Outputs → Write-back → Gate` template makes the workflow pattern-matchable instead of re-read each time.
-
-## Background — what the worker needs to know
-
-**Repository conventions** (`CLAUDE.md`):
-- This is the ACE plugin repo. `agents/` holds procedure docs + subagent definitions. The orchestrator is the canonical procedure doc — `/ace:run` reads it inline at level 0.
-- `main` is branch-protected. Ship via PR + `clean-install` CI check. See § Git worktrees and merging to main in `CLAUDE.md`.
-- Version bump via `bash scripts/version-bump.sh` (worktree-safe). Pre-commit hook syncs the four version files. After merge, `/ace:update` in the original session.
-
-**Three perf PRs already merged ahead of this one** (don't repeat their content):
-- **0a (#202)** — Pre-flight Checklist subsection inside `## Performance Conventions` (lines 548–648). Don't move it; it stays in procedure.
-- **0b (#203)** — `bin/ace-doctor --preflight` mode. Touches `bin/ace-doctor`. Orchestrator references it from Pre-flight Checklist Step 1.
-- **0d (#204)** — `### Phase Boundary Fence — when, in one message` subsection inside `## Phase Write-Back Verifier` (lines 1143–1193). The fence is procedure (must STAY in procedure doc); the procedure body of the verifier (lines 1194–1245) is reference (moves to reference doc).
-
-**Run log for context:** `.claude/pm/runs/2026-05-10-orchestrator-perf-lens.md` — captures the lens that produced these PRs.
-
----
-
-## File structure
-
-**Modify:** `agents/ace-orchestrator.md` — current 1728 lines → ~600 lines after extraction. New top-down execution order.
-
-**Create:** `agents/orchestrator-reference.md` — ~700 lines. Pure reference content. Linked from procedure doc.
-
-**Modify:** `CLAUDE.md` — section "Layout" mentions `ace-orchestrator.md`. Likely needs a sibling mention of the new reference doc; check for any line-number references that broke.
-
-**Modify:** `VERSION`, `package.json`, `.claude-plugin/plugin.json`, `.claude-plugin/marketplace.json` — via `scripts/version-bump.sh` at end.
-
----
-
-## Section classification (the contract for the move)
-
-Anchors are H2 headings as they exist today. **Worker: don't fight this classification — if you disagree on a borderline section, ask before moving.**
-
-### Stays in procedure doc (`agents/ace-orchestrator.md`)
-
-| Today's heading | Today's line | New position in procedure (target order) |
-|---|---|---|
-| `## You are ACE` | 32 | 1 (intro) |
-| **NEW** `## Anti-patterns and discipline` | (hoist) | 2 (right after intro) |
-| `## Performance Conventions` (with `### Pre-flight Checklist` and `### Per-phase conventions`) | 543 | 3 (rename to `## Pre-flight & per-phase conventions`) |
-| `## Populated opps are the norm` (incl. `### Why default mode looks like this`) | 266 | 4 (rename to `## Modes — default, review, auto`) |
-| `## Resuming after a halt` | 748 | 5 |
-| `## Starting a New Opportunity` (incl. `### Resolution`, `### Fallback`) | 1416 | 6 |
-| `## Workflow` (Phases 1–9) | 872 | 7 (Task 10 will reshape phase blocks) |
-| `## Between Phases` | 971 | 8 |
-| `### Phase Boundary Fence — when, in one message` (extract from current Phase Write-Back Verifier section) | 1143 | 9 (promote to its own H2: `## Phase boundary fence`) |
-| `## Per-Step Eval Hook` | 1301 | 10 |
-| `## Umbrella Eval` | 1363 | 11 |
-| `## Error Handling` | 1391 | 12 |
-| `## Dry-Run Mode` | 1399 | 13 |
-| `## Sandbox Mode` | 1409 | 14 |
-| `## Post-Run: ace-web Transcript Upload` | 1246 | 15 |
-| At end: forward-link to `agents/orchestrator-reference.md` | (new) | 16 |
-
-### Moves to reference doc (`agents/orchestrator-reference.md`)
-
-| Today's heading | Today's line | New position in reference |
-|---|---|---|
-| `## Agent Topology` | 24 | 1 |
-| `## Your State` | 39 | 2 |
-| `## State Schema` | 53 | 3 |
-| `## Scope boundaries — what goes in run_state.yaml` | 155 | 4 |
-| `## Cruft management — archive: block convention` | 197 | 5 |
-| `## Per-Phase Folder Lifecycle` (incl. `### Current/ shortcut refresh`) | 793 | 6 |
-| `## Producer Artifact Verifier` | 988 | 7 |
-| `## Phase Write-Back Contract` | 1031 | 8 |
-| `## Phase Write-Back Verifier` body (the procedure starting at `### Procedure` line 1194) | 1194 | 9 (under heading `## Phase Write-Back Verifier — procedure`) |
-| `## Pause Points` | 1264 | 10 |
-| `## Touching State — Operator Capture` (incl. `### State-as-canary contract`) | 1665 | 11 |
-
-### Folds into the new `## Anti-patterns and discipline` (procedure doc, position 2)
-
-These sections each contribute a tight rule block; the worker consolidates and removes the original sections from procedure. Worker may also push the *full content* into reference (e.g. as `## Discipline rules — full text`) if any nuance is too rich for the consolidated block — judgment call, but err on the side of fully consolidating.
-
-| Source section | Today's line | What to extract |
-|---|---|---|
-| `## Long-Running Skills — No Fake Background Tasks` (incl. `### When background IS appropriate`, `### When polling IS appropriate`) | 394 | The rules ("no fake background tasks", "background IS appropriate when X", "polling IS appropriate when Y"). Keep the rationale concise — this is anti-pattern territory. |
-| `## Skill Invocation Discipline` | 457 | The discipline rules. |
-| `## External Mutations — Verify After Create` | 489 | The 4-step write→read→compare→halt-loud rule. The canonical-example reference (`skills/connect-opp-setup/SKILL.md`) stays. |
-| **From `## Resuming after a halt`** | 748 | The `**Anti-pattern — do NOT "summarize and continue."**` paragraph specifically. |
-| **From `## Performance Conventions` per-phase block** | 648 | The `**Resolve .env in one shot, not by probing.**`, `**Batch independent operations.**`, `**Agent(...) dispatches DO NOT parallelize.**` rules. |
-
-The consolidated `## Anti-patterns and discipline` should read as a *scannable list*, not a wall of prose. Aim for ~80–120 lines. Cite full content in reference doc if too dense.
-
----
-
-## Tasks
-
-### Task 1: Set up the worktree and read the current state
-
-**Files:**
-- Read: `agents/ace-orchestrator.md` (1728 lines)
-- Read: `CLAUDE.md` (for cross-references)
-- Read: `.claude/pm/runs/2026-05-10-orchestrator-perf-lens.md` (lens context)
-
-- [ ] **Step 1: Confirm worktree is set up**
-
-Run: `git rev-parse --git-dir`
-Expected output: contains `/worktrees/`. If not, create one via `superpowers:using-git-worktrees`.
-
-- [ ] **Step 2: Confirm working tree is clean**
-
-Run: `git status --short`
-Expected output: empty (no modified or staged files).
-
-- [ ] **Step 3: Read the orchestrator end-to-end**
-
-Use `Read` on `agents/ace-orchestrator.md` (full file). Do not skim. As you read, mentally tag each H2 section as PROCEDURE or REFERENCE per the classification table above. If you disagree with the classification on any section, STOP and ask the user before continuing.
-
-- [ ] **Step 4: Take the content checksum baseline**
-
-Run:
-```bash
-awk '!/^[[:space:]]*$/ && !/^#/ {print}' agents/ace-orchestrator.md | sort -u > /tmp/orch-content-before.txt
-wc -l /tmp/orch-content-before.txt
-```
-Save this number — you'll re-run the same command at the end and verify the union of `agents/ace-orchestrator.md` + `agents/orchestrator-reference.md` reproduces the same set of unique non-blank, non-heading lines.
-
-- [ ] **Step 5: Commit no changes; tag the start point**
-
-Run:
-```bash
-git log -1 --oneline
-```
-Note the SHA. You'll diff against this at the end.
-
----
-
-### Task 2: Create the empty reference doc skeleton
-
-**Files:**
-- Create: `agents/orchestrator-reference.md`
-
-- [ ] **Step 1: Write the reference doc with H2 headings only (empty bodies)**
-
-Write `agents/orchestrator-reference.md` with this exact content:
-
-```markdown
-# ACE Orchestrator — Reference
-
-This doc is the *reference* counterpart to `agents/ace-orchestrator.md`. It catalogs schemas, contracts, lifecycle invariants, and architectural diagrams that the orchestrator's procedure references. The procedure doc tells you WHAT to do; this doc tells you the SHAPE of what you're doing.
-
-If you're executing `/ace:run`, read `agents/ace-orchestrator.md` first. Come here only when the procedure points you at a specific section.
-
----
-
-## Agent Topology
-
-(populated in Task 3)
-
-## Your State
-
-(populated in Task 3)
-
-## State Schema
-
-(populated in Task 3)
-
-## Scope boundaries — what goes in `run_state.yaml`
-
-(populated in Task 4)
-
-## Cruft management — `archive:` block convention
-
-(populated in Task 4)
-
-## Per-Phase Folder Lifecycle
-
-(populated in Task 4)
-
-## Producer Artifact Verifier
-
-(populated in Task 5)
-
-## Phase Write-Back Contract
-
-(populated in Task 5)
-
-## Phase Write-Back Verifier — procedure
-
-(populated in Task 5)
-
-## Pause Points
-
-(populated in Task 6)
-
-## Touching State — Operator Capture
-
-(populated in Task 7)
-```
-
-- [ ] **Step 2: Verify file landed**
-
-Run:
-```bash
-ls -la agents/orchestrator-reference.md
-grep -c "^## " agents/orchestrator-reference.md
-```
-Expected: file exists; 11 H2 headings.
-
-- [ ] **Step 3: Commit**
-
-```bash
-git add agents/orchestrator-reference.md
-git commit -m "orchestrator-reference: skeleton (PR 0c step 1/12)"
-```
-
----
-
-### Task 3: Move architectural sections to reference (Agent Topology, Your State, State Schema)
-
-**Files:**
-- Modify: `agents/ace-orchestrator.md` (cut)
-- Modify: `agents/orchestrator-reference.md` (paste)
-
-- [ ] **Step 1: Cut `## Agent Topology` from orchestrator, paste into reference**
-
-In `agents/ace-orchestrator.md`, locate the section starting with `## Agent Topology` (currently line 24). It runs until the next H2. Cut the entire block including the `## Agent Topology` heading line.
-
-In `agents/orchestrator-reference.md`, replace the line `## Agent Topology\n\n(populated in Task 3)\n` with the cut content.
-
-- [ ] **Step 2: Repeat for `## Your State`**
-
-Cut from orchestrator (currently ~line 39), paste into reference replacing the placeholder under `## Your State`.
-
-- [ ] **Step 3: Repeat for `## State Schema`**
-
-Cut from orchestrator (currently ~line 53). This is the largest of the three (~100 lines of YAML schema). Paste into reference replacing the placeholder under `## State Schema`.
-
-- [ ] **Step 4: Verify the moves**
-
-Run:
-```bash
-git diff agents/ace-orchestrator.md | grep -c '^-'
-git diff agents/orchestrator-reference.md | grep -c '^+'
-```
-The difference between adds and removes should be the placeholder lines deleted (`(populated in Task 3)` x3) — i.e. about 3 fewer adds than removes (plus minor whitespace adjustments). If the gap is wider than ~5, something was lost. Investigate before continuing.
-
-Run:
-```bash
-grep -n "^## Agent Topology\|^## Your State\|^## State Schema" agents/ace-orchestrator.md
-```
-Expected: empty output (all three moved out).
-
-```bash
-grep -n "^## Agent Topology\|^## Your State\|^## State Schema" agents/orchestrator-reference.md
-```
-Expected: three matches, in that order.
-
-- [ ] **Step 5: Commit**
-
-```bash
-git add agents/ace-orchestrator.md agents/orchestrator-reference.md
-git commit -m "orchestrator: move topology + state inventory + state schema → reference (PR 0c step 2/12)"
-```
-
----
-
-### Task 4: Move scope/cruft/lifecycle to reference
-
-**Files:**
-- Modify: `agents/ace-orchestrator.md` (cut)
-- Modify: `agents/orchestrator-reference.md` (paste)
-
-- [ ] **Step 1: Cut `## Scope boundaries — what goes in run_state.yaml`**
-
-Cut the section (heading + body until next H2). Paste under the corresponding placeholder in reference.
-
-- [ ] **Step 2: Cut `## Cruft management — archive: block convention`**
-
-Same pattern. Move to reference.
-
-- [ ] **Step 3: Cut `## Per-Phase Folder Lifecycle` (including the `### Current/ shortcut refresh` subsection)**
-
-This section runs from current line 793 to the next H2. Move the entire block including any H3 subsections.
-
-- [ ] **Step 4: Verify**
-
-Run:
-```bash
-grep -n "^## Scope boundaries\|^## Cruft management\|^## Per-Phase Folder Lifecycle" agents/ace-orchestrator.md
-```
-Expected: empty.
-
-```bash
-grep -n "^## Scope boundaries\|^## Cruft management\|^## Per-Phase Folder Lifecycle\|^### Current/ shortcut refresh" agents/orchestrator-reference.md
-```
-Expected: 4 matches.
-
-- [ ] **Step 5: Commit**
-
-```bash
-git add agents/ace-orchestrator.md agents/orchestrator-reference.md
-git commit -m "orchestrator: move scope + cruft + folder lifecycle → reference (PR 0c step 3/12)"
-```
-
----
-
-### Task 5: Move write-back contract + verifier procedure to reference (KEEP fence in procedure)
-
-**This is the most delicate task in the plan.** PR 0d added `### Phase Boundary Fence` *inside* `## Phase Write-Back Verifier` (currently lines 1143–1193). The fence is procedure (timing rule + worked examples) and MUST stay in the procedure doc. The verifier's actual procedure (currently lines 1194–1245, starting with `### Procedure`) moves to reference.
-
-**Files:**
-- Modify: `agents/ace-orchestrator.md` (cut Producer Artifact Verifier; cut Phase Write-Back Contract; KEEP Phase Boundary Fence; cut verifier procedure body)
-- Modify: `agents/orchestrator-reference.md` (paste)
-
-- [ ] **Step 1: Cut `## Producer Artifact Verifier`**
-
-Move the full section (currently ~line 988) to reference under the placeholder.
-
-- [ ] **Step 2: Cut `## Phase Write-Back Contract`**
-
-Move the full section (currently ~line 1031, ~100 lines) to reference under the placeholder.
-
-- [ ] **Step 3: Carefully split `## Phase Write-Back Verifier`**
-
-The current section is structured:
-```
-## Phase Write-Back Verifier
-<intro paragraph>
-
-### Phase Boundary Fence — when, in one message
-<PR 0d content — STAYS in procedure>
-
-### Procedure
-<the verifier's actual checklist procedure — MOVES to reference>
-```
-
-Action: in `agents/ace-orchestrator.md`, replace the H2 `## Phase Write-Back Verifier` line with `## Phase boundary fence` (promoting the H3 fence to its own H2). Delete the original intro paragraph and the `### Phase Boundary Fence — when, in one message` H3 line (the fence content stays under the new H2). Delete the `### Procedure` H3 and its body (move to reference).
-
-In `agents/orchestrator-reference.md`, replace the placeholder under `## Phase Write-Back Verifier — procedure` with: the original intro paragraph + the `### Procedure` body. (Drop the `### Procedure` H3 since the H2 already names it.)
-
-- [ ] **Step 4: Verify**
-
-Run:
-```bash
-grep -n "^## Producer Artifact Verifier\|^## Phase Write-Back Contract\|^## Phase Write-Back Verifier" agents/ace-orchestrator.md
-```
-Expected: empty.
-
-```bash
-grep -n "^## Phase boundary fence" agents/ace-orchestrator.md
-```
-Expected: one match.
-
-```bash
-grep -n "^## Producer Artifact Verifier\|^## Phase Write-Back Contract\|^## Phase Write-Back Verifier — procedure" agents/orchestrator-reference.md
-```
-Expected: 3 matches.
-
-```bash
-grep -n "Anti-pattern.*Boundary observed in real transcripts" agents/ace-orchestrator.md
-```
-Expected: one match (the worked-example anti-pattern from PR 0d, must still be in procedure).
-
-- [ ] **Step 5: Commit**
-
-```bash
-git add agents/ace-orchestrator.md agents/orchestrator-reference.md
-git commit -m "orchestrator: move verifiers + write-back contract → reference; keep boundary fence in procedure (PR 0c step 4/12)"
-```
-
----
-
-### Task 6: Move Pause Points to reference
-
-**Files:**
-- Modify: `agents/ace-orchestrator.md` (cut)
-- Modify: `agents/orchestrator-reference.md` (paste)
-
-- [ ] **Step 1: Cut `## Pause Points`**
-
-The section is a catalog (table of pause points + per-mode behavior). Move to reference under placeholder.
-
-- [ ] **Step 2: Verify**
-
-```bash
-grep -n "^## Pause Points" agents/ace-orchestrator.md   # expected: empty
-grep -n "^## Pause Points" agents/orchestrator-reference.md   # expected: 1 match
-```
-
-- [ ] **Step 3: Commit**
-
-```bash
-git add agents/ace-orchestrator.md agents/orchestrator-reference.md
-git commit -m "orchestrator: move pause-points catalog → reference (PR 0c step 5/12)"
-```
-
----
-
-### Task 7: Move Touching State — Operator Capture to reference
-
-**Files:**
-- Modify: `agents/ace-orchestrator.md` (cut)
-- Modify: `agents/orchestrator-reference.md` (paste)
-
-- [ ] **Step 1: Cut `## Touching State — Operator Capture` (including `### State-as-canary contract`)**
-
-Move the full section to reference.
-
-- [ ] **Step 2: Verify**
-
-```bash
-grep -n "^## Touching State\|^### State-as-canary contract" agents/ace-orchestrator.md   # expected: empty
-grep -n "^## Touching State\|^### State-as-canary contract" agents/orchestrator-reference.md   # expected: 2 matches
-```
-
-- [ ] **Step 3: Commit**
-
-```bash
-git add agents/ace-orchestrator.md agents/orchestrator-reference.md
-git commit -m "orchestrator: move operator-capture state contract → reference (PR 0c step 6/12)"
-```
-
----
-
-### Task 8: Hoist anti-patterns into one consolidated `## Anti-patterns and discipline` section
-
-**Files:**
-- Modify: `agents/ace-orchestrator.md` (consolidate, then delete sources)
-- Modify: `agents/orchestrator-reference.md` (optional — paste full text if too dense for procedure)
-
-This is the most judgment-heavy task. Read the source sections in full first, then decide structure.
-
-- [ ] **Step 1: Read all the anti-pattern source sections**
-
-Use `Read` on `agents/ace-orchestrator.md` and locate:
-- `## Long-Running Skills — No Fake Background Tasks` (incl. `### When background IS appropriate`, `### When polling IS appropriate`) — currently around line 394.
-- `## Skill Invocation Discipline` — currently around line 457.
-- `## External Mutations — Verify After Create` — currently around line 489.
-- The `**Resolve .env in one shot**`, `**Batch independent operations**`, `**Agent(...) dispatches DO NOT parallelize**` rule blocks inside `### Per-phase conventions` (currently around line 663).
-- The `**Anti-pattern — do NOT "summarize and continue."**` paragraph inside `## Resuming after a halt` (currently around line 749).
-
-- [ ] **Step 2: Draft the consolidated section**
-
-Write a new `## Anti-patterns and discipline` section right after `## You are ACE` in `agents/ace-orchestrator.md`. Structure it as a scannable rules list:
-
-```markdown
-## Anti-patterns and discipline
-
-These are the rules the orchestrator MUST follow during `/ace:run`. Each rule is a one-line directive. Where the rule has a worked failure mode (an incident or a transcript pattern), follow with a short **Why** paragraph. Detailed rationale and historical context lives in `agents/orchestrator-reference.md` § Discipline — full text.
-
-### Tool dispatch
-- **Don't fake background tasks.** No prose like "I'll check on this in 5 minutes." If a skill is long-running, run it in the foreground and let it complete. (Background dispatch IS appropriate for: <bullet>; polling IS appropriate when: <bullet>.)
-- **Don't dispatch two `Agent` calls in one message.** Claude Code does not reliably parallelize `Agent` dispatches. Treat all `Agent` and `/nova:autobuild`-style dispatches as serial.
-- **Do batch independent tool calls.** N independent `drive_read_file`, `connect_create_payment_unit`, etc. in a single assistant message. Sequential single-tool messages waste harness parallelism.
-- **Don't fan out env probes.** One Bash to resolve `.env` (use `bin/ace-doctor --preflight`). Not 3–4 separate `ls`/`test -f` probes.
-
-### State writes
-- **Verify after every external create.** Write → read → compare → halt loud on mismatch. (Canonical example: `skills/connect-opp-setup/SKILL.md` Steps 4 + 6.) Mismatch on a load-bearing field is a `[BLOCKER]`.
-- **Don't read-modify-write `run_state.yaml` by hand.** Use `update_yaml_file` with `merge: 'two-level'`. The CAS retry inside `update_yaml_file` is the race-correctness mechanism.
-
-### Procedure discipline
-- **Don't "summarize and continue" to dodge context exhaustion.** Trust the 1M-context window. If the harness genuinely signals exhaustion, write back and resume via `/ace:run <opp>/<run-id>` in a fresh session.
-- **Don't skip producer skills to shortcut to consumers.** § Producer Artifact Verifier (in reference) catches this at the next phase boundary, but the discipline lives here.
-- **Don't add operator-confirmation prompts on populated opps.** The "do you want to overwrite live state?" gate is off-spec — push reuse-vs-rebuild decisions down into phase agent skill logic instead.
-```
-
-(Worker: rephrase as needed for readability; the goal is "scannable rule list," not exhaustive rationale.)
-
-- [ ] **Step 3: Delete the now-consolidated source sections from procedure**
-
-Cut from `agents/ace-orchestrator.md`:
-- `## Long-Running Skills — No Fake Background Tasks` and its H3 subsections.
-- `## Skill Invocation Discipline`.
-- `## External Mutations — Verify After Create`.
-- The three rule blocks inside `### Per-phase conventions` (Resolve .env, Batch independent, Agent... DO NOT parallelize).
-- The `**Anti-pattern — do NOT "summarize and continue."**` paragraph from `## Resuming after a halt`.
-
-- [ ] **Step 4: Optionally paste full source text into reference**
-
-If the consolidated section had to drop important rationale (e.g. specific failure-mode incidents), append a `## Discipline — full text` H2 at the end of `agents/orchestrator-reference.md` containing the original prose.
-
-- [ ] **Step 5: Verify**
-
-```bash
-grep -n "^## Long-Running Skills\|^## Skill Invocation Discipline\|^## External Mutations" agents/ace-orchestrator.md
-```
-Expected: empty.
-
-```bash
-grep -n "^## Anti-patterns and discipline" agents/ace-orchestrator.md
-```
-Expected: one match.
-
-```bash
-grep -c "summarize and continue" agents/ace-orchestrator.md
-```
-Expected: at most one match (in the consolidated section, not in `## Resuming after a halt`).
-
-- [ ] **Step 6: Commit**
-
-```bash
-git add agents/ace-orchestrator.md agents/orchestrator-reference.md
-git commit -m "orchestrator: hoist scattered anti-patterns into one section near top (PR 0c step 7/12)"
-```
-
----
-
-### Task 9: Reorder remaining procedure sections into execution order
-
-After Tasks 3–8, `agents/ace-orchestrator.md` should contain (in some order) only the sections from the procedure column of the classification table, plus the new `## Anti-patterns and discipline`. This task reorders them.
-
-**Files:**
-- Modify: `agents/ace-orchestrator.md`
-
-Target order (top to bottom):
-
-1. `## You are ACE`
-2. `## Anti-patterns and discipline` (NEW — Task 8)
-3. `## Pre-flight & per-phase conventions` (renamed from `## Performance Conventions`)
-4. `## Modes — default, review, auto` (renamed from `## Populated opps are the norm — do NOT pause to ask "are you sure?"`; absorbs `### Why default mode looks like this`)
-5. `## Resuming after a halt`
-6. `## Starting a New Opportunity`
-7. `## Workflow`
-8. `## Between Phases`
-9. `## Phase boundary fence` (renamed from PR 0d's H3, promoted to H2 in Task 5)
-10. `## Per-Step Eval Hook`
-11. `## Umbrella Eval`
-12. `## Error Handling`
-13. `## Dry-Run Mode`
-14. `## Sandbox Mode`
-15. `## Post-Run: ace-web Transcript Upload (optional)`
-16. `## See also: orchestrator-reference.md` (NEW — added in Task 11)
-
-- [ ] **Step 1: Move sections one at a time into the target order**
-
-Use Edit's "cut from here, paste there" pattern. After each move, run:
-```bash
-grep -n "^## " agents/ace-orchestrator.md
-```
-And verify the section list is heading toward the target order.
-
-- [ ] **Step 2: Rename `## Performance Conventions` → `## Pre-flight & per-phase conventions`**
-
-```bash
-sed -i.bak 's/^## Performance Conventions$/## Pre-flight \& per-phase conventions/' agents/ace-orchestrator.md
-rm agents/ace-orchestrator.md.bak
-```
-
-- [ ] **Step 3: Rename `## Populated opps are the norm — do NOT pause to ask "are you sure?"` → `## Modes — default, review, auto`**
-
-The full original heading line is verbose; replace with a concise heading. Use Edit (sed will struggle with the special characters in the heading).
-
-- [ ] **Step 4: Verify final section order**
-
-Run:
-```bash
-grep -n "^## " agents/ace-orchestrator.md
-```
-Expected output (in this order): the 16-item list above.
-
-- [ ] **Step 5: Commit**
-
-```bash
-git add agents/ace-orchestrator.md
-git commit -m "orchestrator: reorder procedure sections into execution order (PR 0c step 8/12)"
-```
-
----
-
-### Task 10: Convert phase blocks to a uniform template
-
-**Files:**
-- Modify: `agents/ace-orchestrator.md` (rewrite each phase block under `## Workflow`)
-
-The current `## Workflow` has 9 phase blocks (currently lines 876–970), each shaped differently. Convert to a uniform template.
-
-**Template (apply to each `### Phase N: <name>`):**
-
-```markdown
-### Phase N: <Name>
-
-**Dispatch:** `Agent(<phase-agent-name>)` (or `inline procedure-doc agents/<name>.md` for Phase 2).
-
-**Inputs (inline at handoff):** PDD, prior-phase verdicts (`<prior-phase>/<producer>-{qa_result,eval_verdict}.yaml`), `run_state.yaml`. See § Pre-flight & per-phase conventions → "Pass artifacts inline at phase handoff" for the template.
-
-**Atoms / skills used (orchestrator-visible only):** <list, e.g. `Agent(design-review)` or `Agent(connect-setup)` — the orchestrator only sees the top-level Agent dispatch except for Phase 2 which dispatches Nova at L0>.
-
-**Outputs:** <list of artifacts the phase produces in `runs/<run-id>/<N>-<phase>/`>.
-
-**Write-back:** `phases.<phase>.{status, started_at, completed_at, verdict, summary_artifact, steps}` per § Phase Write-Back Contract (in reference). The boundary fence (§ Phase boundary fence) governs WHEN.
-
-**Gate:** <`[BLOCKER]` halts; pause behavior per § Pause Points (in reference)>.
-
-**Notes:** <any phase-specific prose worth preserving — e.g. Phase 2's level-0 constraint, Phase 7's HITL boundary, Phase 9's trigger condition>.
-```
-
-- [ ] **Step 1: Read the current `## Workflow` section in full**
-
-You need every phase's existing prose in front of you before rewriting.
-
-- [ ] **Step 2: Rewrite each phase block one at a time, committing per-phase**
-
-For each phase 1–9:
-
-1. Replace the phase block with the uniform template, filling in fields from the existing prose. Preserve all phase-specific facts (Phase 2 level-0 constraint, Phase 5 internal-only contract, Phase 6 reversibility note, Phase 7 HITL boundary, Phase 8 entry gate, Phase 9 trigger).
-2. Run `grep -A 20 "^### Phase N:" agents/ace-orchestrator.md` to spot-check the rewritten block.
-3. Commit with message `orchestrator: phase N uniform template (PR 0c step 9.<N>/12)`.
-
-If a fact in the original prose doesn't fit any of the template's fields, add a "Notes" bullet — don't drop it.
-
-- [ ] **Step 3: After all 9 phases rewritten, verify nothing was lost**
-
-Run:
-```bash
-grep -c "^### Phase " agents/ace-orchestrator.md
-```
-Expected: 9.
-
-```bash
-git diff <start-SHA-from-Task-1>..HEAD -- agents/ace-orchestrator.md | grep -c '^-### Phase '
-```
-Expected: 9 (each old block deleted).
-
-```bash
-git diff <start-SHA-from-Task-1>..HEAD -- agents/ace-orchestrator.md | grep -c '^+### Phase '
-```
-Expected: 9 (each new block added).
-
----
-
-### Task 11: Add forward-link from procedure to reference + update CLAUDE.md
-
-**Files:**
-- Modify: `agents/ace-orchestrator.md` (add link section at end)
-- Modify: `CLAUDE.md` (mention the new reference doc in § Layout)
-
-- [ ] **Step 1: Add forward-link section to end of procedure doc**
-
-Append to `agents/ace-orchestrator.md`:
-
-```markdown
-## See also: orchestrator-reference.md
-
-Reference content for this orchestrator lives in `agents/orchestrator-reference.md`:
-
-- `## Agent Topology` — architectural diagram + level-0/subagent constraints
-- `## State Schema` + `## Your State` — `run_state.yaml` and `opp.yaml` shapes
-- `## Scope boundaries` + `## Cruft management` — what belongs in run_state.yaml; archive convention
-- `## Per-Phase Folder Lifecycle` — Drive folder shape per phase
-- `## Producer Artifact Verifier` — discipline rule pattern
-- `## Phase Write-Back Contract` — required write-back shape
-- `## Phase Write-Back Verifier — procedure` — auto-stub fallback
-- `## Pause Points` — full pause-point catalog with per-mode table
-- `## Touching State — Operator Capture` — operator-bypass write rules
-
-The procedure doc above is the canonical execution flow; the reference doc is normative for the shapes and rules cited above.
-```
-
-- [ ] **Step 2: Update `CLAUDE.md` § Layout**
-
-In `CLAUDE.md`, locate the line in the `## Layout` section that today reads:
-
-```
-- `agents/` — 11 agents. Two procedure docs (`ace-orchestrator`, `commcare-setup`); nine subagents.
-```
-
-Update to:
-
-```
-- `agents/` — 11 agents + 1 reference doc. Two procedure docs (`ace-orchestrator`, `commcare-setup`); nine subagents; `orchestrator-reference.md` is the reference companion to `ace-orchestrator.md` (state schemas, write-back contract, pause-points catalog).
-```
-
-- [ ] **Step 3: Update the `Phase Write-Back Contract` reference in CLAUDE.md**
-
-`CLAUDE.md` line ~109 today reads:
-```
-See `agents/ace-orchestrator.md § Phase Write-Back Contract`
-```
-
-The Phase Write-Back Contract moved to reference in Task 5. Update to:
-```
-See `agents/orchestrator-reference.md § Phase Write-Back Contract`
-```
-
-- [ ] **Step 4: Search for other line-number or section references to ace-orchestrator.md in any doc**
-
-Run:
-```bash
-grep -rn "ace-orchestrator.md" --include='*.md' . | grep -v "docs/superpowers/plans/"
-```
-
-For each hit, evaluate whether the referenced section moved to the reference doc (per the classification table). If so, update to point at `agents/orchestrator-reference.md` instead. Use H2 anchor (`§ <heading>`), never line numbers.
-
-- [ ] **Step 5: Commit**
-
-```bash
-git add agents/ace-orchestrator.md CLAUDE.md
-git commit -m "orchestrator: forward-link to reference + CLAUDE.md cross-ref updates (PR 0c step 10/12)"
-```
-
----
-
-### Task 12: Verify content preservation + skim-read
-
-**Files:**
-- Read: `agents/ace-orchestrator.md`
-- Read: `agents/orchestrator-reference.md`
-
-- [ ] **Step 1: Content checksum**
-
-```bash
-awk '!/^[[:space:]]*$/ && !/^#/ {print}' agents/ace-orchestrator.md agents/orchestrator-reference.md | sort -u > /tmp/orch-content-after.txt
-wc -l /tmp/orch-content-after.txt
-diff /tmp/orch-content-before.txt /tmp/orch-content-after.txt | head -50
-```
-
-The diff should show:
-- A handful of *added* lines for the new `## Anti-patterns and discipline` consolidated section (since you rewrote rules into a tighter form).
-- A handful of *added* lines for the `## See also` section.
-- A handful of *added* lines for the new orchestrator-reference.md frontmatter.
-- A handful of *removed* lines for any prose that was *consolidated away* during the anti-patterns hoist.
-- Phase block deltas from Task 10's uniform-template rewrite.
-
-If the diff shows entire paragraphs of the original prose missing (not just rewording), STOP and investigate. Recover lost content from `git log -p` against the start SHA.
-
-- [ ] **Step 2: Skim-read both files top-to-bottom**
-
-Specifically check:
-- Procedure doc reads as a procedure top-to-bottom — pre-flight → run-start → workflow → boundary fence → per-step → post-run.
-- Reference doc reads as reference — schemas, contracts, catalogs, no execution-order prose.
-- No section appears in both files (except the rare case where the original was BOTH procedure and reference — Anti-patterns / Discipline being one of those).
-
-- [ ] **Step 3: Run all PR 0a/0b/0d-specific spot checks**
-
-```bash
-# PR 0a — pre-flight checklist still in procedure doc
-grep -c "^### Pre-flight Checklist" agents/ace-orchestrator.md   # expected: 1
-
-# PR 0b — bin/ace-doctor --preflight reference present
-grep -c "bin/ace-doctor --preflight" agents/ace-orchestrator.md  # expected: at least 1
-
-# PR 0d — phase boundary fence in procedure doc as H2
-grep -c "^## Phase boundary fence" agents/ace-orchestrator.md    # expected: 1
-grep -c "Boundary observed in real transcripts" agents/ace-orchestrator.md  # expected: 1
-```
-
-- [ ] **Step 4: Run a sanity check that orchestrator's H2 list matches the target order**
-
-```bash
-grep -n "^## " agents/ace-orchestrator.md
-```
-Expected (16 H2s, in execution order): see Task 9 list.
-
-```bash
-grep -n "^## " agents/orchestrator-reference.md
-```
-Expected (11 H2s).
-
-- [ ] **Step 5: Bump version + final commit + PR**
-
-```bash
-bash scripts/version-bump.sh
-git add VERSION package.json .claude-plugin/plugin.json .claude-plugin/marketplace.json
-git commit -m "VERSION bump for PR 0c"
-git push -u origin <branch-name>
-gh pr create --title "orchestrator: structural split — procedure vs reference, anti-patterns hoisted, uniform phase template (PR 0c)" --body "<see Self-Review checklist below for body>"
-```
-
-PR body should include:
-- Goal statement.
-- The classification table from the top of this plan.
-- A note that PR 0a/0b/0d shipped ahead and their content is preserved (pre-flight checklist, --preflight reference, phase boundary fence — all still in procedure doc).
-- Test plan: doc-only change, CI's `clean-install` is the gate; manual skim-read done.
-
-- [ ] **Step 6: Wait for CI, merge, run `/ace:update`, notify the original session.**
-
-Per `CLAUDE.md` § Plugin updates — NEVER locally patch.
-
----
-
-## Self-review checklist (run before opening the PR)
-
-- [ ] **No section appears in both files** unless intentionally duplicated (Anti-patterns + Discipline full text being the only allowed duplication).
-- [ ] **All H2 anchors used in cross-references resolve** — for every `§ <heading>` mention, grep both files for the heading.
-- [ ] **PR 0a/0b/0d content preserved**:
-  - Pre-flight Checklist (6 numbered steps + Stop Signs subsection) — in procedure.
-  - `bin/ace-doctor --preflight` referenced in Step 1 of Pre-flight Checklist.
-  - Phase Boundary Fence (with anti-pattern → right-pattern transcript example) — in procedure as own H2.
-- [ ] **No line-number references** in either doc to the OTHER doc. All cross-references use H2 anchors.
-- [ ] **VERSION bumped exactly once** at the end (not per-task).
-- [ ] **Each task's commit is atomic** — `git log --oneline` shows 12 PR-0c-step-N/12 commits + the version bump commit.
-
-## Backout
-
-If review surfaces problems and we need to back out:
-- Revert the merge commit on `main` (creates a revert PR).
-- The original `agents/ace-orchestrator.md` content is intact in git history at the start SHA from Task 1.
-- PR 0a/0b/0d are not affected — they were earlier merges.
-
-## Estimated time
-
-10–14 hours of focused work for an engineer with this plan in hand. Most of the time is in Task 8 (anti-pattern hoist — needs careful reading of source sections to consolidate without information loss) and Task 10 (phase template rewrites — 9 blocks × ~10 min each + verification).
diff --git a/docs/superpowers/plans/2026-05-15-ace-sweep-pr1-foundation-and-drive.md b/docs/superpowers/plans/2026-05-15-ace-sweep-pr1-foundation-and-drive.md
deleted file mode 100644
index 690220c1..00000000
--- a/docs/superpowers/plans/2026-05-15-ace-sweep-pr1-foundation-and-drive.md
+++ /dev/null
@@ -1,1148 +0,0 @@
-# `/ace:sweep` PR 1 — Foundation + Drive Sweep — Implementation Plan
-
-> **For agentic workers:** REQUIRED SUB-SKILL: Use superpowers:subagent-driven-development (recommended) or superpowers:executing-plans to implement this plan task-by-task. Steps use checkbox (`- [ ]`) syntax for tracking.
-
-**Goal:** Ship `/ace:sweep drive` end-to-end: walks Drive, builds a live-set of referenced identifiers from active opps, lists everything under `ACE/`, diffs to find orphans, scores them by ACE-fingerprint confidence, renders a markdown report, and trashes approved items via `drive_trash_file`. Lays the skill/procedure-doc/lib scaffolding that PRs 2–5 (Connect, OCS, HQ, labs) will extend.
-
-**Architecture:** Pure-function library (`lib/sweep-*.ts`) for the testable logic — live-set extraction from YAML strings, fingerprint scoring against a live set, markdown report rendering — following the existing `lib/doctor-drive-layout.ts` `DriveLike` pattern (pure logic, dispatcher injects authed client). Skill markdown files (`skills/sweep-*/SKILL.md`) describe agent procedures that call MCP Drive atoms and feed the lib functions. A procedure doc (`agents/sweep.md`) orchestrates the flow, dispatched by a slash command (`commands/sweep.md`). No new MCP atoms — Drive trash uses existing `drive_trash_file`.
-
-**Tech Stack:** TypeScript (ESM, no build step — `npx tsx`); vitest for unit tests; YAML via `yaml` package (already in deps); ACE plugin conventions per `CLAUDE.md` and `skills/README.md`.
-
----
-
-## File Structure
-
-**Create:**
-- `lib/sweep-types.ts` — shared types: `LiveSet`, `Orphan`, `Confidence`, `OrphanReport`
-- `lib/sweep-live-set.ts` — pure extractor: `extractIdentifiers(oppYaml, runStateYamls): LiveSet`
-- `lib/sweep-fingerprint.ts` — pure scorer: `scoreDriveFolder(folder, liveSet): Confidence`
-- `lib/sweep-report.ts` — pure renderer: `renderOrphanReport(orphans, system): string`
-- `test/lib/sweep-live-set.test.ts`
-- `test/lib/sweep-fingerprint.test.ts`
-- `test/lib/sweep-report.test.ts`
-- `skills/sweep-live-set/SKILL.md` — agent procedure: walks Drive, parses YAMLs, calls `extractIdentifiers`, writes `live-set.yaml`
-- `skills/sweep-drive/SKILL.md` — agent procedure: reads `live-set.yaml`, lists Drive `ACE/`, calls `scoreDriveFolder` + `renderOrphanReport`, prompts for approval, calls `drive_trash_file`
-- `agents/sweep.md` — procedure doc orchestrating live-set → per-system sweep
-- `commands/sweep.md` — `/ace:sweep [system]` slash command
-
-**Modify:** none. Skills and commands are auto-discovered from `skills/` and `commands/` by Claude Code's plugin loader; no `plugin.json` registration needed (verify via Task 9).
-
----
-
-## Type contracts (referenced across tasks)
-
-These types are defined in Task 1 and used unchanged in later tasks. If you need to deviate, update Task 1 first.
-
-```typescript
-// lib/sweep-types.ts
-export type Confidence = 'high' | 'medium' | 'low';
-
-export interface LiveSet {
-  /** Generated at this UTC ISO timestamp. */
-  generatedAt: string;
-  /** Opp slugs visible under ACE/ at generation time. */
-  oppSlugs: string[];
-  /** External identifiers referenced by any active opp's opp.yaml or run_state.yaml. */
-  identifiers: {
-    connectProgramIds: string[];
-    connectOpportunityIds: string[];
-    connectPaymentUnitIds: string[];
-    ocsChatbotIds: string[];
-    ocsCollectionIds: string[];
-    ocsSessionIds: string[];
-    commcareAppIds: string[];
-    labsWorkflowIds: string[];
-    labsPipelineIds: string[];
-    labsSyntheticIds: string[];
-    labsRecordIds: string[];  // solicitation/fund/review/response
-    driveFileIds: string[];   // explicit Drive references (rare)
-  };
-}
-
-export interface DriveFolderInfo {
-  id: string;
-  name: string;
-  /** ISO timestamp from Drive `createdTime`. */
-  createdTime: string;
-  /** Parent folder id; for ACE-root sweep this is `ACE_DRIVE_ROOT_FOLDER_ID`. */
-  parentId: string;
-}
-
-export interface Orphan {
-  /** Drive file/folder id. */
-  id: string;
-  /** Display name (folder name). */
-  name: string;
-  /** ISO timestamp. */
-  createdTime: string;
-  confidence: Confidence;
-  /** Human-readable signals that contributed to the score. */
-  signals: string[];
-}
-
-export interface OrphanReport {
-  system: 'drive' | 'connect' | 'ocs' | 'hq' | 'labs';
-  generatedAt: string;
-  liveSetGeneratedAt: string;
-  totals: { high: number; medium: number; low: number };
-  orphans: Orphan[];
-}
-```
-
----
-
-### Task 1: Shared types
-
-**Files:**
-- Create: `lib/sweep-types.ts`
-
-- [ ] **Step 1: Write the types file**
-
-Create `lib/sweep-types.ts` with the full content from the **Type contracts** section above (verbatim — copy from the spec block above). No runtime code, types only; nothing to test.
-
-- [ ] **Step 2: Verify TS compiles**
-
-Run: `npx tsc --noEmit lib/sweep-types.ts`
-Expected: no output (success). If you see "Cannot find module" errors, this file has no imports, so the error is environmental — check `tsconfig.json` is being picked up.
-
-- [ ] **Step 3: Commit**
-
-```bash
-git add lib/sweep-types.ts
-git commit -m "feat(sweep): add shared types for /ace:sweep"
-```
-
----
-
-### Task 2: live-set extractor (pure function)
-
-**Files:**
-- Create: `lib/sweep-live-set.ts`
-- Test: `test/lib/sweep-live-set.test.ts`
-
-The extractor takes one `opp.yaml` body string and an array of `run_state.yaml` body strings (one per run under that opp), parses them, and returns a `LiveSet` fragment for that opp. A separate helper (Task 5's skill) walks Drive and merges fragments across all opps.
-
-Per `CLAUDE.md` § Conventions: `opp.yaml` holds the durable `connect.program.{id, url, labs_int_id}`; `run_state.yaml` holds per-run `phases.<phase>.products.*`. Both are YAML.
-
-- [ ] **Step 1: Write the failing test**
-
-Create `test/lib/sweep-live-set.test.ts`:
-
-```typescript
-import { describe, expect, it } from 'vitest';
-import { extractOppFragment, mergeFragments } from '../../lib/sweep-live-set';
-import type { LiveSet } from '../../lib/sweep-types';
-
-const OPP_YAML = `
-display_name: Turmeric
-connect:
-  program:
-    id: prog-abc-123
-    url: https://connect.dimagi.com/programs/prog-abc-123
-    labs_int_id: 42
-`;
-
-const RUN_STATE_YAML = `
-opp: turmeric
-run_id: 20260502-1830
-phases:
-  connect-setup:
-    products:
-      opportunity:
-        id: opp-xyz-789
-      payment_units:
-        - id: pu-001
-        - id: pu-002
-  ocs-setup:
-    products:
-      chatbot:
-        id: chat-555
-        collection_id: coll-666
-  solicitation-management:
-    products:
-      solicitation:
-        id: labs-rec-1001
-        url: https://labs.connect.dimagi.com/solicitations/1001
-  synthetic-data-and-workflows:
-    products:
-      workflow_id: wf-200
-      pipeline_id: pl-300
-      synthetic_opp_id: syn-400
-  commcare-setup:
-    products:
-      learn_app:
-        hq_app_id: app-aaa
-      deliver_app:
-        hq_app_id: app-bbb
-`;
-
-describe('extractOppFragment', () => {
-  it('extracts Connect program id from opp.yaml', () => {
-    const frag = extractOppFragment('turmeric', OPP_YAML, []);
-    expect(frag.identifiers.connectProgramIds).toEqual(['prog-abc-123']);
-    expect(frag.oppSlugs).toEqual(['turmeric']);
-  });
-
-  it('extracts per-phase products from run_state.yaml', () => {
-    const frag = extractOppFragment('turmeric', OPP_YAML, [RUN_STATE_YAML]);
-    expect(frag.identifiers.connectOpportunityIds).toEqual(['opp-xyz-789']);
-    expect(frag.identifiers.connectPaymentUnitIds).toEqual(['pu-001', 'pu-002']);
-    expect(frag.identifiers.ocsChatbotIds).toEqual(['chat-555']);
-    expect(frag.identifiers.ocsCollectionIds).toEqual(['coll-666']);
-    expect(frag.identifiers.labsRecordIds).toEqual(['labs-rec-1001']);
-    expect(frag.identifiers.labsWorkflowIds).toEqual(['wf-200']);
-    expect(frag.identifiers.labsPipelineIds).toEqual(['pl-300']);
-    expect(frag.identifiers.labsSyntheticIds).toEqual(['syn-400']);
-    expect(frag.identifiers.commcareAppIds).toEqual(['app-aaa', 'app-bbb']);
-  });
-
-  it('tolerates missing phases', () => {
-    const frag = extractOppFragment('turmeric', OPP_YAML, ['opp: turmeric\nrun_id: x\nphases: {}\n']);
-    expect(frag.identifiers.connectOpportunityIds).toEqual([]);
-    expect(frag.identifiers.connectProgramIds).toEqual(['prog-abc-123']);
-  });
-
-  it('tolerates invalid YAML by treating it as empty', () => {
-    const frag = extractOppFragment('turmeric', 'this: is: not: yaml: [', []);
-    expect(frag.oppSlugs).toEqual(['turmeric']);
-    expect(frag.identifiers.connectProgramIds).toEqual([]);
-  });
-});
-
-describe('mergeFragments', () => {
-  it('merges identifiers, dedupes, sorts opp slugs', () => {
-    const a: LiveSet = {
-      generatedAt: '2026-05-15T00:00:00Z',
-      oppSlugs: ['turmeric'],
-      identifiers: {
-        connectProgramIds: ['p1'],
-        connectOpportunityIds: ['o1'],
-        connectPaymentUnitIds: [],
-        ocsChatbotIds: ['c1'],
-        ocsCollectionIds: [],
-        ocsSessionIds: [],
-        commcareAppIds: [],
-        labsWorkflowIds: [],
-        labsPipelineIds: [],
-        labsSyntheticIds: [],
-        labsRecordIds: [],
-        driveFileIds: [],
-      },
-    };
-    const b: LiveSet = {
-      generatedAt: '2026-05-15T00:00:00Z',
-      oppSlugs: ['arnica'],
-      identifiers: {
-        connectProgramIds: ['p2'],
-        connectOpportunityIds: ['o1'],  // dup
-        connectPaymentUnitIds: [],
-        ocsChatbotIds: ['c2'],
-        ocsCollectionIds: [],
-        ocsSessionIds: [],
-        commcareAppIds: [],
-        labsWorkflowIds: [],
-        labsPipelineIds: [],
-        labsSyntheticIds: [],
-        labsRecordIds: [],
-        driveFileIds: [],
-      },
-    };
-    const merged = mergeFragments([a, b], '2026-05-15T12:00:00Z');
-    expect(merged.oppSlugs).toEqual(['arnica', 'turmeric']);
-    expect(merged.identifiers.connectProgramIds.sort()).toEqual(['p1', 'p2']);
-    expect(merged.identifiers.connectOpportunityIds).toEqual(['o1']);
-    expect(merged.generatedAt).toBe('2026-05-15T12:00:00Z');
-  });
-});
-```
-
-- [ ] **Step 2: Run test to verify it fails**
-
-Run: `npx vitest run test/lib/sweep-live-set.test.ts`
-Expected: FAIL — "Cannot find module '../../lib/sweep-live-set'".
-
-- [ ] **Step 3: Implement `lib/sweep-live-set.ts`**
-
-```typescript
-/**
- * Pure live-set extraction. Parses one opp's opp.yaml + run_state.yaml bodies
- * and returns a LiveSet fragment scoped to that opp. The caller (the
- * sweep-live-set skill) walks Drive, fetches each opp's YAMLs, calls
- * extractOppFragment for each, then mergeFragments to produce the final
- * cross-opp LiveSet.
- *
- * No I/O. No Drive auth. Pure parsing + shape extraction so tests can
- * exercise the path-extraction logic without mocking Drive.
- */
-
-import { parse as parseYaml } from 'yaml';
-import type { LiveSet } from './sweep-types';
-
-function emptyIdentifiers(): LiveSet['identifiers'] {
-  return {
-    connectProgramIds: [],
-    connectOpportunityIds: [],
-    connectPaymentUnitIds: [],
-    ocsChatbotIds: [],
-    ocsCollectionIds: [],
-    ocsSessionIds: [],
-    commcareAppIds: [],
-    labsWorkflowIds: [],
-    labsPipelineIds: [],
-    labsSyntheticIds: [],
-    labsRecordIds: [],
-    driveFileIds: [],
-  };
-}
-
-function tryParse(yamlText: string): unknown {
-  try {
-    return parseYaml(yamlText) ?? {};
-  } catch {
-    return {};
-  }
-}
-
-function asString(v: unknown): string | null {
-  return typeof v === 'string' && v.length > 0 ? v : null;
-}
-
-function pushIfString(arr: string[], v: unknown): void {
-  const s = asString(v);
-  if (s) arr.push(s);
-}
-
-/**
- * Extract identifier fragment for one opp.
- *
- * `runStateYamls` is an array — one entry per `runs/<run-id>/run_state.yaml`
- * under this opp's folder. Pass [] if the opp has no runs yet.
- */
-export function extractOppFragment(
-  oppSlug: string,
-  oppYaml: string,
-  runStateYamls: string[],
-): LiveSet {
-  const ids = emptyIdentifiers();
-
-  // opp.yaml: durable Connect program reference
-  const opp = tryParse(oppYaml) as Record<string, unknown>;
-  const connect = (opp.connect ?? {}) as Record<string, unknown>;
-  const program = (connect.program ?? {}) as Record<string, unknown>;
-  pushIfString(ids.connectProgramIds, program.id);
-
-  // run_state.yaml: per-phase products
-  for (const text of runStateYamls) {
-    const run = tryParse(text) as Record<string, unknown>;
-    const phases = (run.phases ?? {}) as Record<string, unknown>;
-    for (const phaseBody of Object.values(phases)) {
-      const products = ((phaseBody as Record<string, unknown> | undefined)?.products
-        ?? {}) as Record<string, unknown>;
-
-      // Connect setup phase products
-      const opportunity = (products.opportunity ?? {}) as Record<string, unknown>;
-      pushIfString(ids.connectOpportunityIds, opportunity.id);
-      const paymentUnits = (products.payment_units ?? []) as unknown[];
-      if (Array.isArray(paymentUnits)) {
-        for (const pu of paymentUnits) {
-          pushIfString(ids.connectPaymentUnitIds, (pu as Record<string, unknown>)?.id);
-        }
-      }
-
-      // OCS phase products
-      const chatbot = (products.chatbot ?? {}) as Record<string, unknown>;
-      pushIfString(ids.ocsChatbotIds, chatbot.id);
-      pushIfString(ids.ocsCollectionIds, chatbot.collection_id);
-
-      // Solicitation / labs records
-      const solicitation = (products.solicitation ?? {}) as Record<string, unknown>;
-      pushIfString(ids.labsRecordIds, solicitation.id);
-
-      // Synthetic / workflow phase products (flat fields on products)
-      pushIfString(ids.labsWorkflowIds, products.workflow_id);
-      pushIfString(ids.labsPipelineIds, products.pipeline_id);
-      pushIfString(ids.labsSyntheticIds, products.synthetic_opp_id);
-
-      // CommCare apps
-      const learnApp = (products.learn_app ?? {}) as Record<string, unknown>;
-      const deliverApp = (products.deliver_app ?? {}) as Record<string, unknown>;
-      pushIfString(ids.commcareAppIds, learnApp.hq_app_id);
-      pushIfString(ids.commcareAppIds, deliverApp.hq_app_id);
-    }
-  }
-
-  return {
-    generatedAt: '',  // set by mergeFragments
-    oppSlugs: [oppSlug],
-    identifiers: ids,
-  };
-}
-
-function dedupeSort(xs: string[]): string[] {
-  return [...new Set(xs)].sort();
-}
-
-/** Merge fragments from many opps into one LiveSet, deduping and sorting. */
-export function mergeFragments(fragments: LiveSet[], generatedAt: string): LiveSet {
-  const out = emptyIdentifiers();
-  const slugs: string[] = [];
-  for (const frag of fragments) {
-    slugs.push(...frag.oppSlugs);
-    for (const k of Object.keys(out) as Array<keyof LiveSet['identifiers']>) {
-      out[k].push(...frag.identifiers[k]);
-    }
-  }
-  for (const k of Object.keys(out) as Array<keyof LiveSet['identifiers']>) {
-    out[k] = dedupeSort(out[k]);
-  }
-  return {
-    generatedAt,
-    oppSlugs: dedupeSort(slugs),
-    identifiers: out,
-  };
-}
-```
-
-- [ ] **Step 4: Run test to verify it passes**
-
-Run: `npx vitest run test/lib/sweep-live-set.test.ts`
-Expected: 5 tests pass.
-
-- [ ] **Step 5: Commit**
-
-```bash
-git add lib/sweep-live-set.ts test/lib/sweep-live-set.test.ts
-git commit -m "feat(sweep): pure live-set extractor with vitest coverage"
-```
-
----
-
-### Task 3: fingerprint scorer (pure function)
-
-**Files:**
-- Create: `lib/sweep-fingerprint.ts`
-- Test: `test/lib/sweep-fingerprint.test.ts`
-
-Drive-specific scoring rules (per the spec):
-- **high** — folder is directly under `ACE/`, name doesn't appear in `liveSet.oppSlugs`, AND name looks ACE-shaped (matches one of: starts with `CRISPR-`, contains a known archetype keyword, is lowercase-kebab and 3–40 chars).
-- **medium** — folder is directly under `ACE/`, name doesn't appear in `liveSet.oppSlugs`, but doesn't match an ACE-shaped pattern.
-- **low** — folder doesn't match either above (shouldn't normally be returned as an orphan; placeholder so callers always get a tier).
-
-The scorer doesn't filter — the caller does the live-set diff first; the scorer just classifies confidence on items already determined to be orphans.
-
-- [ ] **Step 1: Write the failing test**
-
-Create `test/lib/sweep-fingerprint.test.ts`:
-
-```typescript
-import { describe, expect, it } from 'vitest';
-import { scoreDriveFolder } from '../../lib/sweep-fingerprint';
-import type { LiveSet, DriveFolderInfo } from '../../lib/sweep-types';
-
-const LIVE_SET: LiveSet = {
-  generatedAt: '2026-05-15T12:00:00Z',
-  oppSlugs: ['turmeric', 'arnica'],
-  identifiers: {
-    connectProgramIds: [], connectOpportunityIds: [], connectPaymentUnitIds: [],
-    ocsChatbotIds: [], ocsCollectionIds: [], ocsSessionIds: [],
-    commcareAppIds: [], labsWorkflowIds: [], labsPipelineIds: [],
-    labsSyntheticIds: [], labsRecordIds: [], driveFileIds: [],
-  },
-};
-
-const folder = (overrides: Partial<DriveFolderInfo> = {}): DriveFolderInfo => ({
-  id: 'fld-x',
-  name: 'something',
-  createdTime: '2026-04-01T00:00:00Z',
-  parentId: 'ace-root',
-  ...overrides,
-});
-
-describe('scoreDriveFolder', () => {
-  it('returns high for ACE-shaped name (CRISPR-prefix)', () => {
-    const r = scoreDriveFolder(folder({ name: 'CRISPR-Test-001' }), LIVE_SET, 'ace-root');
-    expect(r.confidence).toBe('high');
-    expect(r.signals.some((s) => s.toLowerCase().includes('crispr'))).toBe(true);
-  });
-
-  it('returns high for kebab-case opp-style name', () => {
-    const r = scoreDriveFolder(folder({ name: 'paprika-pilot' }), LIVE_SET, 'ace-root');
-    expect(r.confidence).toBe('high');
-  });
-
-  it('returns medium for unrecognized name at ACE root', () => {
-    const r = scoreDriveFolder(folder({ name: 'README' }), LIVE_SET, 'ace-root');
-    expect(r.confidence).toBe('medium');
-  });
-
-  it('does not return high for an active opp slug', () => {
-    // Caller is responsible for filtering active opps out before scoring,
-    // but the scorer defensively avoids "high" if the name is in liveSet.oppSlugs.
-    const r = scoreDriveFolder(folder({ name: 'turmeric' }), LIVE_SET, 'ace-root');
-    expect(r.confidence).not.toBe('high');
-  });
-
-  it('returns low for folders not under ACE root', () => {
-    const r = scoreDriveFolder(
-      folder({ name: 'CRISPR-Test-001', parentId: 'some-other-folder' }),
-      LIVE_SET,
-      'ace-root',
-    );
-    expect(r.confidence).toBe('low');
-  });
-});
-```
-
-- [ ] **Step 2: Run test to verify it fails**
-
-Run: `npx vitest run test/lib/sweep-fingerprint.test.ts`
-Expected: FAIL — "Cannot find module".
-
-- [ ] **Step 3: Implement `lib/sweep-fingerprint.ts`**
-
-```typescript
-/**
- * Pure ACE-fingerprint scoring for Drive folders. Used by the sweep-drive
- * skill after the live-set diff has identified candidate orphans; this module
- * decides high / medium / low confidence so the human can triage in chunks.
- *
- * Per-system fingerprint helpers will be added in subsequent PRs (PR 2 for
- * Connect, PR 3 for OCS, PR 4 for HQ, PR 5 for labs). Each gets its own
- * exported function so the heuristics can be tuned independently.
- */
-
-import type { Confidence, DriveFolderInfo, LiveSet } from './sweep-types';
-
-const CRISPR_PREFIX = /^CRISPR-/i;
-const KEBAB_OPP_NAME = /^[a-z][a-z0-9-]{2,39}$/;
-
-export interface ScoreResult {
-  confidence: Confidence;
-  signals: string[];
-}
-
-/**
- * Score a Drive folder. Does NOT do the live-set diff itself — the caller is
- * expected to have already determined this folder is an orphan candidate
- * (i.e. its name does not match an active opp slug). The scorer defensively
- * downgrades to medium if it sees a name that IS in liveSet.oppSlugs in case
- * the caller passed it through.
- */
-export function scoreDriveFolder(
-  folder: DriveFolderInfo,
-  liveSet: LiveSet,
-  aceRootFolderId: string,
-): ScoreResult {
-  const signals: string[] = [];
-
-  if (folder.parentId !== aceRootFolderId) {
-    signals.push(`not under ACE root (parent=${folder.parentId})`);
-    return { confidence: 'low', signals };
-  }
-
-  if (liveSet.oppSlugs.includes(folder.name)) {
-    signals.push('name matches an active opp slug');
-    return { confidence: 'medium', signals };
-  }
-
-  if (CRISPR_PREFIX.test(folder.name)) {
-    signals.push('name has CRISPR- prefix (canonical test opp pattern)');
-    return { confidence: 'high', signals };
-  }
-
-  if (KEBAB_OPP_NAME.test(folder.name)) {
-    signals.push('name is kebab-case opp-style (3-40 chars, lowercase)');
-    return { confidence: 'high', signals };
-  }
-
-  signals.push('under ACE root but does not match a known ACE name pattern');
-  return { confidence: 'medium', signals };
-}
-```
-
-- [ ] **Step 4: Run test to verify it passes**
-
-Run: `npx vitest run test/lib/sweep-fingerprint.test.ts`
-Expected: 5 tests pass.
-
-- [ ] **Step 5: Commit**
-
-```bash
-git add lib/sweep-fingerprint.ts test/lib/sweep-fingerprint.test.ts
-git commit -m "feat(sweep): Drive-folder fingerprint scorer with confidence tiers"
-```
-
----
-
-### Task 4: report renderer (pure function)
-
-**Files:**
-- Create: `lib/sweep-report.ts`
-- Test: `test/lib/sweep-report.test.ts`
-
-Renders an `OrphanReport` to markdown. The same renderer is used by every per-system sweep (PRs 2–5), so the table shape needs to be system-agnostic; system-specific notes go in the `signals` field per orphan.
-
-- [ ] **Step 1: Write the failing test**
-
-Create `test/lib/sweep-report.test.ts`:
-
-```typescript
-import { describe, expect, it } from 'vitest';
-import { renderOrphanReport } from '../../lib/sweep-report';
-import type { OrphanReport } from '../../lib/sweep-types';
-
-const baseReport: OrphanReport = {
-  system: 'drive',
-  generatedAt: '2026-05-15T18:00:00Z',
-  liveSetGeneratedAt: '2026-05-15T17:58:00Z',
-  totals: { high: 0, medium: 0, low: 0 },
-  orphans: [],
-};
-
-describe('renderOrphanReport', () => {
-  it('renders header with system, timestamps, and totals', () => {
-    const md = renderOrphanReport({
-      ...baseReport,
-      totals: { high: 2, medium: 1, low: 0 },
-      orphans: [
-        { id: 'a', name: 'CRISPR-Test-001', createdTime: '2026-04-01T00:00:00Z',
-          confidence: 'high', signals: ['CRISPR- prefix'] },
-        { id: 'b', name: 'paprika-pilot', createdTime: '2026-04-02T00:00:00Z',
-          confidence: 'high', signals: ['kebab opp style'] },
-        { id: 'c', name: 'README', createdTime: '2026-03-01T00:00:00Z',
-          confidence: 'medium', signals: ['under ACE root, unknown pattern'] },
-      ],
-    });
-    expect(md).toContain('# Sweep report — drive');
-    expect(md).toContain('Generated: 2026-05-15T18:00:00Z');
-    expect(md).toContain('Live set: 2026-05-15T17:58:00Z');
-    expect(md).toContain('high: 2');
-    expect(md).toContain('medium: 1');
-    expect(md).toContain('low: 0');
-  });
-
-  it('groups orphans by confidence with high first', () => {
-    const md = renderOrphanReport({
-      ...baseReport,
-      totals: { high: 1, medium: 1, low: 0 },
-      orphans: [
-        { id: 'm', name: 'unknown', createdTime: '2026-04-02T00:00:00Z',
-          confidence: 'medium', signals: ['?'] },
-        { id: 'h', name: 'CRISPR-Test-X', createdTime: '2026-04-01T00:00:00Z',
-          confidence: 'high', signals: ['CRISPR-'] },
-      ],
-    });
-    expect(md.indexOf('## High confidence')).toBeLessThan(md.indexOf('## Medium confidence'));
-    expect(md.indexOf('CRISPR-Test-X')).toBeLessThan(md.indexOf('unknown'));
-  });
-
-  it('skips empty confidence sections', () => {
-    const md = renderOrphanReport({
-      ...baseReport,
-      totals: { high: 1, medium: 0, low: 0 },
-      orphans: [
-        { id: 'h', name: 'h', createdTime: '2026-04-01T00:00:00Z',
-          confidence: 'high', signals: ['x'] },
-      ],
-    });
-    expect(md).toContain('## High confidence');
-    expect(md).not.toContain('## Medium confidence');
-    expect(md).not.toContain('## Low confidence');
-  });
-
-  it('renders "No orphans found" when totals are all zero', () => {
-    const md = renderOrphanReport(baseReport);
-    expect(md).toContain('No orphans found.');
-  });
-});
-```
-
-- [ ] **Step 2: Run test to verify it fails**
-
-Run: `npx vitest run test/lib/sweep-report.test.ts`
-Expected: FAIL — "Cannot find module".
-
-- [ ] **Step 3: Implement `lib/sweep-report.ts`**
-
-```typescript
-/**
- * Pure markdown renderer for sweep orphan reports. Used by every per-system
- * sweep skill (PRs 2-5 add Connect, OCS, HQ, labs). Output is human-readable
- * markdown plus enough structure that a human can copy individual rows or
- * approve in chunks.
- */
-
-import type { Confidence, Orphan, OrphanReport } from './sweep-types';
-
-const ORDER: Confidence[] = ['high', 'medium', 'low'];
-
-const HEADER: Record<Confidence, string> = {
-  high: '## High confidence',
-  medium: '## Medium confidence',
-  low: '## Low confidence',
-};
-
-function rowsFor(orphans: Orphan[], tier: Confidence): Orphan[] {
-  return orphans.filter((o) => o.confidence === tier);
-}
-
-function renderTable(rows: Orphan[]): string {
-  const lines: string[] = [
-    '| ID | Name | Created | Signals |',
-    '|----|------|---------|---------|',
-  ];
-  for (const o of rows) {
-    const signals = o.signals.join('; ').replaceAll('|', '\\|');
-    lines.push(`| ${o.id} | ${o.name} | ${o.createdTime} | ${signals} |`);
-  }
-  return lines.join('\n');
-}
-
-export function renderOrphanReport(report: OrphanReport): string {
-  const parts: string[] = [];
-  parts.push(`# Sweep report — ${report.system}`);
-  parts.push('');
-  parts.push(`Generated: ${report.generatedAt}`);
-  parts.push(`Live set: ${report.liveSetGeneratedAt}`);
-  parts.push('');
-  parts.push(
-    `Totals — high: ${report.totals.high}, medium: ${report.totals.medium}, low: ${report.totals.low}`,
-  );
-  parts.push('');
-
-  const total = report.totals.high + report.totals.medium + report.totals.low;
-  if (total === 0) {
-    parts.push('No orphans found.');
-    return parts.join('\n') + '\n';
-  }
-
-  for (const tier of ORDER) {
-    const rows = rowsFor(report.orphans, tier);
-    if (rows.length === 0) continue;
-    parts.push(HEADER[tier]);
-    parts.push('');
-    parts.push(renderTable(rows));
-    parts.push('');
-  }
-
-  return parts.join('\n') + '\n';
-}
-```
-
-- [ ] **Step 4: Run test to verify it passes**
-
-Run: `npx vitest run test/lib/sweep-report.test.ts`
-Expected: 4 tests pass.
-
-- [ ] **Step 5: Commit**
-
-```bash
-git add lib/sweep-report.ts test/lib/sweep-report.test.ts
-git commit -m "feat(sweep): markdown orphan-report renderer"
-```
-
----
-
-### Task 5: `sweep-live-set` skill (agent procedure)
-
-**Files:**
-- Create: `skills/sweep-live-set/SKILL.md`
-
-This is a markdown skill (no code in this file). It tells the agent how to walk Drive, fetch YAMLs, call the Task 2 lib functions, and write the resulting `live-set.yaml` back to Drive under `ACE/_sweep/<timestamp>/live-set.yaml`.
-
-- [ ] **Step 1: Write the SKILL.md**
-
-Create `skills/sweep-live-set/SKILL.md`:
-
-```markdown
----
-name: sweep-live-set
-description: >
-  Walk Drive ACE/ and build a live-set of identifiers still referenced by
-  visible opps. Use before any per-system sweep.
-disable-model-invocation: true
----
-
-# sweep-live-set
-
-Build the cross-opp live-set that every per-system `/ace:sweep` consumes. The live-set is the safety mechanism: anything in a target system NOT in the live-set is a candidate orphan. This skill produces it; per-system skills consume it.
-
-## Inputs
-
-- `ACE_DRIVE_ROOT_FOLDER_ID` from `.env` — the Drive folder that contains every opp directory.
-
-## Products
-
-- `ACE/_sweep/<YYYYMMDD-HHMMSS>/live-set.yaml` — the merged `LiveSet` (schema: `lib/sweep-types.ts:LiveSet`).
-- Echoes the timestamped path so the caller (the sweep procedure doc) can pass it to per-system sweep skills.
-
-## Process
-
-1. **Compute the timestamp** for this sweep run: UTC `YYYYMMDD-HHMMSS`.
-2. **Ensure `ACE/_sweep/<timestamp>/` exists** via `drive_create_folder` under `ACE_DRIVE_ROOT_FOLDER_ID`. If `_sweep/` does not yet exist as the parent, create it first.
-3. **List opps:** call `drive_list_folder` on `ACE_DRIVE_ROOT_FOLDER_ID`. For each child folder, treat it as an opp if it contains an `opp.yaml` at the root OR an `inputs/` subfolder (matches `lib/doctor-drive-layout.ts:isOppFolder`). Skip `_sweep/` and any other reserved/leading-underscore folder.
-4. **For each opp:**
-   a. `drive_read_file` on `<opp>/opp.yaml` (if present; else empty string).
-   b. `drive_list_folder` on `<opp>/runs/` (if present; else empty list).
-   c. For each run folder, `drive_read_file` on `<opp>/runs/<run-id>/run_state.yaml` (skip if absent).
-   d. Call `extractOppFragment(oppSlug, oppYaml, runStateYamls)` from `lib/sweep-live-set.ts` to get a fragment.
-5. **Merge fragments:** call `mergeFragments(fragments, generatedAtIso)` to produce the final `LiveSet`.
-6. **Serialize as YAML** and `drive_create_file` to `ACE/_sweep/<timestamp>/live-set.yaml`.
-7. **Echo the Drive path and folder id** of the live-set file to the caller.
-
-## Implementation notes for agents
-
-- Use `npx tsx` to invoke a one-shot script that imports `lib/sweep-live-set.ts` if you need to run the merge from the terminal; or call the functions directly via the in-process TypeScript boundary if your harness allows it. Prefer one-shot script to keep the agent-side logic to MCP calls.
-- The script lives at `scripts/sweep-live-set.ts` if/when an agent needs to execute it directly. (Not in scope for this PR — agents read YAMLs via MCP and call the lib functions inline.)
-- If any opp folder is missing `opp.yaml`, that's a legacy/incomplete opp — still parse its runs but use `''` for the opp.yaml input; the fragment will simply omit `connectProgramIds`.
-
-## Failure modes
-
-- **No opps under `ACE/`** — produce an empty live-set; downstream sweeps will flag everything as an orphan candidate. That's intentional.
-- **Malformed YAML in an opp's files** — `extractOppFragment` silently treats unparseable input as `{}`. Surface a warning in the agent's chat output for each opp where this happens, but don't abort.
-
-## Related skills
-
-- `sweep-drive` consumes this skill's output.
-- Per-system sweep skills `sweep-connect`, `sweep-ocs`, `sweep-hq`, `sweep-labs` (added in PRs 2-5) will also consume it.
-```
-
-- [ ] **Step 2: Verify frontmatter is conformant**
-
-Run: `head -6 skills/sweep-live-set/SKILL.md`
-Expected output shows `name: sweep-live-set` matching the directory name and a `description:` ≤200 chars.
-
-- [ ] **Step 3: Commit**
-
-```bash
-git add skills/sweep-live-set/SKILL.md
-git commit -m "feat(sweep): add sweep-live-set skill (Drive walk → live-set.yaml)"
-```
-
----
-
-### Task 6: `sweep-drive` skill (agent procedure)
-
-**Files:**
-- Create: `skills/sweep-drive/SKILL.md`
-
-The Drive sweep proper. Reads the live-set, lists everything under `ACE/`, diffs, scores via `lib/sweep-fingerprint.ts`, renders the report via `lib/sweep-report.ts`, prompts for approval, calls `drive_trash_file`.
-
-- [ ] **Step 1: Write the SKILL.md**
-
-Create `skills/sweep-drive/SKILL.md`:
-
-```markdown
----
-name: sweep-drive
-description: >
-  Diff Drive ACE/ against the live-set, score orphan candidates, render a
-  triage report, and trash approved items. Use when sweeping Drive.
-disable-model-invocation: true
----
-
-# sweep-drive
-
-Find Drive folders under `ACE/` that no current opp references, score them, present them to the human for triage, and trash approved items via `drive_trash_file` (reversible — 30-day Drive bin).
-
-## Inputs
-
-- Live-set file path/id from `sweep-live-set` skill output (a Drive path like `ACE/_sweep/<timestamp>/live-set.yaml`).
-
-## Products
-
-- `ACE/_sweep/<timestamp>/drive-orphans.md` — human-readable triage report (markdown).
-- `ACE/_sweep/<timestamp>/drive-orphans.yaml` — machine-readable `OrphanReport` (for replay / per-item approval).
-- For each approved orphan: a Drive trash operation via `drive_trash_file`.
-
-## Process
-
-1. **Read the live-set:** `drive_read_file` on the path produced by `sweep-live-set`. Parse it as YAML into a `LiveSet`.
-2. **List `ACE/` children:** `drive_list_folder` on `ACE_DRIVE_ROOT_FOLDER_ID`. Filter to folders (mimeType `application/vnd.google-apps.folder`) and skip names starting with `_` (e.g. `_sweep/`).
-3. **Diff:** for each folder whose name is NOT in `liveSet.oppSlugs`, it is an orphan candidate.
-4. **Score:** call `scoreDriveFolder(folder, liveSet, ACE_DRIVE_ROOT_FOLDER_ID)` from `lib/sweep-fingerprint.ts` for each candidate. Collect into an `Orphan[]`.
-5. **Build the `OrphanReport`** with `system: 'drive'`, `generatedAt: now ISO`, `liveSetGeneratedAt: liveSet.generatedAt`, totals, and orphans.
-6. **Render the report** via `renderOrphanReport()` from `lib/sweep-report.ts`. `drive_create_file` to `ACE/_sweep/<timestamp>/drive-orphans.md`. Also serialize the YAML form to `drive-orphans.yaml` in the same folder.
-7. **Surface the report** to the human in chat: print the markdown report directly, then prompt for approval. Suggested chunks:
-   - "Approve all `high` confidence orphans? (N items)"
-   - "Approve all `medium`? (N items)"
-   - "Review individually?"
-8. **On approval:** for each approved orphan, call `drive_trash_file` with `fileId: orphan.id`. Report success/failure per item back to the human.
-9. **Re-verify (optional sanity check):** after trashing, `drive_list_folder` `ACE/` again and confirm the trashed names are gone. This catches partial failures.
-
-## Failure modes
-
-- **Live-set path doesn't resolve:** abort with a clear "Run `sweep-live-set` first" message.
-- **`drive_trash_file` fails on a Shared-Drive permission error:** report the item as "trash failed — needs admin"; don't retry, don't abort the rest of the batch.
-- **An approved orphan was already deleted by something else between report and execution:** treat as success.
-
-## Implementation notes for agents
-
-- This skill must be invoked AFTER `sweep-live-set` in the same `/ace:sweep` run — the live-set is the safety boundary. If the live-set is more than 24 hours old, regenerate it first (active opps may have changed).
-- All scoring is done locally via the `lib/sweep-fingerprint.ts` function; do not paraphrase the scoring rules into prompts.
-
-## Related skills
-
-- `sweep-live-set` produces the live-set this skill diffs against.
-- Future: `sweep-connect`, `sweep-ocs`, `sweep-hq`, `sweep-labs` follow the same pattern for their respective systems.
-```
-
-- [ ] **Step 2: Verify frontmatter**
-
-Run: `head -6 skills/sweep-drive/SKILL.md`
-Expected: `name: sweep-drive` matches directory.
-
-- [ ] **Step 3: Commit**
-
-```bash
-git add skills/sweep-drive/SKILL.md
-git commit -m "feat(sweep): add sweep-drive skill (live-set diff → trash)"
-```
-
----
-
-### Task 7: `sweep.md` orchestrator procedure doc
-
-**Files:**
-- Create: `agents/sweep.md`
-
-A procedure doc (not a subagent) per `CLAUDE.md` § Agent topology — it dispatches sub-skills, so it MUST run at level 0. Frontmatter retains `name:` / `description:` / `model:` for `/ace:status` and `/ace:docs` to keep working.
-
-- [ ] **Step 1: Write the procedure doc**
-
-Create `agents/sweep.md`:
-
-```markdown
----
-name: sweep
-description: >
-  Procedure doc for /ace:sweep — orchestrates live-set build then per-system
-  orphan sweep with human triage. Currently supports drive; per-system
-  expansions land in PRs 2-5.
-model: inherit
----
-
-# /ace:sweep — Orchestrator (procedure doc)
-
-This is a procedure doc, not a subagent. The `/ace:sweep` slash command reads it and executes the steps inline at level 0 (so it can call the `Agent` tool to dispatch leaf skills, per `CLAUDE.md` § Agent topology).
-
-## Arguments
-
-- `<system>` (optional) — one of `drive`, `connect`, `ocs`, `hq`, `labs`. If omitted, prompt the user to pick. Today only `drive` is implemented; the others print "not yet implemented; ships in PR <N>".
-
-## Process
-
-### Step 1: Determine system
-
-If the user passed `<system>`, use it. Otherwise, present:
-
-```
-Which system?
-  drive   — Drive folders under ACE/ (this PR)
-  connect — Connect programs / opportunities / payment-units (PR 2)
-  ocs     — OCS chatbots / collections / sessions (PR 3)
-  hq      — CommCare HQ apps (PR 4)
-  labs    — connect-labs workflows / pipelines / synthetic / records (PR 5)
-```
-
-If they pick a system other than `drive`, respond "Not yet implemented. Ships in PR <N>." and stop.
-
-### Step 2: Build the live-set
-
-Dispatch the `sweep-live-set` skill:
-
-```
-Agent(sweep-live-set)
-```
-
-Wait for it to return the live-set Drive path. Capture the timestamped sweep folder (e.g. `ACE/_sweep/20260515-180000/`) — every subsequent step writes into that same folder.
-
-### Step 3: Per-system sweep
-
-For `system == 'drive'`, dispatch `sweep-drive`:
-
-```
-Agent(sweep-drive, with: { liveSetPath: <from step 2>, sweepFolder: <from step 2> })
-```
-
-`sweep-drive` handles the human triage and trash loop itself; this orchestrator only waits for completion.
-
-### Step 4: Summary
-
-Print:
-
-```
-/ace:sweep drive — complete
-
-Sweep folder: ACE/_sweep/<timestamp>/
-Report:       ACE/_sweep/<timestamp>/drive-orphans.md
-Trashed:      <N> high-confidence items, <M> medium-confidence items
-Skipped:      <K> items (low confidence or human-rejected)
-```
-
-## Notes
-
-- The procedure doc is the only thing that calls `Agent`. Each sub-skill (`sweep-live-set`, `sweep-drive`) is a leaf — no nested `Agent` dispatch.
-- Per `CLAUDE.md` § Phase preconditions are restored, not adapted: do not try to detect "is there a stale live-set" — just regenerate it every time. The live-set is cheap (~seconds to build).
-- This procedure doc is invoked once per sweep run; it doesn't persist state across runs. Persistent state (the sweep folders themselves) lives in Drive under `ACE/_sweep/`.
-```
-
-- [ ] **Step 2: Verify frontmatter conforms**
-
-Run: `head -6 agents/sweep.md`
-Expected: `name: sweep` and `model: inherit` present.
-
-- [ ] **Step 3: Commit**
-
-```bash
-git add agents/sweep.md
-git commit -m "feat(sweep): add /ace:sweep orchestrator procedure doc"
-```
-
----
-
-### Task 8: `/ace:sweep` slash command
-
-**Files:**
-- Create: `commands/sweep.md`
-
-The slash command. Reads `agents/sweep.md` and executes it inline. Follows the pattern in `commands/status.md` and `commands/run.md`.
-
-- [ ] **Step 1: Inspect an existing command for the exact frontmatter pattern**
-
-Run: `cat commands/status.md | head -10`
-Look at the `allowed-tools:` list — we need analogous tools (Read for the procedure doc, Bash for any inline shell, the Drive MCP atoms for live-set + sweep-drive, and the Agent tool implicitly).
-
-- [ ] **Step 2: Write the command**
-
-Create `commands/sweep.md`:
-
-```markdown
----
-description: Sweep orphaned ACE artifacts in a given system (drive supported; connect, ocs, hq, labs coming)
-allowed-tools: [Read, Bash, mcp__plugin_ace_ace-gdrive__drive_list_folder, mcp__plugin_ace_ace-gdrive__drive_read_file, mcp__plugin_ace_ace-gdrive__drive_create_file, mcp__plugin_ace_ace-gdrive__drive_create_folder, mcp__plugin_ace_ace-gdrive__drive_trash_file]
----
-
-# /ace:sweep
-
-Find and clean up orphaned artifacts ACE has created across the systems it touches.
-
-## Arguments
-
-- `<system>` (optional) — one of `drive`, `connect`, `ocs`, `hq`, `labs`. Omit to be prompted.
-
-## Process
-
-Read `agents/sweep.md` and execute its procedure inline (this is a procedure doc, not a subagent — see `CLAUDE.md` § Agent topology). Pass `<system>` through if provided.
-
-## Examples
-
-```
-/ace:sweep              # prompts for system
-/ace:sweep drive        # sweeps Drive end-to-end
-/ace:sweep connect      # "Not yet implemented — ships in PR 2"
-```
-```
-
-- [ ] **Step 3: Commit**
-
-```bash
-git add commands/sweep.md
-git commit -m "feat(sweep): add /ace:sweep slash command"
-```
-
----
-
-### Task 9: Verify auto-discovery + end-to-end smoke
-
-**Files:**
-- Modify: (potentially) `.claude-plugin/plugin.json` if skills/commands are NOT auto-discovered
-
-- [ ] **Step 1: Check whether plugin.json registers skills/commands explicitly**
-
-Run: `grep -E 'skills|commands' .claude-plugin/plugin.json | head -20`
-
-If the file lists skills/commands explicitly (e.g. an array of paths), you'll need to add the new entries. If it uses a glob or directory pattern, no change needed. The existing `commands/status.md` and `skills/<existing>/SKILL.md` files weren't registered manually based on prior work — verify by inspection.
-
-Expected: no manual registration required (ACE relies on Claude Code's auto-discovery from `skills/` and `commands/`).
-
-- [ ] **Step 2: Run the full test suite**
-
-Run: `npm test`
-Expected: all existing tests still pass; the 3 new test files (sweep-live-set, sweep-fingerprint, sweep-report) report a total of 14 tests passing.
-
-- [ ] **Step 3: Run version bump (worktree-safe)**
-
-Per `CLAUDE.md` § Plugin updates: bump via `scripts/version-bump.sh`. This updates VERSION, package.json, `.claude-plugin/plugin.json`, `.claude-plugin/marketplace.json` atomically.
-
-Run: `bash scripts/version-bump.sh`
-Expected: prints the new version (e.g. `0.13.213`) and the four files updated.
-
-- [ ] **Step 4: Smoke test on a live machine (optional, only if local 1Password + Drive auth is set up)**
-
-In a fresh Claude Code session with this branch checked out and `/reload-plugins` run:
-
-```
-/ace:sweep drive
-```
-
-Expected: it walks `ACE/`, produces a live-set, prints a Drive orphan report. Don't approve trash unless you're prepared for the trash to actually happen.
-
-If you can't smoke test locally, that's fine — the unit tests cover the pure logic and the SKILL.md / procedure-doc are exercised by code review.
-
-- [ ] **Step 5: Commit version bump and any auto-discovery findings**
-
-```bash
-git add VERSION package.json .claude-plugin/plugin.json .claude-plugin/marketplace.json
-git commit -m "chore: bump version for /ace:sweep PR 1"
-```
-
-- [ ] **Step 6: Push and open PR per `CLAUDE.md` § Git worktrees and merging to main**
-
-```bash
-git push -u origin HEAD
-gh pr create --title "feat(sweep): /ace:sweep PR 1 — foundation + Drive" --body "$(cat <<'EOF'
-## Summary
-
-Ships `/ace:sweep drive` end-to-end and lays the foundation that PRs 2-5 (Connect / OCS / HQ / labs) will extend.
-
-- Pure-function libs: `sweep-live-set.ts`, `sweep-fingerprint.ts`, `sweep-report.ts`
-- Skills: `sweep-live-set`, `sweep-drive`
-- Procedure doc: `agents/sweep.md`
-- Slash command: `/ace:sweep`
-
-Spec: `docs/superpowers/specs/2026-05-15-ace-sweep-design.md`
-Plan: `docs/superpowers/plans/2026-05-15-ace-sweep-pr1-foundation-and-drive.md`
-
-## Test plan
-
-- [ ] `npm test` — all unit tests pass (3 new files, 14 new tests)
-- [ ] `/ace:sweep drive` walks ACE/, produces live-set + orphan report
-- [ ] `/ace:sweep connect` (and ocs/hq/labs) prints "Not yet implemented — ships in PR N"
-- [ ] Approving high-confidence orphans trashes them via `drive_trash_file`
-EOF
-)"
-gh pr merge $(gh pr view --json number -q .number) --auto --merge
-```
-
-- [ ] **Step 7: After PR merges, run `/ace:update` in the current session per CLAUDE.md**
-
-```
-/ace:update
-```
-
----
-
-## Self-review (already done by plan author — kept as a record)
-
-**Spec coverage:**
-- `live-set diff` mechanism → Tasks 2, 5
-- Drive sweep with fingerprint scoring → Tasks 3, 6
-- Triage report → Tasks 4, 6
-- Auto-delete via `drive_trash_file` → Task 6
-- Procedure doc + slash command → Tasks 7, 8
-- Per-system skills as a pattern PRs 2–5 can extend → all skill files are shaped generically (`renderOrphanReport` is system-agnostic; fingerprint module has system-specific functions added per PR)
-
-**Placeholder scan:** no TBDs. Type contracts are pinned at top of plan; every task uses the same names.
-
-**Type consistency:** `LiveSet`, `Orphan`, `OrphanReport`, `DriveFolderInfo`, `Confidence` defined in Task 1, referenced verbatim in Tasks 2–4 and 6. Function names match across plan and code: `extractOppFragment`, `mergeFragments`, `scoreDriveFolder`, `renderOrphanReport`.
diff --git a/docs/superpowers/plans/2026-05-21-work-order-skill.md b/docs/superpowers/plans/2026-05-21-work-order-skill.md
deleted file mode 100644
index d95863f1..00000000
--- a/docs/superpowers/plans/2026-05-21-work-order-skill.md
+++ /dev/null
@@ -1,1454 +0,0 @@
-# pdd-to-work-order Skill Implementation Plan
-
-> **For agentic workers:** REQUIRED SUB-SKILL: Use superpowers:subagent-driven-development (recommended) or superpowers:executing-plans to implement this plan task-by-task. Steps use checkbox (`- [ ]`) syntax for tracking.
-
-**Goal:** Add a new Phase 1 skill (`pdd-to-work-order`) plus its `-qa` and `-eval` companions that draft a contractual Work Order from the approved PDD and the run's `decisions.yaml`, render it to a clean Google Doc, and integrate into the `idea-to-design` agent.
-
-**Architecture:** Three new skills under `skills/` (producer + QA + eval), one new bootstrap script that creates the Drive template once per environment, and an additive edit to `agents/idea-to-design.md` that wires the new skills in as Step 2 / 2.4 / 2.5. QA static checks live as pure functions in `checks.ts` with vitest coverage; producer and eval skills are SKILL.md instruction artifacts. Work-order-specific load-bearing fields are prefixed `wo-*` in `decisions.yaml` to avoid colliding with future Phase 8 solicitation rows.
-
-**Tech Stack:** TypeScript (ESM, `npx tsx`), Google Drive + Docs MCPs, vitest, YAML for verdicts and decisions.
-
-**Spec:** `docs/superpowers/specs/2026-05-21-work-order-skill-design.md`
-
----
-
-## File Structure
-
-**Create:**
-- `skills/pdd-to-work-order/SKILL.md` — producer skill instructions
-- `skills/pdd-to-work-order-qa/SKILL.md` — QA skill instructions
-- `skills/pdd-to-work-order-qa/checks.ts` — static check functions (importable)
-- `skills/pdd-to-work-order-eval/SKILL.md` — eval rubric instructions
-- `scripts/bootstrap-work-order-template.ts` — one-time Drive template creation
-- `templates/work-order-template.md` — canonical template content (uploaded by bootstrap script)
-- `test/skills/pdd-to-work-order-qa/checks.test.ts` — vitest unit tests for QA checks
-- `test/skills/pdd-to-work-order-qa/fixtures/good-work-order.md` — passes all checks
-- `test/skills/pdd-to-work-order-qa/fixtures/missing-sections.md` — fails section check
-- `test/skills/pdd-to-work-order-qa/fixtures/bad-payment-schedule.md` — fails sum-to-100 check
-- `test/skills/pdd-to-work-order-qa/fixtures/missing-wo-decisions.yaml` — fails decision-rows check
-- `test/skills/pdd-to-work-order-qa/fixtures/good-decisions.yaml` — has all required `wo-*` rows
-- `playbook/integrations/work-order-template.md` — operator-facing bootstrap docs
-
-**Modify:**
-- `agents/idea-to-design.md` — add Step 2 / 2.4 / 2.5; update frontmatter `skills:` array
-- `.env.tpl` — add `WORK_ORDER_TEMPLATE_ID` 1P reference
-- `lib/artifact-manifest.ts` — register new artifacts (`pdd-to-work-order.gdoc`, `pdd-to-work-order-qa_result.yaml`, `pdd-to-work-order-eval_verdict.yaml`)
-- `skills/_qa-decisions.md` — add row for `pdd-to-work-order-qa` (status: has-qa)
-
----
-
-## Task 1: Register new artifacts in the manifest
-
-**Files:**
-- Modify: `lib/artifact-manifest.ts`
-- Test: `test/artifact-manifest.test.ts` (existing; adding entries should leave it green)
-
-- [ ] **Step 1: Read the existing PDD artifact entry to mirror its shape**
-
-Read `lib/artifact-manifest.ts` and locate the entry for `path: '1-design/idea-to-pdd.md'`. Copy its shape exactly.
-
-- [ ] **Step 2: Add three new artifact entries**
-
-Append to the artifact list (alphabetical-ish by path within `1-design/` is fine; put right after the `idea-to-pdd*` cluster):
-
-```ts
-  {
-    path: '1-design/pdd-to-work-order.gdoc',
-    producedBy: 'pdd-to-work-order',
-    consumedBy: [
-      'pdd-to-work-order-qa',
-      'pdd-to-work-order-eval',
-    ],
-    phase: 'design',
-    required: false,
-    description: 'Contractual Work Order draft derived from the PDD and decisions.yaml. Generic by default — Partner identity is a placeholder unless an LLO was supplied. Re-runs create pdd-to-work-order-2.gdoc, pdd-to-work-order-3.gdoc, etc.; products.work_order in run_state.yaml points at the latest. Parallel to Phase 8 solicitation, not a replacement. Spec: docs/superpowers/specs/2026-05-21-work-order-skill-design.md',
-  },
-  {
-    path: '1-design/pdd-to-work-order-qa_result.yaml',
-    producedBy: 'pdd-to-work-order-qa',
-    consumedBy: ['ace-orchestrator', 'pdd-to-work-order-eval'],
-    phase: 'design',
-    required: false,
-    description: 'QA verdict for pdd-to-work-order: structural pass/fail across the 8 checks defined in skills/pdd-to-work-order-qa/checks.ts.',
-  },
-  {
-    path: '1-design/pdd-to-work-order-eval_verdict.yaml',
-    producedBy: 'pdd-to-work-order-eval',
-    consumedBy: ['ace-orchestrator', 'opp-eval'],
-    phase: 'design',
-    required: false,
-    description: 'Per-skill -eval verdict for pdd-to-work-order: contractual clarity, PDD alignment, decisions traceability, verification realism, archetype fit. Shape matches skills/README.md § QA vs Eval.',
-  },
-```
-
-- [ ] **Step 3: Run manifest tests**
-
-Run: `npx vitest run test/artifact-manifest.test.ts`
-Expected: PASS (the three new entries should not break existing fixture validations because they are `required: false`).
-
-- [ ] **Step 4: Commit**
-
-```bash
-git add lib/artifact-manifest.ts
-git commit -m "feat(manifest): register work-order artifacts"
-```
-
----
-
-## Task 2: Author the canonical work-order template (markdown)
-
-**Files:**
-- Create: `templates/work-order-template.md`
-
-- [ ] **Step 1: Write the template**
-
-The template is the canonical body content the bootstrap script will upload to Drive as a Google Doc. Tokens use `{{...}}` (snake_case) so the skill can replace them via `docs_batch_update`. Mirror the malaria example exactly, with tokenized fields.
-
-```markdown
-# Work Order Agreement #{{wo_number}}
-## {{opp_title}}
-
-| Work Order Number | {{wo_number}} |
-|---|---|
-| Work Order Date | {{wo_date}} |
-| Work Order Title | {{opp_title}} |
-| Period of Performance | {{wo_period_of_performance}} |
-
-## 1. Background
-
-{{background_body}}
-
-## 2. Scope of Work
-
-{{scope_body}}
-
-## 3. Geographic Coverage
-
-{{geographic_coverage_body}}
-
-## 4. Deliverables and Verification
-
-### 4.1 Primary Deliverable
-
-{{primary_deliverable_body}}
-
-### 4.2 Definition of a Verified Unit
-
-{{verified_unit_body}}
-
-### 4.3 Reporting Deliverables
-
-{{reporting_body}}
-
-## 5. Timeline and Milestones
-
-{{timeline_table}}
-
-The Partner will flag any timeline risk in writing to Dimagi within 24 hours of identification.
-
-## 6. Payment Terms
-
-### 6.1 Total Not-to-Exceed
-
-Dimagi's total financial commitment under this Work Order is USD {{wo_total_not_to_exceed_usd}}, not-to-exceed, inclusive of all field collection costs, FLW compensation, supervision, transport, and partner reporting time.
-
-### 6.2 Payment Schedule
-
-{{payment_schedule_table}}
-
-Dimagi will pay only for verified units.
-
-## 7. Roles and Responsibilities
-
-{{roles_raci_table}}
-
-## 8. Permissions, Ethics, and Compliance
-
-### 8.1 Permissions
-
-{{permissions_body}}
-
-### 8.2 Ethics
-
-{{ethics_body}}
-
-### 8.3 Security and Duty of Care
-
-The Partner retains sole responsibility for the safety and security of its field teams. The Partner will not deploy FLWs to areas where the prevailing security situation, in the Partner's professional judgement, poses an unacceptable risk.
-
-## 9. Data Handling
-
-{{data_handling_table}}
-
-## Signatures
-
-IN WITNESS WHEREOF, the parties hereto have caused this Work Order to be executed by their authorized agents as of the date first above written, and annexed to the parties' MSA dated __________________.
-
-**Subcontractor**
-
-By: __________________________
-
-Name: [Partner Name]
-
-Title: [Partner Title]
-
-Date: ________________________
-
-Address for correspondence:
-[Partner Address]
-
-**Dimagi, Inc.**
-
-By: __________________________
-
-Name: Lucina Tse
-
-Title: COO Dimagi, Inc.
-
-Date: ________________________
-
-Address for correspondence:
-245 Main Street, 2nd Floor,
-Cambridge, MA 02142
-(617) 649-2214; legal@dimagi.com
-
-## Annexures
-
-- Annexure A: Program Design Document — see {{pdd_link}}
-- Annexure B: {{annexure_b_placeholder}}
-```
-
-- [ ] **Step 2: Commit**
-
-```bash
-git add templates/work-order-template.md
-git commit -m "feat(templates): canonical work-order template content"
-```
-
----
-
-## Task 3: Bootstrap script for the Drive template
-
-**Files:**
-- Create: `scripts/bootstrap-work-order-template.ts`
-
-- [ ] **Step 1: Read the OCS bootstrap script as a pattern reference**
-
-Read `scripts/bootstrap-ocs-golden-template.ts` (first 200 lines is enough). Note the env loading + duplicate-check + force-mode flag pattern. The work-order bootstrap is simpler — no Playwright, no chatbot cloning — just Drive MCP operations.
-
-- [ ] **Step 2: Write the bootstrap script**
-
-```typescript
-/**
- * Bootstrap the ACE Work Order template (Google Doc).
- *
- * One-time (or refresh) setup. Uploads templates/work-order-template.md to
- * Drive as a Google Doc, lives at the configured ACE templates folder, and
- * prints the resulting file_id for recording as WORK_ORDER_TEMPLATE_ID in
- * the ACE environment's .env.
- *
- * Usage:
- *   ACE_TEMPLATES_FOLDER_ID=<folder id> \
- *     npx tsx scripts/bootstrap-work-order-template.ts
- *
- * Refresh: set WORK_ORDER_BOOTSTRAP_FORCE=1 to delete the existing template
- * (by name) and recreate.
- */
-
-import * as fs from 'node:fs/promises';
-import * as path from 'node:path';
-import { google } from 'googleapis';
-import { loadEnv } from '../lib/env-loader.js'; // or whatever the existing helper is
-
-const TEMPLATE_NAME = process.env.WORK_ORDER_TEMPLATE_NAME ?? 'ACE Work Order Template';
-const PARENT_FOLDER_ID = process.env.ACE_TEMPLATES_FOLDER_ID;
-const FORCE = process.env.WORK_ORDER_BOOTSTRAP_FORCE === '1';
-
-async function main() {
-  loadEnv(); // populates process.env from $CLAUDE_PLUGIN_DATA/.env
-
-  if (!PARENT_FOLDER_ID) {
-    console.error('ACE_TEMPLATES_FOLDER_ID is required.');
-    process.exit(2);
-  }
-
-  const templatePath = path.resolve(__dirname, '..', 'templates', 'work-order-template.md');
-  const body = await fs.readFile(templatePath, 'utf-8');
-
-  const auth = new google.auth.GoogleAuth({
-    keyFile: process.env.GOOGLE_APPLICATION_CREDENTIALS,
-    scopes: ['https://www.googleapis.com/auth/drive', 'https://www.googleapis.com/auth/documents'],
-  });
-  const drive = google.drive({ version: 'v3', auth });
-
-  // Duplicate check
-  const existing = await drive.files.list({
-    q: `name = '${TEMPLATE_NAME.replace(/'/g, "\\'")}' and '${PARENT_FOLDER_ID}' in parents and trashed = false`,
-    fields: 'files(id,name)',
-    supportsAllDrives: true,
-    includeItemsFromAllDrives: true,
-  });
-  if ((existing.data.files ?? []).length > 0) {
-    const f = existing.data.files![0];
-    if (!FORCE) {
-      console.error(`Template already exists at file_id=${f.id} (name=${f.name}). Re-run with WORK_ORDER_BOOTSTRAP_FORCE=1 to recreate.`);
-      console.log(f.id);
-      process.exit(0);
-    }
-    await drive.files.update({ fileId: f.id!, requestBody: { trashed: true }, supportsAllDrives: true });
-    console.error(`Trashed existing template file_id=${f.id}.`);
-  }
-
-  // Upload markdown body, convert to Google Doc
-  const created = await drive.files.create({
-    requestBody: {
-      name: TEMPLATE_NAME,
-      mimeType: 'application/vnd.google-apps.document',
-      parents: [PARENT_FOLDER_ID],
-    },
-    media: { mimeType: 'text/markdown', body },
-    fields: 'id',
-    supportsAllDrives: true,
-  });
-
-  console.error(`Created ACE Work Order template file_id=${created.data.id}`);
-  console.log(created.data.id); // stdout: bare file_id for capture
-}
-
-main().catch((err) => {
-  console.error(err);
-  process.exit(1);
-});
-```
-
-Adapt imports to match what the existing scripts use (`loadEnv` may not exist with that name — check the imports in `bootstrap-ocs-golden-template.ts` and align).
-
-- [ ] **Step 3: Smoke-test the script**
-
-Run (will hit live Drive — only do this when ready to provision the real template):
-```
-ACE_TEMPLATES_FOLDER_ID=<real-folder-id> npx tsx scripts/bootstrap-work-order-template.ts
-```
-Expected stdout: a Drive file_id. Verify by opening that file_id in the browser.
-
-If not ready to provision live, validate the script compiles:
-```
-npx tsc --noEmit scripts/bootstrap-work-order-template.ts
-```
-Expected: no errors.
-
-- [ ] **Step 4: Commit**
-
-```bash
-git add scripts/bootstrap-work-order-template.ts
-git commit -m "feat(scripts): bootstrap-work-order-template script"
-```
-
----
-
-## Task 4: QA `checks.ts` — TDD the static checks
-
-**Files:**
-- Create: `skills/pdd-to-work-order-qa/checks.ts`
-- Create: `test/skills/pdd-to-work-order-qa/checks.test.ts`
-- Create: `test/skills/pdd-to-work-order-qa/fixtures/good-work-order.md`
-- Create: `test/skills/pdd-to-work-order-qa/fixtures/missing-sections.md`
-- Create: `test/skills/pdd-to-work-order-qa/fixtures/bad-payment-schedule.md`
-- Create: `test/skills/pdd-to-work-order-qa/fixtures/missing-wo-decisions.yaml`
-- Create: `test/skills/pdd-to-work-order-qa/fixtures/good-decisions.yaml`
-
-The 8 checks (per spec):
-1. `all_required_sections_present`
-2. `required_wo_decisions_present`
-3. `period_of_performance_complete`
-4. `payment_schedule_sums_to_100`
-5. `total_nte_present`
-6. `signature_blocks_present`
-7. `archetype_appropriate_scope`
-8. `no_scaffolding_markers`
-
-- [ ] **Step 1: Write fixture — `good-work-order.md`**
-
-Copy `templates/work-order-template.md` and replace every `{{...}}` token with realistic content. Set archetype-marker phrasing inside the `Scope of Work` section (`per visit`, photo + GPS phrasing) so the archetype-fit check has something to grep. Save to the fixture path.
-
-- [ ] **Step 2: Write fixture — `missing-sections.md`**
-
-Same as `good-work-order.md` but delete the entire `## 6. Payment Terms` section and its sub-sections.
-
-- [ ] **Step 3: Write fixture — `bad-payment-schedule.md`**
-
-Same as `good-work-order.md` but in the payment schedule table, set the milestone percentages to 40% + 50% (sums to 90, not 100).
-
-- [ ] **Step 4: Write fixture — `good-decisions.yaml`**
-
-```yaml
-decisions:
-  - id: archetype-selection
-    phase: 1-design
-    skill: idea-to-pdd
-    question: Which delivery archetype best fits?
-    default: atomic-visit
-    options_considered: [atomic-visit, focus-group, multi-stage]
-    source: pdd-archetype-frontmatter
-    status: applied
-  - id: wo-number
-    phase: 1-design
-    skill: pdd-to-work-order
-    question: Sequence number for this WO under the MSA
-    default: "[WO-NUMBER]"
-    options_considered: []
-    source: placeholder
-    status: open
-    notes: Human fills in the next free WO number under the active MSA.
-  - id: wo-period-of-performance
-    phase: 1-design
-    skill: pdd-to-work-order
-    question: Start + end dates for the engagement
-    default: "2026-05-22 to 2026-07-31"
-    options_considered: []
-    source: pdd-timeline-section
-    status: applied
-  - id: wo-total-not-to-exceed-usd
-    phase: 1-design
-    skill: pdd-to-work-order
-    question: Total NTE budget cap
-    default: "2500"
-    options_considered: []
-    source: pdd-budget-plausibility
-    status: applied
-  - id: wo-payment-schedule-split
-    phase: 1-design
-    skill: pdd-to-work-order
-    question: Milestone payment percentages
-    default: "40/60"
-    options_considered: ["50/50", "40/60", "30/40/30"]
-    source: ace-default
-    status: applied
-```
-
-- [ ] **Step 5: Write fixture — `missing-wo-decisions.yaml`**
-
-Same as `good-decisions.yaml` but delete the `wo-total-not-to-exceed-usd` and `wo-payment-schedule-split` rows.
-
-- [ ] **Step 6: Write the failing test file**
-
-```typescript
-// test/skills/pdd-to-work-order-qa/checks.test.ts
-import { describe, it, expect } from 'vitest';
-import * as fs from 'node:fs';
-import * as path from 'node:path';
-import {
-  checkAllRequiredSectionsPresent,
-  checkRequiredWoDecisionsPresent,
-  checkPeriodOfPerformanceComplete,
-  checkPaymentScheduleSumsTo100,
-  checkTotalNtePresent,
-  checkSignatureBlocksPresent,
-  checkArchetypeAppropriateScope,
-  checkNoScaffoldingMarkers,
-  CHECKS,
-} from '../../../skills/pdd-to-work-order-qa/checks';
-
-const FX = (name: string) => fs.readFileSync(path.join(__dirname, 'fixtures', name), 'utf-8');
-
-describe('pdd-to-work-order-qa checks', () => {
-  it('exports CHECKS array in canonical order', () => {
-    expect(CHECKS.map((c) => c.id)).toEqual([
-      'all_required_sections_present',
-      'required_wo_decisions_present',
-      'period_of_performance_complete',
-      'payment_schedule_sums_to_100',
-      'total_nte_present',
-      'signature_blocks_present',
-      'archetype_appropriate_scope',
-      'no_scaffolding_markers',
-    ]);
-  });
-
-  describe('all_required_sections_present', () => {
-    it('passes on good fixture', () => {
-      const r = checkAllRequiredSectionsPresent(FX('good-work-order.md'));
-      expect(r.pass).toBe(true);
-    });
-    it('fails when Payment Terms section is missing', () => {
-      const r = checkAllRequiredSectionsPresent(FX('missing-sections.md'));
-      expect(r.pass).toBe(false);
-      expect(r.detail).toMatch(/Payment Terms/i);
-    });
-  });
-
-  describe('required_wo_decisions_present', () => {
-    it('passes when all four wo-* rows present', () => {
-      const r = checkRequiredWoDecisionsPresent(FX('good-decisions.yaml'));
-      expect(r.pass).toBe(true);
-    });
-    it('fails when wo-total-not-to-exceed-usd row missing', () => {
-      const r = checkRequiredWoDecisionsPresent(FX('missing-wo-decisions.yaml'));
-      expect(r.pass).toBe(false);
-      expect(r.detail).toMatch(/wo-total-not-to-exceed-usd/);
-    });
-  });
-
-  describe('period_of_performance_complete', () => {
-    it('passes when both dates present', () => {
-      const r = checkPeriodOfPerformanceComplete(FX('good-work-order.md'));
-      expect(r.pass).toBe(true);
-    });
-    it('fails when only one date is present', () => {
-      const text = FX('good-work-order.md').replace(/2026-05-22 to 2026-07-31/g, '2026-05-22');
-      const r = checkPeriodOfPerformanceComplete(text);
-      expect(r.pass).toBe(false);
-    });
-  });
-
-  describe('payment_schedule_sums_to_100', () => {
-    it('passes on good fixture', () => {
-      const r = checkPaymentScheduleSumsTo100(FX('good-work-order.md'));
-      expect(r.pass).toBe(true);
-    });
-    it('fails when percentages sum to 90', () => {
-      const r = checkPaymentScheduleSumsTo100(FX('bad-payment-schedule.md'));
-      expect(r.pass).toBe(false);
-      expect(r.detail).toMatch(/sum to 100/i);
-    });
-  });
-
-  describe('total_nte_present', () => {
-    it('passes when a USD amount or placeholder is present in section 6.1', () => {
-      const r = checkTotalNtePresent(FX('good-work-order.md'));
-      expect(r.pass).toBe(true);
-    });
-    it('fails when section 6.1 lacks any USD amount or placeholder', () => {
-      const text = FX('good-work-order.md').replace(/USD 2500/i, 'USD ');
-      const r = checkTotalNtePresent(text);
-      expect(r.pass).toBe(false);
-    });
-  });
-
-  describe('signature_blocks_present', () => {
-    it('passes when both Subcontractor and Dimagi blocks present', () => {
-      const r = checkSignatureBlocksPresent(FX('good-work-order.md'));
-      expect(r.pass).toBe(true);
-    });
-    it('fails when Subcontractor block is missing', () => {
-      const text = FX('good-work-order.md').replace(/\*\*Subcontractor\*\*/g, '');
-      const r = checkSignatureBlocksPresent(text);
-      expect(r.pass).toBe(false);
-    });
-  });
-
-  describe('archetype_appropriate_scope', () => {
-    it('passes when atomic-visit scope mentions per visit and photo/GPS', () => {
-      const r = checkArchetypeAppropriateScope(FX('good-work-order.md'), 'atomic-visit');
-      expect(r.pass).toBe(true);
-    });
-    it('fails when atomic-visit scope lacks photo/GPS', () => {
-      const text = FX('good-work-order.md').replace(/photo|GPS/gi, 'data');
-      const r = checkArchetypeAppropriateScope(text, 'atomic-visit');
-      expect(r.pass).toBe(false);
-    });
-    it('passes when focus-group scope mentions attestation + gdoc', () => {
-      const text = FX('good-work-order.md')
-        .replace(/per visit/gi, 'per session')
-        .replace(/photo and GPS capture/gi, 'attestation form submission and gdoc write-up');
-      const r = checkArchetypeAppropriateScope(text, 'focus-group');
-      expect(r.pass).toBe(true);
-    });
-  });
-
-  describe('no_scaffolding_markers', () => {
-    it('passes when no markers present', () => {
-      const r = checkNoScaffoldingMarkers(FX('good-work-order.md'));
-      expect(r.pass).toBe(true);
-    });
-    it('fails when <<TBD>> leaks through', () => {
-      const text = FX('good-work-order.md').replace('Background', '<<TBD>>');
-      const r = checkNoScaffoldingMarkers(text);
-      expect(r.pass).toBe(false);
-      expect(r.detail).toMatch(/<<TBD>>/);
-    });
-  });
-});
-```
-
-- [ ] **Step 7: Run the tests to verify they fail**
-
-Run: `npx vitest run test/skills/pdd-to-work-order-qa/checks.test.ts`
-Expected: ALL FAIL (module does not exist yet).
-
-- [ ] **Step 8: Implement `checks.ts`**
-
-```typescript
-// skills/pdd-to-work-order-qa/checks.ts
-/**
- * Static QA checks for `pdd-to-work-order-qa`.
- *
- * Each check is a pure function that takes the artifact text (work-order
- * markdown or decisions YAML) and returns a `QACheckResult`. Imported by
- * the skill body at runtime and by per-skill tests under
- * `test/skills/pdd-to-work-order-qa/`.
- *
- * The `CHECKS` array is the canonical ordering — surface in the SKILL.md
- * `## Checks` table simultaneously when adding a check.
- */
-
-import type { QACheck, QACheckResult } from '../../lib/qa-types';
-
-const REQUIRED_SECTIONS = [
-  'Background',
-  'Scope of Work',
-  'Geographic Coverage',
-  'Deliverables and Verification',
-  'Timeline and Milestones',
-  'Payment Terms',
-  'Roles and Responsibilities',
-  'Permissions, Ethics, and Compliance',
-  'Data Handling',
-  'Signatures',
-  'Annexures',
-];
-
-const REQUIRED_WO_DECISIONS = [
-  'wo-number',
-  'wo-period-of-performance',
-  'wo-total-not-to-exceed-usd',
-  'wo-payment-schedule-split',
-];
-
-export function checkAllRequiredSectionsPresent(text: string): QACheckResult {
-  const missing = REQUIRED_SECTIONS.filter(
-    (s) => !new RegExp(`^#{1,3}\\s+\\d*\\.?\\s*${escapeRegex(s)}`, 'mi').test(text)
-  );
-  if (missing.length === 0) return { pass: true };
-  return {
-    pass: false,
-    detail: `Missing sections: ${missing.join(', ')}`,
-    auto_fix_hint: `Regenerate the missing sections (${missing.join(', ')}) with substantive content matching their purpose per templates/work-order-template.md.`,
-  };
-}
-
-export function checkRequiredWoDecisionsPresent(decisionsYamlText: string): QACheckResult {
-  const missing = REQUIRED_WO_DECISIONS.filter(
-    (id) => !new RegExp(`\\bid:\\s*${escapeRegex(id)}\\b`).test(decisionsYamlText)
-  );
-  if (missing.length === 0) return { pass: true };
-  return {
-    pass: false,
-    detail: `Missing wo-* decision rows: ${missing.join(', ')}`,
-    auto_fix_hint: `Append the missing rows to decisions.yaml with the AI's best inference values and status: applied or status: open per skills/pdd-to-work-order/SKILL.md § Process step 4.`,
-  };
-}
-
-export function checkPeriodOfPerformanceComplete(text: string): QACheckResult {
-  const headerLine = text.match(/Period of Performance\s*\|\s*([^\n|]+)/i)?.[1]?.trim() ?? '';
-  if (/\d{4}-\d{2}-\d{2}\s+to\s+\d{4}-\d{2}-\d{2}/i.test(headerLine) || /\[.*\]/.test(headerLine)) {
-    return { pass: true };
-  }
-  return {
-    pass: false,
-    detail: 'Period of Performance header lacks both start and end dates (or an explicit placeholder).',
-    auto_fix_hint: 'Set Period of Performance to "YYYY-MM-DD to YYYY-MM-DD" or to "[Period of Performance — TBD]" placeholder.',
-  };
-}
-
-export function checkPaymentScheduleSumsTo100(text: string): QACheckResult {
-  const section = text.split(/##\s*6\.2/i)[1]?.split(/##\s/i)[0] ?? '';
-  const percents = Array.from(section.matchAll(/(\d{1,3})\s*%/g)).map((m) => parseInt(m[1], 10));
-  if (percents.length === 0) {
-    return { pass: false, detail: 'No payment-schedule percentages found in section 6.2.', auto_fix_hint: 'Render the payment schedule table with milestone percentages totaling 100%.' };
-  }
-  const total = percents.reduce((a, b) => a + b, 0);
-  if (total === 100) return { pass: true };
-  return {
-    pass: false,
-    detail: `Payment-schedule percentages sum to ${total} (must sum to 100).`,
-    auto_fix_hint: 'Re-derive milestone percentages from the wo-payment-schedule-split decision and re-render.',
-  };
-}
-
-export function checkTotalNtePresent(text: string): QACheckResult {
-  const section = text.split(/##\s*6\.1/i)[1]?.split(/##\s/i)[0] ?? '';
-  if (/USD\s+(\d|\[)/i.test(section)) return { pass: true };
-  return {
-    pass: false,
-    detail: 'Total Not-to-Exceed (USD) value missing in section 6.1.',
-    auto_fix_hint: 'Insert "USD <amount>" using the wo-total-not-to-exceed-usd decision, or "USD [TBD]" placeholder.',
-  };
-}
-
-export function checkSignatureBlocksPresent(text: string): QACheckResult {
-  const hasSub = /\*\*Subcontractor\*\*/i.test(text);
-  const hasDimagi = /\*\*Dimagi.*\*\*/i.test(text);
-  if (hasSub && hasDimagi) return { pass: true };
-  const missing: string[] = [];
-  if (!hasSub) missing.push('Subcontractor');
-  if (!hasDimagi) missing.push('Dimagi');
-  return {
-    pass: false,
-    detail: `Missing signature block(s): ${missing.join(', ')}`,
-    auto_fix_hint: 'Re-add the missing signature block(s) per templates/work-order-template.md.',
-  };
-}
-
-export function checkArchetypeAppropriateScope(text: string, archetype: string): QACheckResult {
-  const scope = text.split(/##\s*2\.\s*Scope of Work/i)[1]?.split(/##\s/i)[0] ?? '';
-  if (archetype === 'atomic-visit') {
-    const hasVisit = /per visit|per-visit/i.test(scope);
-    const hasPhotoOrGps = /photo|gps/i.test(scope);
-    if (hasVisit && hasPhotoOrGps) return { pass: true };
-    return {
-      pass: false,
-      detail: 'atomic-visit scope must reference per-visit deliverables and photo/GPS capture.',
-      auto_fix_hint: 'Re-draft Scope of Work to include per-visit phrasing and photo + GPS evidence requirements.',
-    };
-  }
-  if (archetype === 'focus-group') {
-    const hasSession = /per session|per-session|attestation/i.test(scope);
-    const hasGdoc = /gdoc|google doc/i.test(scope);
-    if (hasSession && hasGdoc) return { pass: true };
-    return {
-      pass: false,
-      detail: 'focus-group scope must reference per-session attestation + gdoc write-up.',
-      auto_fix_hint: 'Re-draft Scope of Work to include per-session phrasing, attestation form submission, and gdoc write-up.',
-    };
-  }
-  if (archetype === 'multi-stage') {
-    const hasStage = /stage\s+1|stage\s+2|per stage/i.test(scope);
-    if (hasStage) return { pass: true };
-    return {
-      pass: false,
-      detail: 'multi-stage scope must reference at least one per-stage subsection.',
-      auto_fix_hint: 'Re-draft Scope of Work with explicit per-stage subsections.',
-    };
-  }
-  return { pass: false, detail: `Unknown archetype: ${archetype}`, auto_fix_hint: 'Declare archetype: atomic-visit | focus-group | multi-stage in the PDD frontmatter.' };
-}
-
-export function checkNoScaffoldingMarkers(text: string): QACheckResult {
-  const markers = text.match(/<<[^>]*>>/g);
-  if (!markers || markers.length === 0) return { pass: true };
-  return {
-    pass: false,
-    detail: `Leaked AI scaffolding markers: ${[...new Set(markers)].join(', ')}`,
-    auto_fix_hint: 'Resolve each <<...>> marker by filling in concrete content or replacing with a [Placeholder] bracket.',
-  };
-}
-
-function escapeRegex(s: string): string {
-  return s.replace(/[.*+?^${}()|[\]\\]/g, '\\$&');
-}
-
-export const CHECKS: QACheck[] = [
-  { id: 'all_required_sections_present', run: (ctx) => checkAllRequiredSectionsPresent(ctx.workOrderText) },
-  { id: 'required_wo_decisions_present', run: (ctx) => checkRequiredWoDecisionsPresent(ctx.decisionsYamlText) },
-  { id: 'period_of_performance_complete', run: (ctx) => checkPeriodOfPerformanceComplete(ctx.workOrderText) },
-  { id: 'payment_schedule_sums_to_100', run: (ctx) => checkPaymentScheduleSumsTo100(ctx.workOrderText) },
-  { id: 'total_nte_present', run: (ctx) => checkTotalNtePresent(ctx.workOrderText) },
-  { id: 'signature_blocks_present', run: (ctx) => checkSignatureBlocksPresent(ctx.workOrderText) },
-  { id: 'archetype_appropriate_scope', run: (ctx) => checkArchetypeAppropriateScope(ctx.workOrderText, ctx.archetype) },
-  { id: 'no_scaffolding_markers', run: (ctx) => checkNoScaffoldingMarkers(ctx.workOrderText) },
-];
-```
-
-If `lib/qa-types.ts`'s `QACheck` type does not support a context-bearing `run`, adapt the `CHECKS` shape to match what `idea-to-pdd-qa/checks.ts` uses (and ignore the context argument — check signatures may take only a string in that codebase). Read the existing `QACheck` interface from `lib/qa-types.ts` before finalizing.
-
-- [ ] **Step 9: Run the tests to verify they pass**
-
-Run: `npx vitest run test/skills/pdd-to-work-order-qa/checks.test.ts`
-Expected: ALL PASS.
-
-- [ ] **Step 10: Commit**
-
-```bash
-git add skills/pdd-to-work-order-qa/checks.ts test/skills/pdd-to-work-order-qa/
-git commit -m "feat(pdd-to-work-order-qa): static checks + vitest coverage"
-```
-
----
-
-## Task 5: Producer `SKILL.md`
-
-**Files:**
-- Create: `skills/pdd-to-work-order/SKILL.md`
-
-- [ ] **Step 1: Read the existing PDD producer skill for shape**
-
-Read `skills/idea-to-pdd/SKILL.md` (you already have it loaded) — mirror the sections: frontmatter, Inputs, Products, Process, Archetypes, MCP Tools Used, Mode Behavior, Dry-Run Behavior, Change Log.
-
-- [ ] **Step 2: Write the producer SKILL.md**
-
-```markdown
----
-name: pdd-to-work-order
-description: >
-  Draft a contractual Work Order from the approved PDD and the run's
-  decisions.yaml. Generic by default — partner identity is a placeholder
-  unless an LLO was supplied. Renders to a clean Google Doc. Parallel to
-  Phase 8 solicitation, not a replacement.
-disable-model-invocation: true
----
-
-# PDD to Work Order
-
-Take the approved PDD and decisions.yaml and produce a contractual Work Order draft, rendered as a clean Google Doc suitable for human review and signature.
-
-## Inputs
-
-| Source | Artifact | Used for |
-|---|---|---|
-| Phase 1 producer | `1-design/idea-to-pdd.md` | scope, deliverables, timeline, target population, success metrics, evidence model |
-| Phase 1 producer | `decisions.yaml` | load-bearing values (rate, FLW count, working language, candidate LLO, etc.) — read as-is |
-| Run-root | `inputs-manifest.yaml` | optional reference for partner identity if it was supplied as input |
-| Operator (optional) | `--llo <slug>` flag | overrides partner-name placeholder |
-
-## Products
-
-- `ACE/<opp-name>/runs/<run-id>/1-design/pdd-to-work-order.gdoc` — the work-order Google Doc (re-runs create `pdd-to-work-order-2.gdoc`, `pdd-to-work-order-3.gdoc`, etc.)
-- `run_state.yaml.phases.design.products.work_order` — `{title, file_id}` typed handoff. This skill is the sole writer.
-- Appended `wo-*` rows in `ACE/<opp-name>/runs/<run-id>/decisions.yaml` (merge-only — never overwrites existing rows).
-
-## Process
-
-1. **Read inputs in parallel.** Issue one `drive_read_file` block for the PDD, decisions.yaml, and inputs-manifest. Trust context across subsequent steps (do not re-read).
-
-2. **Determine archetype** from the PDD's frontmatter (`archetype: atomic-visit | focus-group | multi-stage`). The archetype branches the Scope of Work, Verification, Roles RACI, and Payment per-unit sections.
-
-3. **Resolve contractual fields.** For each work-order field, apply the inference order:
-
-   - (a) If an existing `decisions.yaml` row from an earlier skill covers it (e.g., `payment-rate`, `flw-count`, `working-language`, `budget-plausibility`), use that value as-is. Never duplicate or rename.
-   - (b) If inferable from PDD body (Timeline → period of performance; Success Metrics + Budget → NTE; etc.), use the inference and emit a new `wo-*` row capturing it.
-   - (c) If genuinely unknowable (partner name absent, WO# unknown, MSA date unknown), insert a bracketed placeholder like `[Partner Name]` in the gdoc and emit a `wo-*` row with `status: open` + `notes` telling the human what to fill in.
-
-   Common `wo-*` rows to emit when load-bearing:
-
-   | ID | Question | Map to surface |
-   |---|---|---|
-   | `wo-number` | Sequence number for this WO under the MSA | Header (placeholder if unknown) |
-   | `wo-period-of-performance` | Start + end dates | Header + Timeline section |
-   | `wo-total-not-to-exceed-usd` | Total NTE budget cap | Payment Terms section |
-   | `wo-payment-schedule-split` | Milestone payment percentages (e.g., 40/60) | Payment Schedule sub-table |
-   | `wo-mobilization-advance-pct` | Mobilization advance % of cap | Payment Schedule row 1 |
-   | `wo-reporting-cadence` | Frequency of progress reports (default: weekly) | Reporting sub-section |
-   | `wo-ethics-scope` | Operational-only vs patient-level | Ethics section |
-   | `wo-data-storage-region` | Server region for data storage (default: US) | Data Handling section |
-
-4. **Append `wo-*` rows to `decisions.yaml`** via `update_yaml_file` with merge-only semantics. Never overwrite existing rows. Required keys per row per `lib/decisions-schema.ts`: `id`, `phase: 1-design`, `skill: pdd-to-work-order`, `question`, `default`, `options_considered`, `source`, `status`. Optional `notes`.
-
-5. **Render the work-order template to a Google Doc.**
-   - `docs_copy_template(templateId=<WORK_ORDER_TEMPLATE_ID from env>, parent=<run-folder file_id>, name="Work Order — <opp-title>")`. If the run already has a `pdd-to-work-order.gdoc`, name the new one `Work Order — <opp-title> (#2)`, etc.
-   - `docs_batch_update` with token replacements. Tokens use `{{...}}` snake_case:
-     - `{{wo_number}}`, `{{opp_title}}`, `{{wo_date}}` (today, ISO), `{{wo_period_of_performance}}`
-     - `{{background_body}}` (synthesized from PDD's Problem Statement + Intervention Design + any named downstream consumer)
-     - `{{scope_body}}` (archetype-branched — see below)
-     - `{{geographic_coverage_body}}` (from PDD Target Population; `[Geographic Coverage — Partner to propose]` if not specified)
-     - `{{primary_deliverable_body}}`, `{{verified_unit_body}}` (from PDD Success Metrics + Evidence Model)
-     - `{{reporting_body}}` (from `wo-reporting-cadence`)
-     - `{{timeline_table}}` (markdown table from PDD Timeline)
-     - `{{wo_total_not_to_exceed_usd}}`, `{{payment_schedule_table}}`
-     - `{{roles_raci_table}}` (archetype-derived RACI)
-     - `{{permissions_body}}`, `{{ethics_body}}`, `{{data_handling_table}}`
-     - `{{pdd_link}}` (Drive URL of the PDD from `phases.design.products.pdd.file_id`)
-     - `{{annexure_b_placeholder}}` ("To be provided" if no opp-specific annexure)
-
-6. **Write `run_state.yaml.phases.design.products.work_order`** via `update_yaml_file` with `merge: 'two-level'`:
-
-   ```yaml
-   phases:
-     design:
-       products:
-         work_order:
-           title: "Work Order — <opp-title>"
-           file_id: <gdoc-id>
-   ```
-
-7. **Invoke `decisions-render`** so the human-readable `decisions.gdoc` refreshes with the new `wo-*` rows.
-
-## Archetypes
-
-### `atomic-visit` (default)
-- Scope: per-visit data capture with photo + GPS standardization.
-- Verification: photo + GPS Layer A on the deliver-app form.
-- Payment unit: per visit (rate from existing `payment-rate` decision).
-- Roles: Dimagi configures app + verification audit; Partner recruits FLWs, runs field ops, transports samples (if applicable).
-
-### `focus-group`
-- Scope: per-session facilitation with attestation form submission and gdoc write-up.
-- Verification: attestation submission Layer A + gdoc receipt Layer B; coordinator-graded practice-session-pass gates payment.
-- Payment unit: per session (facilitator + notetaker rate from existing `per-session-rate` decision); facilitator training stipend on practice-session-pass.
-- Roles: Dimagi configures OCS chatbot + attestation form + gdoc template; Partner recruits facilitators + notetakers, runs sessions, completes gdoc.
-
-### `multi-stage`
-- Scope: per-stage sub-section, each with its own archetype-shaped scope.
-- Verification: per-stage criteria reflecting the stage's archetype.
-- Payment: may mix per-visit and per-session units; stage-gate criteria from PDD.
-- Roles: per-stage RACI.
-
-## MCP Tools Used
-- Google Drive: `drive_read_file`, `drive_create_file`, `update_yaml_file`
-- Google Docs: `docs_copy_template`, `docs_batch_update`
-
-## Mode Behavior
-
-- **Default (auto):** infer all fields, draft the gdoc, append `wo-*` rows, write `products.work_order`, proceed.
-- **Review:** after the gdoc is written, pause and surface the gdoc URL for human approval before proceeding to the next phase.
-
-## Dry-Run Behavior
-
-When `--dry-run` is active:
-- Write the work-order gdoc as normal (Drive writes are reversible).
-- State tracks as `dry-run-success`.
-
-## Change Log
-
-| Date | Change | Author |
-|------|--------|--------|
-| 2026-05-21 | Initial version | ACE team |
-```
-
-- [ ] **Step 3: Verify SKILL.md frontmatter passes the skill catalog**
-
-Run: `npm test -- test/skills/` (or whatever covers skill catalog validation)
-Expected: PASS. If the catalog validator complains about a missing field, align frontmatter with `skills/idea-to-pdd/SKILL.md`.
-
-- [ ] **Step 4: Commit**
-
-```bash
-git add skills/pdd-to-work-order/SKILL.md
-git commit -m "feat(pdd-to-work-order): producer skill"
-```
-
----
-
-## Task 6: QA `SKILL.md`
-
-**Files:**
-- Create: `skills/pdd-to-work-order-qa/SKILL.md`
-
-- [ ] **Step 1: Write the QA SKILL.md**
-
-Mirror `skills/idea-to-pdd-qa/SKILL.md` (you've already read it). Frontmatter `name: pdd-to-work-order-qa`, `disable-model-invocation: true`. Include the 8-check table with `id`, `type: static`, `description`, `auto-fix on fail` columns matching `checks.ts § CHECKS`.
-
-```markdown
----
-name: pdd-to-work-order-qa
-description: >
-  Structural QA on the work-order artifact produced by pdd-to-work-order.
-  Binary pass/fail. Catches missing sections, missing wo-* decision rows,
-  malformed payment schedule, leaked scaffolding markers, etc. Static-only;
-  no LLM. Gates pdd-to-work-order-eval — eval is skipped if QA fails
-  irrecoverably.
-disable-model-invocation: true
----
-
-# PDD-to-Work-Order QA
-
-Structural correctness checks on the work-order artifact. Binary verdict: pass / fail / incomplete. Eight static checks, all runnable in <100ms via the importable `checks.ts` module — no LLM.
-
-See `skills/_qa-template.md` for the shared QA contract (verdict YAML format, auto-fix protocol, static-vs-LLM rules).
-
-## Inputs
-
-| Source | Artifact | Used for |
-|---|---|---|
-| Phase 1 producer | `1-design/pdd-to-work-order.gdoc` (latest) | the work order under structural check |
-| Phase 1 producer | `decisions.yaml` | required `wo-*` decision-row presence check |
-
-## Products
-
-- `1-design/pdd-to-work-order-qa_result.yaml` — QA result per `lib/qa-types.ts` schema
-
-## Checks
-
-| # | id | type | description | auto-fix on fail |
-|---|---|---|---|---|
-| 1 | `all_required_sections_present` | static | All 11 required work-order sections present (Background, Scope of Work, Geographic Coverage, Deliverables and Verification, Timeline and Milestones, Payment Terms, Roles and Responsibilities, Permissions/Ethics/Compliance, Data Handling, Signatures, Annexures). | regenerate the missing section(s) with substantive content per templates/work-order-template.md |
-| 2 | `required_wo_decisions_present` | static | All four required `wo-*` rows present in decisions.yaml: `wo-number`, `wo-period-of-performance`, `wo-total-not-to-exceed-usd`, `wo-payment-schedule-split`. | append the missing rows with AI's best inference + status: applied/open |
-| 3 | `period_of_performance_complete` | static | Header's Period of Performance shows both start and end dates (or explicit placeholder). | render Period of Performance as "YYYY-MM-DD to YYYY-MM-DD" or "[Period of Performance — TBD]" |
-| 4 | `payment_schedule_sums_to_100` | static | Milestone percentages in section 6.2 sum to 100. | re-derive percentages from `wo-payment-schedule-split` decision and re-render |
-| 5 | `total_nte_present` | static | Total Not-to-Exceed USD value present in section 6.1 (number or `[Placeholder]`). | insert "USD <amount>" from `wo-total-not-to-exceed-usd` or `USD [TBD]` |
-| 6 | `signature_blocks_present` | static | Both `**Subcontractor**` and `**Dimagi, Inc.**` signature blocks present. | re-add missing block per templates/work-order-template.md |
-| 7 | `archetype_appropriate_scope` | static | Scope of Work language matches declared archetype: atomic-visit references per-visit + photo/GPS; focus-group references per-session + attestation + gdoc; multi-stage references per-stage subsections. | re-draft Scope of Work to match archetype |
-| 8 | `no_scaffolding_markers` | static | No leaked `<<...>>` AI scaffolding markers in the work-order body. | resolve each marker with concrete content or `[Placeholder]` bracket |
-
-The static check functions live at `skills/pdd-to-work-order-qa/checks.ts` as importable TS. Every check returns a `QACheckResult` (`{pass, detail?, auto_fix_hint?}`) per `lib/qa-types.ts`.
-
-**Adding a check:** append to the `CHECKS` array in `checks.ts`, add a row to the table above (matching `id`), add a unit test in `test/skills/pdd-to-work-order-qa/checks.test.ts`.
-
-## Process
-
-1. **Read the work-order artifact.** Resolve the latest `pdd-to-work-order.gdoc` (the one referenced by `phases.design.products.work_order.file_id` in `run_state.yaml`). Read its body via `drive_read_file`.
-
-2. **Read decisions.yaml** via `drive_read_file`.
-
-3. **Read PDD archetype** from `run_state.yaml.phases.design.products.pdd` (or read the PDD body and parse the `archetype:` frontmatter line).
-
-4. **Save artifact bodies to local temp paths** so the CLI runner can invoke `checks.ts`:
-   ```bash
-   TMP_WO=$(mktemp); TMP_DEC=$(mktemp)
-   # write drive contents to $TMP_WO and $TMP_DEC
-   ```
-
-5. **Invoke the check runner** that imports `checks.ts § CHECKS` and runs each against `{workOrderText, decisionsYamlText, archetype}`. Output: a `QACheckResult[]` aligned with the `CHECKS` array.
-
-6. **Compose and write the verdict YAML** to `1-design/pdd-to-work-order-qa_result.yaml` per the QA verdict schema (`lib/qa-types.ts`). `verdict: pass` iff every check passes; `verdict: fail` with `failures[]` array otherwise (each entry: `{check, detail, auto_fix_hint}`). `verdict: incomplete` if a check could not be evaluated (e.g., decisions.yaml unreadable).
-
-7. **Trigger the producer-retry loop on `verdict: fail`** per `agents/idea-to-design.md § Step 2.4`. After retry: re-run QA. Halt with `verdict: incomplete` when the producer can no longer make progress on the same failures.
-
-## Change Log
-
-| Date | Change | Author |
-|------|--------|--------|
-| 2026-05-21 | Initial version | ACE team |
-```
-
-- [ ] **Step 2: Add the QA skill row to `skills/_qa-decisions.md`**
-
-Read `skills/_qa-decisions.md`. Find the table that registers per-producer QA status. Add a new row for `pdd-to-work-order` with status `has-qa` and rationale: "Static checks live; structural correctness is enforceable without an LLM (section presence, decision-row presence, payment-schedule arithmetic, signature blocks, scaffolding markers). Eval grades the substantive concerns."
-
-- [ ] **Step 3: Commit**
-
-```bash
-git add skills/pdd-to-work-order-qa/SKILL.md skills/_qa-decisions.md
-git commit -m "feat(pdd-to-work-order-qa): SKILL.md + registry entry"
-```
-
----
-
-## Task 7: Eval `SKILL.md`
-
-**Files:**
-- Create: `skills/pdd-to-work-order-eval/SKILL.md`
-
-- [ ] **Step 1: Read the existing eval skill for shape**
-
-Read `skills/idea-to-pdd-eval/SKILL.md` (use the Read tool) — mirror the frontmatter, dimension table, scoring scale, verdict shape, and process steps.
-
-- [ ] **Step 2: Write the eval SKILL.md**
-
-```markdown
----
-name: pdd-to-work-order-eval
-description: >
-  Independent quality re-grade for the Work Order produced by
-  pdd-to-work-order. LLM-as-Judge, five quality dimensions: contractual
-  clarity, PDD alignment, decisions traceability, verification realism,
-  archetype fit. Skipped if pdd-to-work-order-qa returned verdict:
-  incomplete. Verdict shape per lib/verdict-schema.ts.
-disable-model-invocation: true
----
-
-# PDD-to-Work-Order Eval
-
-LLM-as-Judge quality re-grade. Five dimensions, each scored `pass | partial | fail` with cited evidence from the work-order body and `decisions.yaml`. Two or more non-pass dimensions → `verdict: fail`. A `verdict: fail` here does NOT halt the run on its own — `[BLOCKER]` concerns pause per the orchestrator's Per-Mode Pause Matrix.
-
-If `pdd-to-work-order-qa` returned `verdict: incomplete`, this skill is **skipped** and emits `verdict: incomplete` mirroring QA's outcome.
-
-## Inputs
-
-| Source | Artifact | Used for |
-|---|---|---|
-| Phase 1 producer | `1-design/pdd-to-work-order.gdoc` (latest) | the artifact under quality re-grade |
-| Phase 1 producer | `1-design/idea-to-pdd.md` | source-of-truth for PDD alignment check |
-| Phase 1 producer | `decisions.yaml` | source-of-truth for decisions traceability check |
-| Phase 1 QA | `1-design/pdd-to-work-order-qa_result.yaml` | gating signal |
-
-## Products
-
-- `1-design/pdd-to-work-order-eval_verdict.yaml` — verdict per `lib/verdict-schema.ts`
-
-## Dimensions
-
-Each dimension is scored `pass | partial | fail` with a 1-3 sentence rationale citing specific evidence from the artifacts. Two or more non-pass → `verdict: fail`.
-
-### 1. Contractual clarity
-*Could the named partner sign this draft without coming back for clarification on scope, deliverables, payment, or roles?*
-
-Common failure modes: scope describes the intervention but omits unit definitions ("samples" without saying what counts as a verified sample); deliverables reference verification criteria that aren't enumerated anywhere; payment per unit not stated; roles RACI omits responsibilities for sample storage or transport.
-
-### 2. PDD alignment
-*Do the scope, deliverables, timeline, and payment trace back to the PDD?*
-
-Common failure modes: scope expands beyond PDD ("includes patient-level data collection" when PDD is operational-only); timeline contradicts PDD Timeline section; geographic coverage adds regions the PDD doesn't mention; payment per unit doesn't match the PDD's `payment-rate` decision.
-
-### 3. Decisions traceability
-*Do the contractual numerics in the work order match the corresponding rows in `decisions.yaml`?*
-
-Common failure modes: per-visit rate in section 6 differs from `payment-rate` decision row; FLW count in roles section differs from `flw-count` decision row; period of performance in header differs from `wo-period-of-performance` decision row; total NTE in section 6.1 differs from `wo-total-not-to-exceed-usd`.
-
-### 4. Verification realism
-*Are the "verified unit" criteria in section 4.2 actually measurable on the Connect platform?*
-
-Common failure modes: criterion requires data not captured by the Connect app (e.g., "temperature logged during transit" without a temperature field); criterion requires audit data the platform doesn't expose; criterion is subjective ("delivered in good condition") without an audit mechanism.
-
-### 5. Archetype fit
-*Does the work-order shape match the declared archetype?*
-
-Common failure modes: declared archetype is `focus-group` but scope describes per-visit data collection; declared archetype is `atomic-visit` but payment schedule is per-session; multi-stage PDD with a single-stage work order.
-
-## Process
-
-1. **Check the gating signal.** Read `pdd-to-work-order-qa_result.yaml`. If `verdict: incomplete`, emit `pdd-to-work-order-eval_verdict.yaml` with `verdict: incomplete` and return. If `verdict: fail`, proceed (QA's failures are auto-fixable; eval still grades the substantive concerns of the latest draft).
-
-2. **Read the artifacts.** Work order body, PDD body, decisions.yaml. Parallel `drive_read_file` block.
-
-3. **Grade each dimension.** For each of the five dimensions:
-   - State the dimension question.
-   - Quote 1-3 specific pieces of evidence from the work order, PDD, or decisions.yaml.
-   - Assign `pass | partial | fail` with a 1-3 sentence rationale.
-
-4. **Compute the verdict.** `verdict: pass` if all dimensions pass. `verdict: partial` if exactly one is non-pass. `verdict: fail` if two or more are non-pass.
-
-5. **Surface blockers.** Add a `concerns[]` array for any dimension grading `fail` where the underlying gap could compromise the contract's enforceability (e.g., verification criteria that aren't measurable, scope mismatches with PDD). Mark such entries `severity: blocker` so the orchestrator surfaces them at the Phase 1→2 pause.
-
-6. **Write the verdict YAML** to `1-design/pdd-to-work-order-eval_verdict.yaml` per `lib/verdict-schema.ts`. Required keys: `skill`, `run_id`, `verdict`, `dimensions[]` (with `id`, `grade`, `evidence[]`, `rationale`), `concerns[]` (optional), `summary`.
-
-## Change Log
-
-| Date | Change | Author |
-|------|--------|--------|
-| 2026-05-21 | Initial version | ACE team |
-```
-
-- [ ] **Step 3: Commit**
-
-```bash
-git add skills/pdd-to-work-order-eval/SKILL.md
-git commit -m "feat(pdd-to-work-order-eval): quality re-grade rubric"
-```
-
----
-
-## Task 8: Wire the new skills into the `idea-to-design` agent
-
-**Files:**
-- Modify: `agents/idea-to-design.md`
-
-- [ ] **Step 1: Update the frontmatter `skills:` array**
-
-Find the existing frontmatter block in `agents/idea-to-design.md`:
-
-```yaml
-skills:
-  - { name: idea-to-pdd, has_judge: true, qa_skill: idea-to-pdd-qa, eval_skill: idea-to-pdd-eval }
-```
-
-Replace with:
-
-```yaml
-skills:
-  - { name: idea-to-pdd, has_judge: true, qa_skill: idea-to-pdd-qa, eval_skill: idea-to-pdd-eval }
-  - { name: pdd-to-work-order, has_judge: true, qa_skill: pdd-to-work-order-qa, eval_skill: pdd-to-work-order-eval }
-```
-
-- [ ] **Step 2: Append Step 2 / 2.4 / 2.5 to the workflow body**
-
-Find the existing `### Completion` section near the end. Insert the following block immediately before `### Completion`:
-
-```markdown
-### Step 2: PDD → Work Order
-Invoke the `pdd-to-work-order` skill.
-- Inputs (already in subagent context from Step 1 — do NOT re-read):
-  - `ACE/<opp-name>/runs/<run-id>/1-design/idea-to-pdd.md` (the PDD)
-  - `ACE/<opp-name>/runs/<run-id>/decisions.yaml` (load-bearing decisions)
-- Output:
-  - `ACE/<opp-name>/runs/<run-id>/1-design/pdd-to-work-order.gdoc` (re-runs create `pdd-to-work-order-2.gdoc`, etc.)
-  - `run_state.yaml.phases.design.products.work_order` block
-  - Appended `wo-*` rows in `decisions.yaml` (merge-only)
-- **Gate (review mode):** present the work-order URL for approval before continuing.
-
-### Step 2.4: PDD-to-Work-Order QA (structural pass/fail)
-
-Invoke the `pdd-to-work-order-qa` skill — runs 8 static structural checks against the produced work order.
-
-- Input:
-  - `ACE/<opp-name>/runs/<run-id>/1-design/pdd-to-work-order.gdoc` (latest)
-  - `ACE/<opp-name>/runs/<run-id>/decisions.yaml`
-- Output: `ACE/<opp-name>/runs/<run-id>/1-design/pdd-to-work-order-qa_result.yaml`
-- **QA gates eval:** if `verdict: fail`, dispatch the producer with each `failures[].auto_fix_hint`, then re-run QA. Halt with `verdict: incomplete` when the producer can no longer make progress on the same failures. NEVER silently proceed to eval when QA failed.
-
-### Step 2.5: PDD-to-Work-Order eval (independent quality re-grade)
-Unless `--no-evals` was passed AND QA verdict is `pass`, invoke the `pdd-to-work-order-eval` skill.
-- Inputs: work-order gdoc + PDD + decisions.yaml (all in subagent context).
-- Output: `ACE/<opp-name>/runs/<run-id>/1-design/pdd-to-work-order-eval_verdict.yaml`
-- If QA verdict was `incomplete`, this step is **skipped** (eval emits `verdict: incomplete`).
-```
-
-- [ ] **Step 3: Update the `### Completion` section**
-
-Find:
-```
-Write phase summary to `ACE/<opp-name>/runs/<run-id>/1-design/idea-to-design_summary.md`,
-```
-
-Insert after the summary path is computed (or in the summary content itself) a line listing both the PDD URL and the work-order URL. Concretely, update the summary contract:
-
-```
-The summary now lists both:
-- PDD: `phases.design.products.pdd.file_id` (Drive URL)
-- Work Order: `phases.design.products.work_order.file_id` (Drive URL)
-```
-
-- [ ] **Step 4: Verify the agent doc lints/validates**
-
-Run any agent-frontmatter validator:
-```
-npm test -- agents/
-```
-Or grep for the catalog scripts:
-```
-grep -l "skills:" scripts/*.ts | head -3
-```
-If there's a `verify-agents.ts` or similar, run it. Expected: pass.
-
-- [ ] **Step 5: Commit**
-
-```bash
-git add agents/idea-to-design.md
-git commit -m "feat(idea-to-design): wire pdd-to-work-order skill triple"
-```
-
----
-
-## Task 9: Update `.env.tpl` + operator-facing docs
-
-**Files:**
-- Modify: `.env.tpl`
-- Create: `playbook/integrations/work-order-template.md`
-
-- [ ] **Step 1: Add `WORK_ORDER_TEMPLATE_ID` to `.env.tpl`**
-
-Find the existing `ACE_TRAINING_DECK_TEMPLATE_ID` line (around line 211). Add an adjacent entry following the same 1Password reference pattern. If the 1P field doesn't exist yet, document where to create it:
-
-```
-# Work-order template (Google Doc); see playbook/integrations/work-order-template.md to bootstrap.
-WORK_ORDER_TEMPLATE_ID=op://AI-Agents/ACE - Open Chat Studio/Config/work_order_template_id
-```
-
-- [ ] **Step 2: Write the operator-facing bootstrap docs**
-
-Create `playbook/integrations/work-order-template.md`:
-
-```markdown
-# Work-Order Template — Bootstrap
-
-The `pdd-to-work-order` skill renders Work Orders by copying a Google Doc template and replacing `{{...}}` tokens. The template is per-deployment Drive state, not committed to the repo. This page documents how to provision it.
-
-## One-time bootstrap
-
-```bash
-# Ensure ACE_TEMPLATES_FOLDER_ID is set (the Drive folder where ACE keeps its templates)
-ACE_TEMPLATES_FOLDER_ID=<folder id> npx tsx scripts/bootstrap-work-order-template.ts
-```
-
-The script:
-1. Reads `templates/work-order-template.md` (canonical content).
-2. Uploads it to Drive as a Google Doc named "ACE Work Order Template".
-3. Prints the resulting file_id to stdout.
-
-Record the file_id in 1Password at `AI-Agents/ACE - Open Chat Studio/Config/work_order_template_id`, then re-run `op inject -i .env.tpl -o $CLAUDE_PLUGIN_DATA/.env --force` (or `/ace:setup --force-env`).
-
-## Refresh
-
-To replace an existing template with the latest `templates/work-order-template.md`:
-
-```bash
-ACE_TEMPLATES_FOLDER_ID=<folder id> WORK_ORDER_BOOTSTRAP_FORCE=1 \
-  npx tsx scripts/bootstrap-work-order-template.ts
-```
-
-The old template is trashed (recoverable for 30 days in Drive) and a new one is created. Record the new file_id in 1Password.
-
-## Token contract
-
-The skill replaces these `{{...}}` tokens in the template:
-
-| Token | Source |
-|---|---|
-| `{{wo_number}}` | `wo-number` decision (placeholder if open) |
-| `{{opp_title}}` | PDD H1 |
-| `{{wo_date}}` | today (ISO) |
-| `{{wo_period_of_performance}}` | `wo-period-of-performance` decision |
-| `{{background_body}}` | PDD Problem Statement + Intervention Design |
-| `{{scope_body}}` | Archetype-branched |
-| `{{geographic_coverage_body}}` | PDD Target Population |
-| `{{primary_deliverable_body}}` | PDD Success Metrics + Evidence Model |
-| `{{verified_unit_body}}` | PDD Evidence Model Layer A |
-| `{{reporting_body}}` | `wo-reporting-cadence` |
-| `{{timeline_table}}` | PDD Timeline |
-| `{{wo_total_not_to_exceed_usd}}` | `wo-total-not-to-exceed-usd` decision |
-| `{{payment_schedule_table}}` | `wo-payment-schedule-split` + `wo-mobilization-advance-pct` |
-| `{{roles_raci_table}}` | Archetype-derived |
-| `{{permissions_body}}`, `{{ethics_body}}` | Template defaults + PDD scope |
-| `{{data_handling_table}}` | Template defaults + PDD data-subject treatment |
-| `{{pdd_link}}` | `phases.design.products.pdd.file_id` URL |
-| `{{annexure_b_placeholder}}` | "To be provided" if no opp-specific annexure |
-
-Editing the template adds or removes tokens — make sure the skill's `## Process step 5` lists every token the template uses.
-```
-
-- [ ] **Step 3: Commit**
-
-```bash
-git add .env.tpl playbook/integrations/work-order-template.md
-git commit -m "feat(env+docs): wire WORK_ORDER_TEMPLATE_ID + bootstrap docs"
-```
-
----
-
-## Task 10: End-to-end smoke validation against a real PDD
-
-**Files:**
-- (no new files)
-
-- [ ] **Step 1: Pick a recent run with an approved PDD**
-
-Run:
-```bash
-ls -la $CLAUDE_PLUGIN_DATA 2>/dev/null || echo "no plugin data dir"
-```
-
-Identify a recent opp + run from `/ace:status`:
-```bash
-/ace:status
-```
-
-Pick one whose PDD is in `1-design/idea-to-pdd.md`.
-
-- [ ] **Step 2: Invoke the new skill via `/ace:step`**
-
-```bash
-/ace:step pdd-to-work-order <opp-name>/<run-id>
-```
-
-Expected:
-- `1-design/pdd-to-work-order.gdoc` appears in Drive.
-- `decisions.yaml` has new `wo-*` rows.
-- `run_state.yaml.phases.design.products.work_order` is populated.
-
-- [ ] **Step 3: Eyeball the generated gdoc**
-
-Open the gdoc in the browser. Verify:
-- All 11 sections present and substantive.
-- Period of Performance has dates (or `[Placeholder]`).
-- Payment schedule percentages sum to 100.
-- Signature blocks present.
-- Scope of Work language matches the PDD's declared archetype.
-
-- [ ] **Step 4: Invoke `pdd-to-work-order-qa` via `/ace:step`**
-
-```bash
-/ace:step pdd-to-work-order-qa <opp-name>/<run-id>
-```
-
-Expected: `verdict: pass` in `1-design/pdd-to-work-order-qa_result.yaml`. If any check fails, capture the failure and either fix the producer (Step 5 token list, archetype branching) or the check (false positive in `checks.ts`).
-
-- [ ] **Step 5: Invoke `pdd-to-work-order-eval` via `/ace:step`**
-
-```bash
-/ace:step pdd-to-work-order-eval <opp-name>/<run-id>
-```
-
-Expected: `verdict: pass` or `verdict: partial` with cited evidence per dimension. Read the verdict; if any dimension is `fail`, audit whether the work-order body genuinely fails that dimension or whether the rubric needs tightening.
-
-- [ ] **Step 6: Capture findings (no commit unless code changed)**
-
-If the smoke test surfaced any class-level bug, file an issue or open a follow-up PR. If everything ran clean, this is a no-op step.
-
----
-
-## Task 11: Ship the change
-
-**Files:**
-- Modify: `VERSION` (via `scripts/version-bump.sh`)
-
-- [ ] **Step 1: Bump version**
-
-Run:
-```bash
-bash scripts/version-bump.sh
-```
-
-This writes `VERSION` + syncs the three sibling JSON files via the pre-commit hook on the next commit. The script picks `max(local, origin) + patch+1`.
-
-- [ ] **Step 2: Commit the version bump**
-
-```bash
-git add VERSION .claude-plugin/plugin.json .claude-plugin/marketplace.json package.json
-git commit -m "chore: bump VERSION for pdd-to-work-order skill"
-```
-
-- [ ] **Step 3: Push and open the PR**
-
-```bash
-git push -u origin emdash/work-order-1rjnp
-gh pr create --title "feat(phase-1): pdd-to-work-order skill triple" --body "$(cat <<'EOF'
-## Summary
-- New Phase 1 skill `pdd-to-work-order` drafts a contractual Work Order from the approved PDD and the run's decisions.yaml.
-- `wo-*` prefix for work-order-specific decision rows; existing decisions (`payment-rate`, `flw-count`, etc.) are read as-is.
-- QA + eval companions: 8 static structural checks + 5-dimension LLM-as-Judge quality re-grade.
-- Auto-invoked in `/ace:run` as Step 2 of the `idea-to-design` agent.
-- Generic by default — `[Partner Name]` placeholder unless an LLO is supplied. Parallel to Phase 8 solicitation, not a replacement.
-
-Spec: `docs/superpowers/specs/2026-05-21-work-order-skill-design.md`
-Plan: `docs/superpowers/plans/2026-05-21-work-order-skill.md`
-
-## Test plan
-- [ ] `npx vitest run test/skills/pdd-to-work-order-qa/checks.test.ts` passes (8 check unit tests).
-- [ ] `npm test` overall still green (no regressions in manifest tests).
-- [ ] `npx tsc --noEmit scripts/bootstrap-work-order-template.ts` clean.
-- [ ] Smoke test: `/ace:step pdd-to-work-order <opp>/<run>` against a recent approved PDD produces a clean work-order gdoc with expected sections and tokens replaced.
-- [ ] QA `/ace:step pdd-to-work-order-qa <opp>/<run>` returns `verdict: pass` on the smoke-test artifact.
-- [ ] Eval `/ace:step pdd-to-work-order-eval <opp>/<run>` returns `verdict: pass` or `verdict: partial` with cited evidence.
-
-🤖 Generated with [Claude Code](https://claude.com/claude-code)
-EOF
-)"
-```
-
-- [ ] **Step 4: Arm auto-merge**
-
-```bash
-gh pr merge --auto --merge
-```
-
-- [ ] **Step 5: Wait for the merge, then update the local plugin**
-
-After the PR merges (watch via `gh pr view --json state,mergedAt` if needed), run in this session:
-
-```
-/ace:update
-```
-
-If the change includes any MCP-server-side changes (it does not in this PR), also `/reload-plugins`.
-
----
-
-## Notes for the implementing engineer
-
-- **Skill files are stateless LLM-instruction documents.** Don't try to write Python/TS code "for" a SKILL.md — the LLM follows the instructions at runtime. `checks.ts` is the only TS file; it's pure functions.
-- **`docs_copy_template` + `docs_batch_update`** are MCP atoms — use them via the gdrive MCP, not direct Google API calls in skill code. Skill code uses the MCP layer; only the bootstrap script touches Google API directly.
-- **`decisions.yaml` merge semantics.** Use `update_yaml_file` with `merge: 'two-level'` (or whatever the existing pattern is — check how `idea-to-pdd` writes its decisions). Never overwrite existing rows. If a `wo-*` row already exists from a prior run of this skill, leave it (the new run picks the next free WO# variant from the gdoc name).
-- **Re-run semantics.** Each `/ace:step pdd-to-work-order` creates a NEW gdoc (`pdd-to-work-order-2.gdoc`, `pdd-to-work-order-3.gdoc`, ...) and updates `phases.design.products.work_order.file_id` to point at the latest. Older gdocs stay in the run folder as audit trail.
-- **Dimagi signatory is hardcoded** (Lucina Tse, COO) in the template — not parameterized. If a second signatory is ever needed, that's a follow-up PR that templatizes the Dimagi signature block.
-- **Phase 8 still runs.** This skill is parallel to the solicitation, not a replacement. The work order may be signed independently after the solicitation is formally "awarded" to the pre-named LLO.
-
----
-
-## Self-review checklist (run before handoff)
-
-- [ ] **Spec coverage.** Every section in `docs/superpowers/specs/2026-05-21-work-order-skill-design.md` is implemented by at least one task:
-  - Skill identity, inputs, outputs → Task 5
-  - Decisions log convention (wo-* rows) → Tasks 5, 6 (QA check 2)
-  - Process steps → Task 5
-  - Section template → Task 2 (template content), Task 5 (skill renders it)
-  - Archetype branching → Task 5 (skill), Task 4 (check 7)
-  - QA 8 checks → Task 4 (TDD) + Task 6 (SKILL.md table)
-  - Eval 5 dimensions → Task 7
-  - Agent integration → Task 8
-  - Write-back contract → Tasks 1 (manifest), 5 (skill writes `products.work_order`)
-  - Template provisioning → Tasks 2 (canonical content), 3 (bootstrap script), 9 (operator docs)
-  - Re-run semantics → Task 5 (skill creates new gdoc each time), Task 8 (Notes)
-  - Dimagi signatory hardcoded → Task 2 (template), Task 5 (skill doesn't template it)
-- [ ] **Placeholder scan.** No "TBD" / "fill in later" / "similar to Task N" — every step has concrete code or content.
-- [ ] **Type consistency.** Check IDs in Task 4 (`all_required_sections_present`, etc.) match Task 6 SKILL.md table and Task 1 manifest descriptions.
-- [ ] **Frequent commits.** Each task ends with `git commit`; no task batches multiple unrelated changes.
diff --git a/docs/superpowers/specs/2026-05-04-ace-solicitations-phase-design.md b/docs/superpowers/specs/2026-05-04-ace-solicitations-phase-design.md
deleted file mode 100644
index faaf6676..00000000
--- a/docs/superpowers/specs/2026-05-04-ace-solicitations-phase-design.md
+++ /dev/null
@@ -1,378 +0,0 @@
-# ACE Solicitations Phase — Design
-
-**Status:** Draft
-**Date:** 2026-05-04
-**Author:** ACE
-**Related:** [`docs/superpowers/specs/2026-04-28-ace-connect-mcp-design.md`](2026-04-28-ace-connect-mcp-design.md) (the sibling MCP that this work does **not** extend), [`playbook/integrations/connect-api.md`](../../../playbook/integrations/connect-api.md), connect-labs `commcare_connect/mcp/` source, connect-labs `user_docs/solicitations.md`
-
-## Problem
-
-Today ACE's lifecycle assumes the PDD already names which LLO will run an opportunity. Phase 7 (`llo-management`) opens with an `llo-invite` skill that builds a candidate roster from the PDD's preferred-LLO list and pushes the opportunity directly to those orgs.
-
-That assumption holds for the early dogfood opps but does not hold long-term. The product direction is: **ACE always asks LLOs to fill out a solicitation before being awarded an opportunity**. We won't know who the LLO is until a solicitation closes and a winner is awarded. Even when we *do* know who we want short-term, we want them to go through the solicitation flow as a functional test of the path real LLOs will take.
-
-The current pipeline can't represent that:
-1. There is no place in the agent topology where a solicitation is published.
-2. The Connect API surface ACE talks to (`connect.dimagi.com` via `ace-connect`) does not own solicitations — they live in Connect Labs at `labs.connect.dimagi.com`, a separate Django deployment.
-3. Phase 7's `llo-invite` is wired to push opportunities directly to a known roster, not invitations to a public solicitation.
-
-## Goal
-
-Insert a new **Phase 7 — Solicitation Management** between Phase 5 (qa-and-training) and the existing LLO-management work. Renumber: existing Phase 7 becomes Phase 8 (renamed **Execution Management**); existing Phase 8 becomes Phase 9 (Closeout, unchanged).
-
-Phase 7 publishes a solicitation derived from the PDD, optionally invites PDD-named LLO candidates to the solicitation by email, and stops there in default `/ace:run` mode. The review-and-award lifecycle continues via manually-invoked skills, gated by a hard human-in-the-loop checkpoint before the irreversible `award_response` call.
-
-Phase 8 (Execution Management) consumes the awarded org from `opp.yaml` and proceeds with onboarding, UAT, go-live, and recurring monitoring — the same skills it has today, minus `llo-invite` (moved to Phase 7).
-
-**Non-goals:**
-- Extending `ace-connect` with solicitation atoms. Solicitations live in connect-labs; ACE consumes the existing remote MCP at `labs.connect.dimagi.com/mcp/` rather than re-implementing the surface.
-- Auto-awarding. The award call (`award_response`) writes a fund allocation and is irreversible from ACE's perspective; it stays HITL.
-- A migration path for in-flight opps' `run_state.yaml` files. Operations gate: finish in-flight opps on the old code, then merge.
-
-## Phase topology
-
-```
-1. design-review            (unchanged)
-2. commcare-setup           (unchanged)
-3. connect-setup            (unchanged)
-4. ocs-setup                (unchanged)
-5. qa-and-training          (unchanged)
-6. solicitation-management  ← NEW
-7. execution-management     ← was llo-management (Phase 7)
-8. closeout                 (was Phase 8)
-```
-
-**Agent forms** (per the level-0 dispatch invariant in `CLAUDE.md` § Agent topology):
-- `solicitation-management` is a **subagent** (does not call `Agent`). Same shape as `ocs-setup`, `qa-and-training`, `closeout`. Dispatched by `ace-orchestrator` (the existing procedure doc) via `Agent(solicitation-management)` from level 0.
-- `execution-management` is a subagent (rename of `llo-manager`).
-- No procedure-doc agents added.
-
-**Pause-points (updated `pause-points` framework in `agents/orchestrator-reference.md` § Pause Points):**
-- Phase 5 → 6: no longer a mandatory pause. The CLAUDE.md prose flagging Phase 5→6 as "always pause / external-communication boundary" moves to Phase 7→8.
-- **Phase 7 runs autonomously start to finish** (publish solicitation + send invites). No internal HITL gate.
-- Phase 7 → 7: **the new external-communication boundary**. `/ace:run` halts here in default mode. Phase 8 will not start until `opp.yaml.selected_llo.org_slug` is populated, which only happens via the manual `solicitation-review` skill.
-- After `solicitation-review`: HITL gate before `award_response` is called. (Manual flow only — not in default `/ace:run`.)
-- Phase 8 internal pauses (`llo-launch`) unchanged.
-
-## Architecture & MCP integration
-
-**ACE consumes the existing remote `connect_labs` MCP.** It already exposes solicitation, response, review, and fund tools (verified live in `connect-labs/commcare_connect/mcp/tools/solicitations.py`). No new MCP server, no new atoms in `ace-connect`.
-
-**Plugin manifest entry** (`/.claude-plugin/plugin.json`, under `mcpServers`):
-
-```jsonc
-"connect_labs": {
-  "type": "http",
-  "url": "https://labs.connect.dimagi.com/mcp/",
-  "headers": {
-    "Authorization": "Bearer ${LABS_MCP_TOKEN}"
-  }
-}
-```
-
-**Implementation hedge.** Whether `${LABS_MCP_TOKEN}` substitution works in `mcpServers.headers` is uncertain — `CLAUDE.md` documents historical breakage with `${CLAUDE_PLUGIN_DATA}` / `${CLAUDE_PLUGIN_ROOT}` substitution in plugin-root `.mcp.json` (worked around by inlining in `plugin.json` since 0.5.16). The plan step that wires this up should test the substitution path first; if it doesn't work, fall back to reading `LABS_MCP_TOKEN` from `${CLAUDE_PLUGIN_DATA}/.env` at MCP-server-startup time and injecting the header in code, mirroring how `ace-ocs` already handles credentials.
-
-**Auth is Bearer PAT.** Verified by reading `connect-labs/commcare_connect/mcp/auth.py`: the endpoint expects `Authorization: Bearer <raw_token>` against the `MCPAccessToken` Django model. There is no OAuth on the wire today. The OAuth bridge described in connect-labs documentation is a *backend* call chain — once the PAT identifies the labs user, individual tool handlers call `require_connect_token(user)` to look up *that user's* stored Connect access token and use it server-to-server. ACE's responsibility is the PAT only.
-
-**One-time setup.** A labs admin (or a labs user with shell access) runs:
-
-```bash
-python manage.py mcp_create_token --user ace@dimagi-ai.com --name ACE-plugin --ttl-days 0
-```
-
-The raw token is dropped into the 1Password item `op://Dimagi/labs-mcp-pat-ace/credential`. The labs `ace@dimagi-ai.com` account must also have completed Connect OAuth linkage at least once (so `require_connect_token(user)` succeeds inside tool handlers).
-
-**`.env.tpl` adds one line:**
-
-```
-LABS_MCP_TOKEN=op://Dimagi/labs-mcp-pat-ace/credential
-```
-
-**`/ace:setup` does not grow an interactive flow.** ACE is a service identity; one long-lived PAT (`--ttl-days 0`) injected via `op inject` is the right shape. Connect-labs's interactive `labs-token-setup` skill is built for individual humans and is not vendored.
-
-**`/ace:doctor` adds a `[Connect Labs]` section** with three checks:
-
-1. `connect_labs_env` — `LABS_MCP_TOKEN` present and non-empty in installed `.env`. Same drift-detection pattern as the existing `[Connect]` section.
-2. `connect_labs_mcp_reachable` — `POST` an `initialize` JSON-RPC frame with the PAT. 200/204 = OK; 401 = PAT bad; 5xx/timeout = labs is down. Class-level preventer against silent labs outages.
-3. `connect_labs_connect_oauth` — `tools/call` the `list_solicitations` atom with no filter args (or the smallest possible page). Distinguishes (a) PAT-level 401 (HTTP transport rejects the bearer) from (b) HTTP 200 with a tool-level `PERMISSION_DENIED` envelope raised by `require_connect_token`. Each surfaces a different fix path. Mirrors the 0.7.1 `ocs_shared_collection_team` probe in spirit.
-
-**Atom inventory ACE consumes** (names match `@register(name=...)` in `connect-labs/commcare_connect/mcp/tools/solicitations.py` and `tools/reviews.py`):
-
-| Atom | Used by skill | Notes |
-|---|---|---|
-| `create_solicitation` | `solicitation-create` | publishes in one shot |
-| `list_solicitations` | `solicitation-monitor` | idempotency check |
-| `get_solicitation` | `solicitation-monitor`, `solicitation-review` | |
-| `update_solicitation` | reserved | future: extend deadline |
-| `list_responses` | `solicitation-monitor`, `solicitation-review` | |
-| `get_response` | `solicitation-monitor`, `solicitation-review` | |
-| `award_response` | `solicitation-review` (post-HITL) | irreversible |
-| `create_review` | `solicitation-review` | per-criterion scoring |
-| `list_reviews` | `solicitation-review` | dedupe before re-scoring |
-| `generate_criteria` | `solicitation-create` | AI-derived rubric from PDD scope text |
-
-**Atoms ACE explicitly does not use:** `fund_*` (out of scope), `create_response` (responses come from real LLOs, not ACE).
-
-## Data contract
-
-### PDD → solicitation field mapping
-
-`solicitation-create` builds the payload from the existing PDD plus three new optional fields surfaced by Phase 1 `idea-to-pdd`:
-
-| Solicitation field | Source | Notes |
-|---|---|---|
-| `title` | PDD `title` + archetype | e.g. `"EOI: Vaccine Hesitancy FLW pilot — atomic-visit"` |
-| `solicitation_type` | new PDD field `solicitation_type: EOI \| RFP` | default `EOI` |
-| `description` | PDD `intervention_summary` + `target_flw_profile` | full prose |
-| `scope_of_work` | PDD `visit_structure` + `success_criteria` | what the LLO must execute |
-| `budget` | PDD `total_budget` | already used by Connect opp setup |
-| `deadline` | new PDD field `solicitation_deadline_days: int` | default 14; Phase 7 computes `now() + N` |
-| `evaluation_criteria` | derived via `generate_criteria` | structured rubric (criteria + weights), then included in payload |
-| `response_template` | PDD `llo_questions` (new field) | falls back to a default 6-question template |
-| `status` | hardcoded `published` | ACE always publishes |
-
-The three new PDD fields (`solicitation_type`, `solicitation_deadline_days`, `llo_questions`) are **additive and optional**. Existing PDDs without them fall through to defaults; no breaking change to the PDD template.
-
-### `opp.yaml` (opp-level, persists across runs)
-
-```yaml
-solicitation:
-  solicitation_id: 1247
-  public_url: https://labs.connect.dimagi.com/a/dimagi/solicitations/1247/
-  manage_url: https://labs.connect.dimagi.com/a/dimagi/solicitations/1247/edit/
-  type: EOI
-  published_at: 2026-05-04T18:22:00Z
-  deadline: 2026-05-18T23:59:59Z
-  status: open                          # open | closed | awarded | failed
-  awarded:
-    response_id: null
-    awarded_at: null
-    awarded_org_slug: null
-    awarded_org_name: null
-    awarded_contact_email: null
-    award_amount: null
-
-selected_llo:                           # narrow contract Phase 8 reads
-  org_slug: null
-  contact_email: null
-  source: null                          # 'solicitation' (only path today; reserved for future alternates)
-  response_id: null
-```
-
-`solicitation` is the audit trail. `selected_llo` is the stable, narrow contract Phase 8 reads. They are separate so a future alternate selection path (e.g. closed-bid named-LLO mode) can populate `selected_llo` without producing fake `solicitation` metadata. Only `source: 'solicitation'` is implemented now — the field is included as a forward-compatibility hint.
-
-### `run_state.yaml` (per-run)
-
-The new phase appears as `phases.solicitation_management.<skill_name>: pending|complete|skipped`. Recurring `solicitation-monitor` invocations log under `phases.solicitation_management.solicitation_monitor.runs[]` like other recurring skills.
-
-**Renames:**
-- `phases.llo_management.*` → `phases.execution_management.*`
-- `phase_6_backlog` (was llo-management bucket) → `phase_7_backlog`
-- `phase_7_backlog` (was closeout bucket) → `phase_8_backlog`
-- New `phase_6_backlog` for solicitation-management
-
-No migration script. In-flight opps finish on the old code; new opps use the new schema.
-
-### New artifact paths under `ACE/<opp>/`
-
-```
-ACE/<opp>/solicitation/
-  draft.md                   # solicitation-create writes before publish
-  published.md               # snapshot of submitted fields + URLs returned by labs
-  invitations.md             # llo-invite writes: who got emailed, when, status
-  responses/                 # solicitation-monitor writes one file per response
-    <response_id>.md
-  review/
-    scoring-rubric.md        # solicitation-review writes per-response scores
-    recommendation.md        # ranked candidates + reasoning, input to HITL gate
-  award-record.md            # written when award_response is called (success or failure)
-```
-
-All registered in `lib/artifact-manifest.ts` (which skill produces, which skills consume).
-
-## New skills (Phase 7)
-
-Four skills under `skills/<name>/SKILL.md`. Two run automatically in default `/ace:run`; one is recurring; one is manual. All stateless per the convention in `skills/README.md`.
-
-### `solicitation-create` (auto, default run)
-
-**Inputs:** approved `ACE/<opp>/design/pdd.md`, `ACE/<opp>/opp.yaml` (program_id, total_budget).
-
-**Steps:**
-1. Map PDD fields to solicitation payload using the table above.
-2. Call `connect_labs.generate_criteria` with PDD scope text; capture the structured rubric.
-3. Write `ACE/<opp>/solicitation/draft.md` for traceability (the full payload + the AI-derived rubric).
-4. Call `connect_labs.create_solicitation` with `status=published`.
-5. Write `ACE/<opp>/solicitation/published.md` with `solicitation_id`, `public_url`, `manage_url`, deadline, criteria.
-6. Update `opp.yaml.solicitation` block.
-
-**Outputs:** `published.md`, `opp.yaml.solicitation.{solicitation_id, public_url, deadline, status: open}`.
-
-**Eval companion:** `solicitation-create-eval` — provisional rubric. Mirrors `connect-program-setup-eval` shape. Grades PDD-fidelity (do scope/criteria match the PDD substance?), field completeness, deadline sanity. Calibrated per `eval-calibration` once 3+ real solicitations have shipped.
-
-### `llo-invite` (auto, default run — moved from Phase 8, transformed)
-
-**Behavior change:** previously identified PDD-named candidates and prepared a Connect-side invite roster. Now identifies the same candidates and emails each one a link to the public solicitation URL. **Makes no Connect API calls.** The Connect program-level invite (`connect_send_llo_invite`) is no longer this skill's responsibility — it fires only for the awardee inside `llo-onboarding`.
-
-**Inputs:** `ACE/<opp>/design/pdd.md` (preferred LLOs), `opp.yaml.solicitation.public_url`.
-
-**Steps:**
-1. Read `preferred_llos` from PDD.
-2. If empty: write `ACE/<opp>/solicitation/invitations.md` with `status: empty (long-term solicitation flow)`, exit successfully.
-3. Otherwise: for each preferred LLO, send an email via `email-communicator` containing the solicitation URL, deadline, and scope summary.
-4. Write `ACE/<opp>/solicitation/invitations.md` listing every recipient with send status (`sent | failed: <reason>`).
-
-**Skill directory stays `skills/llo-invite/`** (name preserved per design discussion). Frontmatter `phase: solicitation-management`.
-
-**No `llo-invite-eval` rubric.** The skill is a templated email sender; LLM-as-Judge grading is not load-bearing.
-
-### `solicitation-monitor` (recurring while solicitation open)
-
-**Trigger:** runs while `opp.yaml.solicitation.status == open`. Mirrors the recurring-skill pattern in current Phase 7 (`timeline-monitor`, `flw-data-review`, `ocs-chatbot-qa`).
-
-**Inputs:** `solicitation_id`.
-
-**Steps:**
-1. Call `list_responses(solicitation_id)`. For each new response, call `get_response` and write `ACE/<opp>/solicitation/responses/<response_id>.md`.
-2. Summarize inflow (count, time-to-deadline, no-response invitees if PDD-named LLOs exist).
-3. Append a tick line to `comms-log/observations.md` per the canopy contract.
-
-**Modes:** `--quick` (count only) vs `--monitor` (full pull). Mirrors `ocs-chatbot-qa --quick/--monitor`.
-
-**No eval companion** initially. Read-only recurring skill.
-
-### `solicitation-review` (manual, `/ace:step solicitation-review` only)
-
-**Inputs:** `solicitation_id` + `responses/` directory.
-
-**Steps:**
-1. `list_responses` → for each, score against the rubric captured in `published.md` (LLM-as-Judge over each criterion).
-2. Write `ACE/<opp>/solicitation/review/scoring-rubric.md` and `recommendation.md`.
-3. Optionally call `create_review` for each response so labs has ACE's scores in the audit trail.
-4. **HITL gate:** present `recommendation.md` and wait for explicit human approval of the awardee + amount.
-5. On approval: call `award_response`. Write `ACE/<opp>/solicitation/award-record.md`.
-6. Populate `opp.yaml.selected_llo` with `{ org_slug, contact_email, response_id, source: 'solicitation' }`.
-
-**Outputs:** `review/` artifacts, `award-record.md`, `selected_llo` populated.
-
-**Eval companion:** `solicitation-review-eval` — provisional. Compares ACE's top-ranked recommendation against the human's actual award decision over time (detection-rate metric). Calibrated per `eval-calibration` once 3+ awards have shipped.
-
-## Phase 8 (Execution Management) changes
-
-**Agent rename:** `agents/llo-manager.md` → `agents/execution-manager.md`.
-
-Frontmatter: `name: execution-manager`, `phase: execution-management`, `phase_display: Execution Management`, `phase_ordinal: 7`. Description rewritten — Phase 8 is no longer "first LLO contact"; it is "execute the awarded LLO's run of the opportunity."
-
-**Skill list inside Phase 8:**
-- `llo-onboarding` — **modified**. Reads `opp.yaml.selected_llo`, fails fast with a "run /ace:step solicitation-review first" message if empty. Otherwise sends Connect program-level invite (`connect_send_llo_invite`) + ACE onboarding email to `selected_llo.contact_email`. Replaces the old roster-based flow.
-- `llo-uat` — unchanged.
-- `llo-launch` — unchanged.
-- Recurring: `timeline-monitor`, `flw-data-review`, `ocs-chatbot-qa`, `ocs-chatbot-eval` — unchanged.
-
-**Phase entry guard:** the orchestrator's pre-Phase-7 check enforces `opp.yaml.selected_llo.org_slug != null`. If empty, `/ace:run` halts at the 6→7 boundary with a message pointing to `solicitation-review`.
-
-**Eval rubric updates:**
-- `llo-launch-eval` — pure rename/renumber (Phase 7 → Phase 8 in references). No rubric content change.
-- `cycle-grade-eval` — same.
-
-## Renaming/renumbering blast radius
-
-Concrete file touches:
-
-**Agents (5 files):**
-- `agents/ace-orchestrator.md` — phase order, pause-points, `phases:` block.
-- `agents/connect-setup.md` — Phase 7/7 references.
-- `agents/ocs-setup.md` — Phase 7/7 references.
-- `agents/qa-and-training.md` — Phase 7/7 references; the "Phase 5→6 always pause" note moves.
-- `agents/llo-manager.md` → renamed to `agents/execution-manager.md`.
-- New `agents/solicitation-management.md`.
-
-**Skills (6 files containing Phase 7/7 strings, plus moves and additions):**
-- `skills/training-onboarding-email/SKILL.md` — Phase number references.
-- `skills/training-deck-build/SKILL.md` — same.
-- `skills/llo-launch-eval/SKILL.md` — phase ordinal.
-- `skills/cycle-grade-eval/SKILL.md` — phase ordinal.
-- `skills/connect-opp-setup/SKILL.md` — Phase 7 references.
-- `skills/ocs-widget-handoff-eval/SKILL.md` — Phase 7/7 references.
-- `skills/llo-invite/SKILL.md` — substantial rewrite + phase reassignment.
-- New: `skills/solicitation-create/`, `skills/solicitation-monitor/`, `skills/solicitation-review/`, `skills/solicitation-create-eval/`, `skills/solicitation-review-eval/`.
-
-**Library + tooling:**
-- `lib/artifact-manifest.ts` — drop `connect-setup/invites.md`, add `solicitation/*` artifacts.
-- `bin/ace-doctor` — phase health checks reference phase names; add `[Connect Labs]` section.
-- `commands/run.md`, `commands/step.md` — `ace:llo-manager` → `ace:execution-manager`.
-- `templates/pdd-template.md` — three new optional fields.
-- `CLAUDE.md` — phase order list, pause-points prose, plugin overview.
-- `README.md` — phase summary table if present.
-
-**Config:**
-- `.env.tpl` — `LABS_MCP_TOKEN` line.
-- `.claude-plugin/plugin.json` — `mcpServers.connect_labs` entry.
-
-Approximate count: ~25 files modified, 1 file renamed, 5 skill directories added, 0 deleted.
-
-## Testing & evals
-
-**Unit tests (vitest, default `npm test`):**
-- `test/mcp/connect-labs/` — new directory. Mock the labs MCP transport; verify each consumed atom's request shape and response handling.
-- `test/skills/solicitation/solicitation-create.test.ts` — fixture PDD → expected payload, mocked `create_solicitation`, verifies `published.md` + `opp.yaml` mutation.
-- `test/skills/solicitation/solicitation-monitor.test.ts` — mocked `list_responses` returning N IDs, verifies one `responses/<id>.md` per ID, single observation-log append.
-- `test/skills/solicitation/solicitation-review.test.ts` — mocked `list_responses` + `list_reviews`, verifies scoring rubric written, `award_response` not called pre-HITL, `selected_llo` populated only after approval.
-- `test/skills/llo-invite/` — updated for new behavior (email send, no Connect call).
-
-**Integration tests (`LABS_INTEGRATION=1`, hits live labs):**
-- `test/mcp/connect-labs/integration/e2e.integration.test.ts` — end-to-end flow `create_solicitation` → `list_responses` → `get_response` → seeded test response → `create_review` → `award_response`. Authenticated by a CI-provisioned PAT in 1Password.
-- Gated like `OCS_INTEGRATION` — does not run in default `npm test`.
-
-**Fixtures:**
-- New `test/fixtures/CRISPR-Test-004-Solicitation/` — PDD seeded with `solicitation_type: EOI`, `solicitation_deadline_days: 7`, two `preferred_llos`, `total_budget: 50000`. Used by both unit and (sanitized) integration tests. Registered in `test/fixtures/artifact-manifest.test.ts`.
-
-**Eval rubrics (provisional, calibrated over time):**
-- `solicitation-create-eval` — PDD-fidelity, field completeness, deadline sanity.
-- `solicitation-review-eval` — recommendation-vs-actual-award detection rate.
-- `opp-eval` umbrella aggregator — adds a `solicitation` category. Lifts the "full coverage" bar from 6 categories to 7.
-
-## Error handling
-
-**`solicitation-create`:**
-- Labs MCP unreachable → halt with doctor-style error.
-- `create_solicitation` returns 4xx → preserve `draft.md`, halt, surface error verbatim.
-- `generate_criteria` returns degenerate output → write what was returned, mark `evaluation_criteria` as `needs-review` in `published.md`, still publish (criteria editable post-publish via labs UI without affecting responses).
-
-**`llo-invite`:**
-- Per-recipient email fails → log to `invitations.md` with `status: failed`, continue.
-- All recipients fail → halt with surfaced error.
-- PDD has no `preferred_llos` → no-op, `invitations.md: empty (long-term solicitation flow)`.
-
-**`solicitation-monitor`:**
-- Read-only; failures are non-fatal (log and skip this tick).
-
-**`solicitation-review`:**
-- HITL gate timeout → no award call, no state mutation.
-- `award_response` 4xx after approval → write `award-record.md` with `status: failed` and the error envelope; **do not** populate `selected_llo` (Phase 8 stays gated). Hard-deduct in eval rubric for any path that sets `selected_llo` when the award did not succeed.
-
-## Future work (out of scope)
-
-- **OAuth-on-the-wire for the labs MCP.** Connect-labs `CLAUDE.md` flags this as planned. When it ships, swap the `headers.Authorization` block for `auth: { type: oauth }` — single-PR migration.
-- **Auto-award.** Once `solicitation-review-eval` is calibrated and detection-rate is high enough, an opt-in auto-award path becomes defensible. Not now.
-- **`connect_labs.update_solicitation`-driven extend-deadline skill.** When monitoring detects low response rate, auto-extend the deadline once with notification. Reserved atom; no skill yet.
-- **`solicitation-monitor` proactive nudge.** When PDD-named candidates haven't responded by N days before deadline, send a reminder email. Trivial extension of `email-communicator` once we see real solicitation data.
-- **Labs-side improvements** to make service-account integration cleaner — see appendix.
-
-## Appendix A — Prompt to hand to connect-labs
-
-Paste verbatim into a Claude Code session in `~/emdash/repositories/connect-labs/`:
-
-> **Make labs MCP easier to integrate for service-account / plugin clients (e.g. ACE).**
->
-> Context: external Claude Code plugins (specifically `ace`, which lives in a sibling repo at `~/emdash/repositories/ace/`) want to call labs MCP tools — `create_solicitation`, `list_responses`, `award_response`, `create_review`, etc. — under a single service identity (`ace@dimagi-ai.com`), not per-developer tokens. The current PAT path works but has three rough edges that compound for service-account use.
->
-> 1. **Self-serve service-account PATs.** Today `python manage.py mcp_create_token` requires shell access to a labs host or admin action. For service identities (where one PAT serves N developer machines and N CI runs), a self-serve UI gated by a specific role/permission would let the owning team rotate without filing a ticket. Spec: a labs UI page `/admin/mcp/tokens/` (or non-admin equivalent) where a user with `is_staff` or a new `mcp.create_service_token` perm can create a PAT for a target user, set TTL, and copy the raw value once. This already half-exists in the admin (`/admin/mcp/mcpaccesstoken/`); the gap is a non-admin route + UX for "create + copy raw token".
->
-> 2. **Expose an MCP atom that returns token metadata for the current bearer.** Something like `whoami` returning `{ user, token_name, created_at, expires_at, last_used_at }`. Service-account clients (like ACE's `/ace:doctor`) would call this to detect upcoming expiry and warn at 30/14/7-day boundaries. Today the only signal of expiry is a 401 at use-time, which is too late.
->
-> 3. **Distinguish "PAT bad" from "Connect OAuth missing" in error envelopes.** When a tool handler calls `require_connect_token(user)` and the user hasn't completed Connect OAuth linkage, the client gets a 200 with a tool-level `PERMISSION_DENIED`. When the PAT itself is bad, the client gets a 401. Both states are "fix your auth" but the *fix path* is different (rotate PAT vs. re-do Connect OAuth in labs UI). Surface the distinction explicitly: a stable error code per state in the JSON envelope (`AUTH_TOKEN_INVALID` vs `CONNECT_OAUTH_REQUIRED`), and ideally a `connect_oauth_url` field in the latter so clients can deep-link the user to fix it.
->
-> 4. **Stretch: OAuth 2.1 on the wire.** CLAUDE.md flags this as planned for a later phase. Anyone who wants to prioritize it: ACE would swap a one-line config (`headers.Authorization: Bearer ...` → `auth: { type: oauth }`) and drop the PAT entirely. The earlier this ships, the less long-term PAT inventory there is.
->
-> Pick whichever of (1)–(3) has the lowest cost-to-impact ratio first. (2) is probably the cheapest and unblocks the most concrete operator pain.
diff --git a/docs/superpowers/specs/2026-05-05-app-multimedia-coverage-design.md b/docs/superpowers/specs/2026-05-05-app-multimedia-coverage-design.md
deleted file mode 100644
index f977e42b96a7df71c90be6466eb2a38bce229d3f..0000000000000000000000000000000000000000
GIT binary patch
literal 0
HcmV?d00001

literal 21583
zcmbW9+jbjSa;E1RPq7y@%SF;mKvI>eZHl5TQBoCMI<QHWs}JslOaciqO8^p$Ops!i
z%F9=t<y+6JH7_tTzMfa;d6RmQ`TmHVnIJ{E%4Q!#0-5_5@khiTvG=`@Ef&daHLZ(T
zJ}$CkR9@!G>^u)Y{nP&l<Gd=)=e^#Cz<+ulk|eSJ@%Q%jNuAZJYI~cno-eaW9X8LF
zSuwBbaybj5a<-V}b^bVHS6NXP^YgHnX7eqz!n@D2I=43;-hcRIa{m$kjo&<7)feT`
zzI%G`BGh>{i{G3SXH&IPy(^|u?>E)hN9BB;kLq^d=}t)Lem2UMc`{)z$s(&q7inzf
z=pw7U@1wG+6Z<nf4Eg*bn~(BYKCd4$z*#n5Wz*?(DCU>tyL`O8y%XkT9TwTL4&@}I
zgKU)VE?4uW+2edxUS`uUS{8M_EHazHNuGy4|66#&H(`>qT+2Kx^FOZgst#xQw7g<W
zy?gh<gMK(#mS@v^*6T6r<ZM+;>u|s%4w!^?s0!6Zu?Sa1eZiW%uak1l;HDX^RQ>RB
zS<Y(~Cl6QU@?E~HdYdm_eOGM-1~}K69mF-N>g#FFoEYLIGcC$_#b1GuBvXw$8(oKi
z{dtzn$5+L;zUWnXt^KPQ@jRatb(k#6Stw@6NEOO?$igho)XQWw4b|$5=e0sMOX~nZ
z>g5bPUxvC27x{D%_#-gFustrf?T<RE-i6J)f8Gxp7pw~pF3JkQu@26%v+KZC_P)vH
z=h?C>!|T%8)ADR=h4PM;D>aja(X?Qq+p+D-qGDTnql>Z_<pBv9S86v@*Rw^<tZH`d
zJm2c~dZ*<$yVlTH|Dr-7=8Pz-FW6VTV6s_;tgf@sMHm+q(tYil5?6*9uy4I~8FqWU
zgl0?@5J99VRHKW0hGfuQQTJ(CUw&4F)S5^KX|@^{CI4O)<Gf76q{ye^UI_1sIpdaO
zY!~yz3enFNk}Za&C#s;`4|VpA4a&meqO8j;6JOrlULWf=TZhj3E;b`|sp^N*X6}sC
z+D)eUWj>{i)ZR+NVp%ToWqlox71OyzG0lg}It}BJzNIi{tN9p_%PPb|vc|Hos5EkL
z@H>{TQ`JF-K?9rrpx^JOeOt6BzAOy_Euaed`vp3WR@C&(Hg?Kl?<S@hpwPS#b9&rN
z_w394l;uW;z=`jdyMw`mQ4bo#h<)$r!+AE#_tUL@_zvkUS2b<;bjHnmH1V0xMM}m#
z&XV7}euaup5F5~>+HnbD;H*XSkS5=#Hp{`2<9z88eXao(d^AF6QE(~!BBwiPaLs4o
zE0l1Vfw73o=E1aFjl=P39x4<Xk=~*0uku+jFOtubm(#4eNY1hm__f7ASnVhi^YC=B
zm=>eV^vjyoa%ABpQ=o}cR)?isGAll1j^#p60&j;>`|!MAgmrN#`QlsTV7y|sG_cGO
zl5DP}I(qY!2~jvWeaixzEsK1z8@4xoTdvT&d016w7HER>n{<S8pPE!&A6}QMCCv!@
z`WxF4$8f>(OkNr}|EvzzAk9T_egSxwEB;|I&k%Z-vaf%QT56cnYt;Fq+znSWkX<5+
zO;6==r~R<m5k`8IPp3)EoW}XQRc(`;UbR|I0RA21pq2)oPgl7n2|}1z5>z1G{<%T>
z!+!Xx%%)ZN3!a}R=lWe=+>0M?UOw3B^)LV!#e9UGhpUSMNNKdOk_{GGv(gT_Y<#u?
ztk}GfOw>-O8l7UFTsXoURb4V&hEs;CZ2FE_H%*3VQPsXRXBp4~x@}yQtLZpurXjO-
z2Z=>xZO56^tjed_@wdl_CD_Vd&~}5cf5R5ZV{o8PuJek$6P})*V~?5@XK(vGmSlT7
z%CS0O!_)`!x&({#_ea7Wo*o_cd!OsYrw+N^yB?;C3lpOmEoca4?oAtmv{8c`w{yFN
z0iCYygw<kNX7-?Dx5X+$st%kqEk^-KK`|Zgn9$=?xT>)3#l$Q}^Eu!I)-6kr5V0x{
z=iV2Z$ulNC&S#5KiXF#}%L2qXxX;U^#PF}?)4Y<R@f$!fy%yFO?rSKULLGAn=zg&f
z<T<wazPSpZsas)$gL9sb`@Ki%{6KDAI0Kc!t2|p`#!=+w`IJY|M|6$>;TdG1qB{WG
z#hbD9dtd7Pm*6Ac-sRfxrb%S06cR)15Y|(hF$N-pCTOM9;7y|y!Brs%Pi6JOUm^`y
zJwWa@@=0QCc5ZKLS?b3uHpU&PxVk`V3jxsivb?IW<w<xg_g2m$#{Vwp!{<S9l4*%T
zMQOYtv5blKBR=JHn@sY2EMyJe9)?XxAsC1YkucQ@t<@rA(`BBGuh|e*RLZc?DknA<
z)1|lC?Fm2q^S?$_>pH47Z1lLqWg<T`;x)d|n)_)=_g-jT$YSL0N7L0<+svRJJis<j
z>qHX4ZhkM&23`5D-~s2>blb-N>ecJ;{c3!k1Ayt(#dYUxjLQ|CDLbf51h~h=SP<N{
zTY=Lf9DU&R#vXKt%H+!{WJ$wOcyKM7iNB)Fk^75YpU9~PSF%N}Xagl*7GP{naPjvU
zAS>s6E&A}sSv|f_O*E?kLVH&nGdTI?@YSpSZ0w%8MkP__J#*o!I%lu$el_fseBoYz
zMGsGx=at_4eUk9Xlc^NvA3h+^(wcwxAZ_|X8aBuIBtzmtst?kw$M(W_c`eocL+XFd
zviFHCfA!?e`a2Kq-}fNtbqVm(nXn%D)Y9fx{1V#<o3moMEP>WLgB@?9l^l$s!mc-^
zyv)}Y+k|s=v2W9Qy98Hw;OT@AAf#H2aSj+F1e-GGV4GuD{vOLgSfK?Z;%goTb8bdp
zj!KoA+flic85XGPDfZq%6x=Ege!#*PLdG(|5ekc9!E(-#0(_B>%e!Vgf}(1K0)<UO
ztDOd_ws5P>cpjr{j*1m?6f!0ueMnH~fuKo;Pik1L5P{{ji3G9Cyha)|7pY)+@yFp9
z+gi+1`BMC(w#)KrhXpb;2x;8!=A#`P=2{V5yHu&!1wG#Pff!1jA;t?_s@lw4I>*-^
zp69ii+4R=8SZ{+bd;ayoQOGW{0#|mbki<sw5{r*7CFDUYnjRAw7}+|r^u@$s8JH|5
z6D?*O#_{@I%GS1`!<?_?vVOzS=ufCVOP|WH3|I34w?R{p1yb<A&uGPtG7mF7bc+_7
zO@EH*!%=zqv2WRK$swWhtzGF%Ya7CAO%&G)w``jtn0ogvh1hFcl&nga=4K160^6l|
zQ7{p{Xi0br)Gn_I9<Q$F{9YA*$~}q|1|L0t{+T%|?DcwN+(-Y2=FRg>F-HoMa>8Vy
z>^fVrbBUrX0+8BRgYcI=x;7$-zWn(Lyd)L~o7)>lc?l~b_etK?8TbzfO1WlIc6qAd
z=~NLGKZMX49g{@IrEsDi0Qm*pwb+2Bi6ugXhd5^(T6}*k?o(XBjcvs+A||ZKZV|>o
zwn-+TktmB2%u-jSH=;qb%PYTK*2@Ouz6zUfepzck+y}#)TF_`B*>1n-w3#0N5^I$&
z>KM?b7lmAI(=iP3XeGUZ&0%tQWv!1J+)nNK@X|EB&Sr7Vq()I}cEg@Q`@=xsMvT1g
zkL!v0dXeu2Q)@skYO{ZBGB3dA^@|mTB9Rf70=lr9Em(WB;V5>)23xq|vDZ8P<jb(R
zXZo>NEqN;6-`a@pDMXKh4>*AI<ZDrk;}|`_XM|#R7@=A34}e?EAMTC~@x_IuAA)I5
z_orL*5A4DJ@$P~*)AnMGwjB|Y*6mmNLsZIqHJ!Fkc;r{@hTmfh&3mvGB5mTMFg{}c
z58fdmlDWEgTSJ>z)lN1EKiGsB+>Y(}uEr_+WTN(a_)(&NFZ}ucK5V!Q7O`<^y#sb!
zza-(nsWzl{6OlSY{%2bXFu!YJnV$({>L}r36D@p+@5{5u6XEva7bIubi3t&2I!swj
z=!vdu-Hw2KSWYxaPGxhzO}sT!y`1G9_&d3k0sf+(@WxvTYU^*p;lvy9=t&U;{?R<7
zjmx=9B(=&@;$WZpTa)Rq`MYwQz&e^oHpUie^#Tj@eO1o=k01OW+|Q=14R0)(H@*O3
zh9hh|Uu;aKR}!YlkPkNyFaLmWuNwLU?L+xB8(jmqi+XqOr+@ih5AWYsps`e#&RXG4
z@2@@By1UKekDA91Z}JKNBmTvHRGA;!dFht~+wcCBP!qX1kz-MZFZy4aWqk_7LrfK;
zaPs^cNT0t2F#UrhD+bm09gL1QcnY-LVXVR#fxtV9GJqqCvZd3}XO7A`7nFK4IEfa6
z1B>UPy)0}>N4PGmD-gUa8RQSaVP_*>5K0rJPdCDksdwegs0)uG@Uo3j0$MFlbm(qf
z?<)NEDMq*Pa*lvnU=4i%eli6<l7U}fWV;5FJsQ~W3rHK6<;1wjPNHsrY(!7*N|0(W
zc9FkdL)nPKLq$YD46gmdM_<Z3-380tP}aag@Yy*awDCEUl5BMyX{rvqyWxQkM<40U
z-Ee>H1A(rM^4)ASY*2eBz3v_oj78M#KD6x~3WAaF6Qib)ni&d!$7p}T<E|EqlDvRS
zrKoB`s0+*n*cJs?W3u3?ot-{u6w>vv>oUmbn?~-2b$wia0+nMtazJ=iu_Rs)(9;1`
z9=>^f&~oUa4EWlI-CrA3=t5AncPoz|Vh)dAZ^QC8bDoa<gKCB|3e$9TR)SSTVRJe0
z`Zr|fmI-1R0B;;F$VIJ4vr;*viMu@T^E69)J47E#FjG{f<r|>pGMJ62*d4xKA9Ul6
z3>$B6*KC&7RK`Q@5QN#-lorxsY%WXV;x*P|dmCR=JLY;5qjmQ!mF>}27Fm)G$>7xG
z^q8}$fV&xiZ;ARyVo+Z~Dl3fcH@ln4q{h!${7a3!XP{!S6|CFPkB$7XgpGQ)cAT3x
zFkwPcak=#{4Xm<Uc%1kIC&ivMyNL8pv-7YC=fx5b3(TP^^6_J2!%>r%Kyi0ufrwl1
z9~E16fTAG9N0ff2u)LW{_~Ffu&KbD*H`I6>{_*qsl}Bnn{kMM&|M)t<KNGq`q#_>)
z8$5c~k0$$TSObkIeG~RWl5|0ED#{4@cr_<qc+CW1vNCbJXR?Ez-*4LVVe!-)$``Mi
zrE@;&D<_X5d(*{$F9qC+B5&K$j>?Eii9-Q}BhDdp7&hn%PsB<9T^6H7PzxWhjQ<5w
z_`$st!-usG+kM`0I5lrY_CU%odbi6G_(XmgP&ey89@vTx%(3i{ai0$&zU>c(duxN=
z?=R-(Q7T~4Z^$?%3z{lQ-dc>b%fhhd62Jq+5BP*}ZCOcZ*sxi7R`byrQN@V}b=cBo
zI_9R6Xix~z58p0~Dl$glt}qV=yO@9uH!M$M-ShYPXeB3NSJ{msoA$#qai~Z&p%7z)
zC#Ys&^Yer}CHy9Ok@+wt<y0BX6|})*O$d1nrQtDr`YCerEAf~|%HFuku1K~qs;$__
zlz7_pr+@k14_x|pGyvnrMR`Sj#2%yxXxMR#liN^Gq?2WG%{`CFDOQ%0NaG#DR#;MM
zfR{P1h<up%KxXIne_M>7Y;-&9Z*RaE8OAnqMw0gM1YycFkS80~)5iWUNO0{9?7jU~
zPVhaZ4J0L&cQFS&oM9h@{7mawv~(ntE#7Nq=-Di1H*E}!WMtElB(g$tiI(B48nX~z
z&m`gK?a8SG2OSw3n{kt(uy2O%HND130Wc@I_D@aP2Hbq*CaYnt`Fv7{FXwq{&qr6)
zD;&N}alo^c0%>9jqFZlUJQ>PDP$&x2WEmK^Nq7lR|4QQYt>+LrVVBuGIQaV4tz^A{
z#Up<=TO(gQ@i6ROf(L4pY=n}%g>o>ZWPm<kTiJd%&LP*ZO`1;kwq|w#Q3Y;2U>b@f
zSWsAF8%MAA*Ba<Er7mDKqWreQ>nNcaskEUnS*bc88c|?F&O9H+mzr-GmPopYB?q32
zmSX?|s%}Id8Do$}x`F-cJ-Dx7{JmvMV(-WRJKAiT&8k%|7%s>W$dYB<I8jM>VN^nc
zB9YEU!6PHmdc%lUJNS?D%xk$``R-aw^%e!U>7^0OM$cKfCd-R&fC03ObRJb-J^ljS
zOT)Lvhu#wqJjcr6R3Xs2P^VZ*EB7FCA~HodG%hBSFka0T$cN<7_;RE&4z-=*jk1>s
z6ru`G<Zww&A4I-XE^D{&za(IQYT}(_@3;IUuU{w6pYMyEi@mQ=sm%`EAZM`mNqiJx
zxe-E0dTz%M_Sv05M8i!$rj|>W!H-@>_&)KCsZKSl%Q~ClWXE&{1jpW+rs4RA`BXzP
znRk5Gnq>RVyf^#5drbnSLStp{Nm)1B6PxwGP@mt?u52<1{~?~yjh64xoK*s##_N7l
zv-U_wHH;#<-Yg1I;2fZW^G0hu+#t4}kcy5WtkIY&1~kH&&H2tkU{K?OOkIufjU8c}
zhM6d%!;`l`l^7U<*`isvZigdDgGN6q$Al`a2-ilYdDc>F8gyCoU3Q|wJ0=ZooMa?a
zJ1Su(YPj5BfA=g$FY<xqB22c8hi=>syvZTsBhBHFts8g7Bn+1Mz4nwU4j)l!UiCo|
zxS2jHeOD$ay<4n5@pSWW_{D1*m}$4V&6+~sDb2JSpb#1~9s0au0vh2T+6u(=VH@9s
z$eAFNk>C^s@t=5K8>t)};ey;_C~)X!&)h!B`r#()n@)?fHGEAx5~?ejI1&U+a2m|G
zTJkyFj9af7Cd*zo<Spj*BNkDv^a&mV5ztMBr?HGKNEnHa(+?dlGw-D>0h3ZUWxj?9
z`(p!slT!&Ka(`{*0GJb51gp2ZByGeHdIn2#YA7NP$1^vWxHkUPn-0<#79&1B2Z3L%
zAu|475}a%ak;#+y(SWvcQf3N_n78!`n<~hX?j}xTTuO-}An`7xUgX>G3!12tj7Yhp
z=E+T42YE&#?F3DRfQ=*(8Dso=dpvADdjDQ&N0FG*Vph~!mLJ2)nFxRgmNOujWV3u*
za5a|KVg=1xh@*}{G_(G8UMLgIDv&&k{J^AmuS76oLj=g<sC;y}G8v%*b319?SY`rV
zo@wjpAQUWH;JXfrEc4v++|NB7o0zZJ)Vv5BRBiC+OmciXtcl#MB6}0zxW7qU)N)A_
zt1`mAiLz3S2*p(ihFr2~<YznwPJph`NNxIqBd8OKF~?!!<L+4HjUL|qjN~Qybp}J$
zJ9zu%^u?Rg;a4x-yf}V(`u2GE?eVK87j?a;C{ZKsE0PQ9Rf+Y;hAc`V^!+>Dq2=K>
zFMj)E?_GYqueN)=XYeb?KTC`!n)VrTzg|q0X;dv~S}tflU2|qUyQ`;ro*^HXBXagB
zq!#ZcW9&)oVJu53dC_7~II8+zHa9t^x)pjr3$a%CrJCBe3!KPbtN2wC{(wxh)xau+
ztC5=f|6lK1haQP+s)CC@nQOiqo`Hz?oOe7N7oZmxJvp^p4-nl{yA6yQ7c0K8_Gtq=
z;-x7&G|GBIE|*32H$F8hK5QfWs`;|<0q%*NVS<><AA3??QR4Q}fM7h3t6YX3R!b=1
zJ=Ip{<NY5c;(55WJX^kfC$ai=H+%`pB8y8tOXk!ff>!sWt)_pC&xmHBXJCJZN<x|<
zo1l)dNvx$SIfY2B6r;YGM8yXlo0ulF?*7N|Nw4=x$;Ni;;T+EBZmpWz0+`ejsMWx}
zU$?2=r1VLe`60YC%>59wUpO6asEMNZp{w~&?FWygF*na6UY^J=HflOhbqjT;%H3O4
zgCaGHJl39FW03tBt7MhqbU}GY*XCWm`<_258s8X~SRdNoBS_y*so*Aj6|}j%@N(*@
zN2QM2sduxf0i#XwV5|s&Y00A*o?SC(4ddBrLTJG+DnlEDzdL%Ne+YE@L~(Ip6wnxq
zSvvFP;9F@^DZKyj{USftUsh2(m@Pi%4{MMbjAIKC32lE^3nI;<Zc0FHcp~zLv+*P7
z7>El*`9~)0bLol6Bts@LRkjm*m=YfKNqwgjs*E3rvTrilMq=7L9{b+G=&Tmh3jzw_
zu^oWy{h96G=BQi|<AXzF4=wF)tDJTbuO35=wzp4#UJI_<`eToOEfxlFtXL{r0_ICk
zF*Sn{5lf6;jC+cj<qJS!U9p!}P~zu^=rfhkDnD?9MqPQ$;pW$;r$<(Vh8`*&V}mIq
zf&OOx=8XSF7(zn3mnxcmreN68+cB*54-f)rlM+WfH(F4OZ1loFNqPFL+TvvC3x%6y
zIYrseVdg!{cKgM|`0u4ZO*o#NQT#@k_F6&1?N<hb7x6Jz*@eB@V?j;L8;i4x1KmOO
zARFwty6vmFrZRGpo!DRV00*w;In3p{_VjwTcM2^ATA>7*Fq6al5<SrGu&x%ByS&R4
zmhKqDJ_&ck)i+x2;(6Ci-tIto7dpN;FM#JGT4dkA#(ezbv-&gg&{R+(#JHknZumEr
zp#kR!1>DGJ?`w=dH4_@_Z<2$P<Cmu#55YU%GmA$Mq0mJGJ~nz_iQmpoYscM&-(ES<
z@7?lABcnRWb~FW$X3H2Of+8_KBd(I7{oTpiH`>{Sm3M$egNu4LwRB6qw3M)?fHzP`
zY*mvF-q3>bxBYC~v80Y0ld=?t+~#mMQ0e9^GQQxcG#d>k&FgJxv1RsdU<T`<pA9K=
z^LqQXtG6UaZTQ<abh~*QtJ%C9HU<}eTo*KTH)vSMLuHc@Ig`e<u^S!j126}=Nnvkv
z!bt}k_Impo{(ueOM}K<5{9~E@->c%`M>o^o*<p!vT@LZMbw0*QavX(0bi-f%_1adi
z?EoSXDG*@XYyC%o7&hV-$iF^({X*mx#$9HUD&FJ6)5C+OuW;~SjWQFAa!E8hPZmVK
zpfn6m1Jdc!&JF6XEin2%L<<`>kCg<(+=_KiQMR`Lg0JR1+(^j|Z2PDtTBhcd0r`0c
zV;NQQU}lf;t0oP9M-{8n^1L)d3xxH)HJVgLM1-wnFAcUu5?h9=Qi!j5Rx5&5b5_hT
zCR4>h11JoV@9@#YV?+r$4TI4?kePV?j3`&fN<?IMf1A|R$z)MWO%ktJVr*7q?o~Wz
zwkL<Pq)^tdx>=30e3X%5Rk6R2s++4CizbKp!$N3GF=P1p+lI-u5A~uQ47pu=ZX87K
z*^8HNk6)NSrxjW6Ub`oo)>K8Zm@3s|vVZ=!@C7U&RVr2Z75es9{O_-~dWc-Cx0DUz
zuP7M&hW%XWcpE;72`dS=$;zTH)i`O#$0Zpz)+Jj)M3BO?z@KXG=R#yFwlD#EZdHDm
zK|E2n2b_i)s8)7=fY4zG`2R$PV66xNRl!KCyMWC>!&}9E|IUM5=*m$fI1!_De%bSp
z1o^o)8Y=nco@?p90o_N2=0BL0vlrvA{^PHHYAx@kXdR;vto3x8;7KAYW9OCh9>tnV
zv{aB_h2h<z8($y^YqH;oa^B+eoI7cnr&bBa1=p$Zp<1A5mo3J8R#ak!#tdpA5m_gO
zr8r%l6C>MJewd#Lyzhp)3p&=4pbE4sgX@f%*(Cwk?k5GZ9p=Eqtl1q&^hdpsaFt_h
znO56U3TUZ&^Wfl)(aD-3G536Rs3N7B*N(`kdf9SCzl+UKiQK`jO?9t`41;@B=IQ#R
zMkAh<=h`?TA3f2|sikitKL9C0BJ{91h{6HTzX0S^sy>enp&wj4k%Jp~8n8xysKcg0
zC{M%KK7#qa^nMb;k{LKGf@|uAoo>_Wn2TV9*q4&S%yAyyw#86b0@BcoL^aJi=trIY
z=Va4pXUxH3V!drh{`SbDmpib4CeoA-KuS?`%MmyVvvq1khc5}S@tCnCUaXOI#V<A<
zbNbBySvA_)K@83EYtb}J!j92Yk9{`E*zmh@T?A{7LrL_bsLgYvSQU<2l38@f1dnz4
zVT)E|qnI~~sI(}Me$$3(5{)|f(5)nVnHz%}^_EF$BamfS%!-Ccq%%(2sD?ijYI}kR
zSK{L{(J+|JlK7tEa(n_>(#H&2<MO<Q;u*82^i8~Qb9sR^w&>Gs-Gcm-QEMi)#}Da-
zvA9PRi6nJPnYJ4w@~k%qRM7{oHBG1hGt~spIS2hn4&B8dI42!BQ2-`bUf<)^55{L>
z)?gZqFTqwq?5-2DA9Pej2bhTu?Ips7(u%8Y0p%L88c(;7^G4eV3@xF8dpIE}%DH9>
znQcInvluNA*{ij}3-LR2?Wa7UK1U;}8$MWO_>;7cc580z@uyn-f>s6pwDgSXtbs|h
zd=T+^P>6i)6+1I?kn3Eh4UK$kZ2%w?Ptru5(ECa)@?{>JKN!_7S_ay-_*b+m;=nyB
z!B*6&aUj;8H^&R4q?}{%AxH#d4v5(N3?`8f43u8p=>`*pIF6x__@SfW8l0v<p?gsI
zX&eeAR91>*A|5@<*tJ!Gb^W$nhecqOf=2JpbJ;G%n-4g~GqH2pLI|y?Eg{_<S;Yy0
z7cshZb;ne#>acI@*7#Y=N67C~>X&jGvRNx+D5mEi1B?}0&j_Z<Kw8Nc4IQu5Q^jUI
zb!3tkZ2|%7Iu2wUlyHQB6Oi|g>pW9Bvq_bDr|lUPx@b<R(Bp}4`hoHjQ#(`eA$*JX
z-1yoz52{G}T>3#ta|by9vH6N>d8VjpOln;4L^%gQrMF!G5OV<_uBkDWfCmVERb^q>
z0(WFC^3_sxizBa7gFCMBzK%pGlSTQ1js5n8490n0mc@9a+GCa3RD(H(bdtq%1z`Y{
z4!^`1yBr!MaQ+dZ%TL6wv1UYaYgYkqF~QtVR}P+gQldSBTfwAIgrVFht?4#_D?8W0
zJRlvglQJB)?-UR*!O=+mp{CIzNbF&g;_wAf$Qe&`4k(nClK6Q}@kYARKyz1QY34d@
z?-=N%>z53~6B!7A1qgo400+)LzE{9pzokwM&dc7~g$CP>qRkBhhL=rkP*)1n9ZI%v
z_6Q9qD>6UY3vW!E6WIat6irs5r}$~t?{*ol$pumUf3kSj)V4|H+M*4Oy>BQTs(cj!
zv_JQBJYbn!MXFZuZIy%hN~9^q#k}c<@{XpeINoJ(f$t!k_;guqQ@87*=u!|&?nbTb
z&llHjctk+?+xm)wm%7105-D$?yUoE0q0llfq_h?3w@bl6Irm@{ghvMqA5b>}_^Cl^
z(p%?aowBhO8t1w>XQm29+h=h%R=()s4&^HTO=iHHN{f_l-sccg`t2ZOF2q2XXuWN2
zFt~iE`56M(0bwiN(eYHN`MFgmL|g=&F3R$qRi+vtSoHwGAgWj=y!{porPUkXIv%X(
zyh?})ODCz)Upk9d)U%%B`k1lt%MUQ4amg*oW3J9UNW`1B$`}!1alIL(UJnh&)B4#n
zu(18CT(=E(SwsmZ8G=ux@NFA<_BA$Ib)k2rNB#Wu_BQy>GM5uSpJ|iUWfpE&Y=a|s
zHkXM+?8wfj4Y0*pJ21~4No{nDL-#z~5Kg;KAc9)6f{i?KSdY6eqUCMn;<j+o^1maW
zehUdl+@8kbn6<OlO&VKLS<N7VF=w98ZtC|QJlIk|WY=S83MTVa;uz1+aJC7Q{dy7R
zM@V2BOFK%j*hK-SH0}i|!-qrJRQ%0PmZ-EkV7jUvGjd6)F~><-2=@tw){9~kg>VN!
ztxGaCEp))8A_u1w2Fv-;ElM=8j6!KnNT1s)FsiENlM@a@u0V@?ZOQgGC!sl(WhYY|
zz+<7$POCSnrzM7IZeS44YH^MV8H}Ub(K$tQjxzyviqTmP+3{ef^!zNd^?Yo#?AzOv
z99b}6g)#1v!IyPTjJ1RmuODW1=ak;9Jjob4tGFYW<KhJTIS`@YKD2X7Oes~#u=NmY
zvb!jT%1b>OdhQ3lwG5Q#$79<mOW}phNQ{wp4}SWm|NaGqn3ACpA)NSK<pXXMQrJ{Y
zcA7=QY15THP4s8KdvMrRrRXpclZlYrB&Z;6{l1WHDnd0XFLV}R&CP4X(spBMrxH3~
z{NsxHIRmMndcspB^(l<8bg~LPm@{x)4@{b?b<|H07>b=cYNM$V;&G*&%=XVXGl0l;
zg+5+?p<*%DFKocFc+b!{P?X*%&HsP@(|t-v-^f;&XDNFDCh;Bc;?ARcC|aUTiu_2L
zO{rzE-#FJ3D?a=Kop;j;lQUH$1NY7hK8=ER%*4VCPs_=RBjnO~7%S&BecK6b27f0g
zEK|xpQ50`|A!^O-5`C>}yHK{P;<g(~MJ$a7DehyS#E`TQ*VPD4B?w&41|ps(uoZTU
znLQ`p#U)#pxYryIX>Rq2(U^d10mRXY4PtG}T5jUzVBi9{&#Eh9LV?>`PohOT#B|`Z
zTM0;%k@b`(Al-aDh?3HxkjG1F6!Oib&_F>s>XfM=dGq4nba+Tf%~urD9KL<?<bj>%
z_a?+QxtG<5lPDXx-X4}}QqpgB$_kDvJ{Bw2O~%p|Ek52nJs&4}vThGd<kc*fO_=~!
z15!69N5{!2A13$jf01xpYi(+j<d_dk)k`hvBbIeA2pu-kO=}3g7RxSivn-6kL&6~{
z(zPT`UWp<1BB&cxRW#u++=nJ%ZDXvblVkB*yIEhsQ3U#=xznyK`w`$%X$f~U?;;xA
z6;YL35yZIcvOo9)(2%9HNFBS=Ty%8sQ300thBmG?g#b1!%c*(?O27Qv{Uz9CH8V_t
z%IQ~rZJt4|_oP}tm^mxf;v5FyOGWwu9Mn-Fw}jEaVyb8<UPfeO1a{rB*iBAbXKgHj
z*vWmD&Nj-5d+p>q{DLAEQIN!<S`G9pCSsuM03$UaC{YO&w<YRifQ%eFD~W8LMY5ZR
z;yp-*#!<Q>p|lR>guDIXvSG2f7R{1rw{uPw4yjYNOR9ym(D>L7hiwIxxR@I{q<fJF
zv*RI8JV&hJsDUm}o2v<H%a3pnYx?oBuIuhj9yh|iMLK*VUnE|Fge*AUGrI1^j?9CV
z)EKfLT-|gQkNx;bA530%`o|AabqCG(66xc1gey*{(1-aZIB#zu^mO*r?+~#&w{BfD
zB#3*xeoqkFZ);9UP~RPY>i!>nDSrnkz59wDd*Byd#p9MKwF`Lg%vyhi)o$+Dk$a$v
zj;7$9Op#5IbHWEkOpA>aBTCNB7*YR_yIm}Txq}d0tJ(Sx4l&&-?Tp$Hp8f2No{&L)
K8KPg*`TqeZ17TtS

diff --git a/docs/superpowers/specs/2026-05-06-skills-audit-findings.md b/docs/superpowers/specs/2026-05-06-skills-audit-findings.md
deleted file mode 100644
index ef14943d..00000000
--- a/docs/superpowers/specs/2026-05-06-skills-audit-findings.md
+++ /dev/null
@@ -1,316 +0,0 @@
-# ACE skills audit — findings (Stage 1)
-
-**Date:** 2026-05-06
-**Scope:** automated audit pass over all 54 ACE skills under `skills/`.
-Stage 2 (per-phase deep-read) and Stage 3 (per-phase fix PRs) follow.
-**Companion brief:** `/tmp/skill-budget-handoff.md` (routing-budget angle, written by another session).
-
-## TL;DR
-
-| Lever | Today | Goal |
-|---|---:|---:|
-| ACE description chars (aggregate) | **15,834** | ≤6,500 (≤120/skill) |
-| Per-skill descriptions over 200 chars | **39 / 54** | 0 |
-| Skills with banned-pattern flags in description | **33 / 54** | 0 |
-| Skills with `## Inputs` section | 16 / 54 | 54 |
-| Skills with `## Outputs` section | 8 / 54 | 54 |
-| Skills with `## Procedure` or `## Steps` section | 7 / 54 | 54 |
-| Stale/dead cross-references | **~10 instances** | 0 |
-| Stale `## Current Workaround` blocks | **2** (`llo-uat`, `llo-feedback`) | 0 |
-
-The audit confirms three orthogonal issues:
-
-1. **Description bloat** — descriptions enumerate phases, paths, sibling refs,
-   and trigger phrases that belong in the body.
-2. **Body-structure drift** — most skills lack the `## Inputs / ## Outputs /
-   ## Procedure` skeleton, leading to ad-hoc layouts that vary skill-to-skill
-   and phase-to-phase.
-3. **Reference staleness** — references to retired skills (`training-materials`
-   umbrella, `qa-plan`) and out-of-date workarounds (manual email send when
-   `email-communicator` exists) survive past their refactor.
-
-## Per-skill description char counts and lint flags
-
-(Source: `/tmp/skill-desc-lint.txt`. Sorted descending by chars.)
-
-| Skill | Chars | Banned patterns |
-|---|---:|---|
-| app-screenshot-capture | 800 | phase-label, step-of-phase, inline-paths, internal-coupling |
-| app-multimedia-coverage | 770 | sibling-of, delete-this-skill |
-| commcare-form-patch | 573 | phase-label, temporary, delete-this-skill, inline-paths, internal-coupling |
-| connect-baseline-screenshots | 510 | (size only) |
-| connect-program-setup-eval | 445 | internal-coupling |
-| ocs-chatbot-eval | 413 | phase-label, internal-coupling |
-| pdd-to-learn-app-eval | 396 | mirror-of, internal-coupling |
-| app-test-cases | 395 | phase-label, successor-to, inline-paths, internal-coupling |
-| llo-launch-eval | 388 | phase-label, internal-coupling |
-| training-deck-outline | 378 | inline-paths, ace-opp-path |
-| idea-to-pdd-eval | 378 | inline-paths, internal-coupling |
-| solicitation-monitor | 358 | phase-label, ace-opp-path, internal-coupling |
-| app-ux-eval | 349 | inline-paths, internal-coupling |
-| ocs-widget-handoff-eval | 346 | phase-label, inline-paths |
-| cycle-grade-eval | 342 | internal-coupling |
-| flw-data-review-eval | 335 | internal-coupling |
-| pdd-to-deliver-app-eval | 334 | internal-coupling |
-| opp-eval | 324 | (size only) |
-| eval-calibration | 314 | (size only) |
-| ocs-chatbot-qa | 311 | internal-coupling |
-| training-flw-guide | 309 | inline-paths |
-| upload-transcript | 299 | (size only) |
-| app-release-eval | 299 | phase-label, provisional, inline-paths |
-| ocs-agent-setup | 298 | (size only) |
-| pdd-to-test-prompts | 297 | phase-label |
-| llo-invite | 293 | phase-label |
-| solicitation-review-eval | 285 | provisional, skills-path |
-| app-release | 282 | phase-label |
-| app-connect-coverage | 282 | (size only) |
-| solicitation-create-eval | 280 | provisional, skills-path |
-| solicitation-create | 275 | phase-label |
-| training-deck-build | 273 | phase-label, inline-paths |
-| solicitation-review | 267 | phase-label, inline-paths |
-| training-onboarding-email | 255 | phase-label, inline-paths |
-| pdd-to-app-journeys | 248 | phase-label, inline-paths |
-| training-faq | 236 | phase-label, inline-paths |
-| email-communicator | 234 | (size only) |
-| llo-launch | 210 | (size only) |
-| connect-opp-setup | 207 | (size only) |
-| training-llo-guide | 186 | inline-paths |
-| llo-onboarding | 180 | (clean) |
-| idea-to-pdd | 174 | (clean) |
-| app-deploy | 172 | phase-label |
-| pdd-to-deliver-app | 169 | (clean) |
-| pdd-to-learn-app | 167 | (clean) |
-| flw-data-review | 167 | (clean) |
-| training-quick-reference | 165 | inline-paths |
-| llo-uat | 146 | (clean) |
-| learnings-summary | 140 | (clean) |
-| connect-program-setup | 138 | (clean) |
-| timeline-monitor | 132 | (clean) |
-| llo-feedback | 120 | (clean) |
-| opp-closeout | 98 | (clean) |
-| cycle-grade | 92 | (clean) |
-
-**Pattern frequencies in descriptions (across 54 skills):**
-
-| Pattern | Count | Example fix |
-|---|---:|---|
-| `phase-label` ("Phase 5", "Step 1 of Phase X") | 19 | move to body intro |
-| `inline-paths` (.yaml/.md/.json files in description) | 16 | move to `## Inputs` / `## Outputs` |
-| `internal-coupling` ("reads X, writes Y") | 14 | move to `## Inputs` / `## Outputs` |
-| `ace-opp-path` (`ACE/<opp>/...`) | 4 | move to body |
-| `provisional` ("Provisional rubric") | 4 | move to body, "## Calibration status" |
-| `successor-to` / `mirror-of` / `sibling-of` | 4 | move to body, "## Related skills" |
-| `temporary` / `delete-this-skill` | 2 | move to body, "## Removal criteria" |
-| `step-of-phase` ("Step N of Phase M") | 1 | move to body |
-| `skills-path` (`skills/<name>/SKILL.md`) | 2 | move to body |
-
-## Stale references (catalog)
-
-### Stale skill references (high priority)
-
-1. **`training-materials` umbrella (removed 0.10.89)** referenced as a current
-   skill in:
-   - `training-deck-build/SKILL.md:7,13,19` — "Phase 5 follow-up to
-     `training-materials`", "After `training-materials` has written..."
-   - `training-deck-outline/SKILL.md:20,33,188` — "after `training-materials`
-     has produced..."
-   - `training-onboarding-email/SKILL.md:34,170` — phase-table claims it's a
-     sibling of `training-materials`
-   - `training-flw-guide/SKILL.md:199` — historical context; OK to keep as
-     such if framed as past tense
-
-   **Fix:** rephrase as "the per-artifact training skills" or list the actual
-   sibling skill names. The path `ACE/<opp-name>/training-materials/` (the
-   directory) is still correct and should NOT be touched — the bug is only
-   when the term refers to the now-removed umbrella *skill*.
-
-2. **`qa-plan` (retired by `app-test-cases`)** referenced in:
-   - `app-test-cases/SKILL.md` — frontmatter description: "Successor to qa-plan
-     (which is retired in this same release)." Per pattern lint, this is a
-     `successor-to` flag — move to body's `## Related skills` section.
-   - `app-screenshot-capture/SKILL.md:317-318` — changelog history; valid as
-     past tense.
-
-3. **`pdd-to-deliver-app-eval/SKILL.md:21-22`** lists "future cross-artifact
-   rubrics: `pdd-to-learn-app-eval`, `learn-vs-deliver-eval`,
-   `connect-opp-vs-pdd-eval`" — `pdd-to-learn-app-eval` has shipped; the
-   "future" framing is dated. Update to "siblings: `pdd-to-learn-app-eval`;
-   planned: ..."
-
-### Stale workflow (high priority)
-
-4. **`llo-uat/SKILL.md`** has a `## Current Workaround` block (Steps 1-5) that
-   instructs the operator to manually send emails. The `email-communicator`
-   skill exists (PR #20) and can do this autonomously. Update procedure to
-   call `email-communicator` directly; remove or shrink the workaround block.
-
-5. **`llo-feedback/SKILL.md`** — same pattern. Same fix.
-
-### Aspirational forward-looking refs (low priority — leave with note)
-
-6. **`app-connect-coverage/SKILL.md:252-254`** lists potential future siblings
-   `app-localization-coverage`, `app-summary-coverage`. These are
-   intentionally aspirational; OK to keep but clarify framing as "Potential
-   future siblings (not yet shipped)".
-
-### Verified-clean (NOT stale despite grep hits)
-
-- `register_hq_api_key` / `finalize_opportunity` in `connect-opp-setup` —
-  appears only in the changelog row that documents their removal. Valid history.
-- `state.yaml` hits in `llo-launch` and `connect-opp-setup` — these are
-  references to `connect-state.yaml`, NOT the renamed `state.yaml` →
-  `run_state.yaml`. False positive.
-- `chatbots.dimagi.com` — zero hits.
-- `llo-management` (renamed to `execution-management`) — zero hits.
-- `fetch_otp` (removed atom) — zero hits.
-
-## Body-structure inconsistency
-
-Of 54 skills:
-
-- 38 lack `## Inputs` (and ad-hoc replacements vary: `## Reads`, `## Source
-  artifacts`, `## Pre-conditions`, plain prose).
-- 46 lack `## Outputs` (replacements: `## Writes`, `## Produces`, `## Output
-  files`, prose).
-- 47 lack `## Procedure` or `## Steps` (replacements: `## Steps to follow`,
-  numbered prose, `## Algorithm`, `### Step N` headers without parent).
-- 35 lack `## Archetypes` despite many being archetype-aware.
-
-This is the single largest body-quality issue. Skills are individually
-readable but a phase agent reading multiple in one session has to re-orient
-to the structure each time.
-
-## Cross-skill redundancy
-
-Found via grep on common phrases:
-
-1. **9 skills** include the boilerplate "Writes a verdict YAML in the shared
-   QA/eval shape so opp-eval can aggregate it." This phrase belongs once in
-   `skills/README.md § QA vs Eval` and should be linked to, not repeated.
-2. **All 12 `*-eval` skills** follow a near-identical 5-section body (rubric
-   intro / dimensions / hard-deduction rules / output verdict shape /
-   calibration). The boilerplate setup duplicates ~30-50 lines per skill.
-   Candidate for shared-template extraction (a `skills/_eval-template.md`
-   that each eval skill includes via "See [this template]" reference).
-3. **All 6 `training-*` artifact skills** follow a near-identical structure
-   (audience / inputs / artifact-shape / self-eval / output path). Same
-   pattern — extract to `skills/_training-template.md`.
-4. **All 4 `solicitation-*` skills** share boilerplate around labs MCP usage
-   and PDD reading. Same pattern.
-
-Saving estimate from extraction: ~1,500-2,000 lines of duplicated body
-content collapsed to single-line includes.
-
-## File-length distribution
-
-```
-Smallest:   65 lines (templates / very-tight skills)
-Avg:       203 lines
-Median:    180 lines
-Largest:   491 lines (ocs-chatbot-eval — has full rubric inline)
-Total:   10,970 lines across 54 skills
-```
-
-The two longest skills (`ocs-chatbot-eval`, `opp-eval`) inline rubrics that
-could move to `skills/<name>/rubric.yaml` as data files.
-
-## Recommendations / prioritization
-
-### P0 — Functional regressions (fix in PR 1, with budget cleanup)
-
-- Stale `## Current Workaround` blocks in `llo-uat`, `llo-feedback`
-  (manual-send instructions when email-communicator now exists).
-- Stale `training-materials` skill references in `training-deck-build`,
-  `training-deck-outline`, `training-onboarding-email`.
-- Banned `successor-to` pattern in `app-test-cases` description.
-
-### P1 — Cosmetic/budget (fix in per-phase PRs)
-
-- All 39 over-200-char descriptions: rewrite to verb + use-when format ≤120
-  chars. Move phase/path/coupling info to body.
-- All 33 banned-pattern instances cleaned up.
-
-### P2 — Body-structure standardization (fix in per-phase PRs)
-
-- Add `## Inputs / ## Outputs / ## Procedure` skeleton to skills missing them.
-- Standardize archetype handling header where applicable.
-- Move inline rubrics out to `rubric.yaml` data files for the two largest skills.
-
-### P3 — Redundancy extraction (separate PR after P2)
-
-- `skills/_eval-template.md` — extract eval-skill boilerplate.
-- `skills/_training-template.md` — extract training-skill boilerplate.
-- `skills/_solicitation-template.md` — extract solicitation boilerplate.
-- Update each affected skill to reference the template instead of inlining.
-
-### P4 — Guardrails (separate PR after all fixes ship)
-
-- CI lint that fails on banned-pattern matches and >200-char descriptions.
-- Stale-ref allowlist test (fails on known-removed atoms / skills / domains).
-- Conventions section in `skills/README.md`.
-
-## Stage-2 deep-read targets (per-phase batches)
-
-The automated pass found everything pattern-detectable. Stage 2 needs a
-careful read of each skill's body for issues automation can't catch:
-- Logical drift (procedure no longer matches its inputs/outputs)
-- Dead code paths in procedures
-- Missing edge cases vs. archetype contract
-- Internal contradictions
-- Adherence to the verdict-shape and artifact-manifest contracts
-
-Suggested batching (one parallel agent per batch, ~5-10 skills each):
-
-| Batch | Skills | Phase |
-|---|---|---|
-| B1 | idea-to-pdd, idea-to-pdd-eval, pdd-to-test-prompts, pdd-to-app-journeys | 1 |
-| B2 | pdd-to-learn-app, pdd-to-deliver-app, pdd-to-learn-app-eval, pdd-to-deliver-app-eval, app-deploy, app-release, app-release-eval, app-connect-coverage, app-multimedia-coverage, commcare-form-patch | 2 |
-| B3 | connect-program-setup, connect-program-setup-eval, connect-opp-setup | 3 |
-| B4 | ocs-agent-setup, ocs-chatbot-qa, ocs-chatbot-eval, ocs-widget-handoff-eval | 4 |
-| B5 | app-screenshot-capture, app-test-cases, app-ux-eval, training-llo-guide, training-flw-guide, training-quick-reference, training-faq, training-deck-outline, training-deck-build, training-onboarding-email, connect-baseline-screenshots | 5 |
-| B6 | solicitation-create, solicitation-create-eval, solicitation-monitor, solicitation-review, solicitation-review-eval | 6 |
-| B7 | llo-invite, llo-onboarding, llo-uat, llo-launch, llo-launch-eval, llo-feedback, flw-data-review, flw-data-review-eval, timeline-monitor, email-communicator, upload-transcript | 7 |
-| B8 | opp-closeout, learnings-summary, cycle-grade, cycle-grade-eval, opp-eval, eval-calibration | 8 + cross-cutting |
-
-Each agent gets the full per-batch list, this findings doc, and the brief to
-verify against. Output: appended findings + suggested-rewrite blocks per
-skill.
-
-## Open questions
-
-1. **Body-template extraction**: should the eval/training/solicitation
-   templates live as actual `_*.md` files included via skill body link, or
-   should they live in `skills/README.md` and be referenced inline? The
-   former is more reusable; the latter is one less artifact.
-2. **`disable-model-invocation` policy**: blanket-apply to all 54 ACE skills
-   (the most aggressive route), or carve out exceptions for skills the user
-   *might* free-text-invoke? User has stated they only use `/ace:run`, which
-   argues for blanket.
-3. **Rubric extraction**: move large inline rubrics (`ocs-chatbot-eval`,
-   `opp-eval`) to `rubric.yaml` data files, or leave inline? Argument for
-   move: easier to test/version. Argument against: skill becomes harder to
-   read in isolation.
-
-## Methodology / reproducibility
-
-All findings can be regenerated:
-
-```bash
-# 1. Per-skill desc/frontmatter inventory
-for d in $(ls skills/ | grep -v README.md); do
-  # ... see /tmp/skill-inventory.txt for the full pipeline
-done
-
-# 2. Staleness greps
-grep -rln "training-materials\|qa-plan\|register_hq_api_key\|chatbots.dimagi.com" skills/
-
-# 3. Description-pattern lint
-# See /tmp/skill-desc-lint.txt for output
-
-# 4. Body-structure check
-for d in skills/*/; do
-  grep -c "^## Inputs\|^## Outputs\|^## Procedure" "$d/SKILL.md"
-done
-```
-
-These will become `test/lib/skill-audit.test.ts` in P4.
diff --git a/docs/superpowers/specs/2026-05-09-mobile-cloud-runner-poc.md b/docs/superpowers/specs/2026-05-09-mobile-cloud-runner-poc.md
deleted file mode 100644
index 13b56210..00000000
--- a/docs/superpowers/specs/2026-05-09-mobile-cloud-runner-poc.md
+++ /dev/null
@@ -1,180 +0,0 @@
-## Mobile Cloud Runner — POC Spec
-
-**Date:** 2026-05-09
-**Status:** Draft for review
-**Owner:** Jon
-
-## Context
-
-`ace-mobile` (Phase 5 emulation layer, shipped 0.9.0) is local-only by design — it drives a single AVD on the operator's Mac, and the 2026-04-28 design spec explicitly defers cloud device farms. That deferral is now blocking three concrete things:
-
-1. Multi-operator collisions (two people running ACE on the same Mac fight over adb ports / AVD state).
-2. Linux/Windows ACE users have untested host paths.
-3. `ace-web` background jobs (e.g., scheduled deep QA, scheduled training-prep refresh) have no way to drive mobile flows because nothing in the cloud runs an emulator.
-
-The cheapest, lowest-friction fix is a **spin-up/spin-down cloud emulator** wrapped behind the same MCP capability map that `ace-mobile` already exposes. The POC validates three things: that an EC2-hosted Android emulator can run our existing Maestro recipes unchanged; that the start/stop economics actually land near $0 at our usage; that an HTTP API surface in `ace-web` is enough to swap behind `mcp/mobile/capability-map.ts` without touching skills.
-
-## Goals
-
-1. A new `CLOUD` backend in `mcp/mobile/capability-map.ts` that routes `ensure_avd_running`, `install_apk`, `run_recipe`, `save_snapshot`, `load_snapshot`, `capture_ui_dump` to an HTTP API in `ace-web`. Skills don't change.
-2. An ace-web HTTP service that owns one EC2 instance: `start → run → stop`, with three independent auto-stop layers so leaks are structurally hard.
-3. A pre-baked AMI that boots straight into a working Android emulator state — KVM enabled, SDK + system image cached on disk, CommCare APK pre-staged, AVD pre-warmed to a "logged-in test user" snapshot.
-4. End-to-end smoke run: `mobile_run_recipe(recipePath="connect-login.yaml", ...)` invoked from a skill, dispatched to the `CLOUD` backend, executed against the EC2 emulator, screenshots streamed back to Drive at `ACE/<opp>/screenshots/`. One green test, end-to-end, is the success bar.
-5. Cost target: under $5/mo at 50 runs × 5min/run; under $20/mo at 10× that volume.
-
-## Non-goals
-
-- Multiple device targets / device variety. One AMI, one AVD shape (Pixel 7, API 34, x86_64), matching the local stack.
-- Concurrency. One run at a time per instance. (Forking to N parallel instances is a Phase 2 question once the singleton path is solid.)
-- iOS. Android only.
-- Migration path off Mac AVD. Local AVD remains the default for operator-driven runs; cloud is opt-in via env var or skill arg.
-- ARM64 emulator path on Graviton. x86_64 with nested virt only.
-- Real-device cloud (BrowserStack, Sauce, Firebase). Spin-up/down emulator only.
-- Multi-user auth state. The pre-baked snapshot embeds *one* registered ConnectID test user; rotating that is a manual rebuild.
-- Anything that touches `ace-web`'s production deployment topology beyond adding one Express route group + one IAM role + one EC2 instance.
-
-## Architecture
-
-```
-┌────────────────────────────────────────────────────────────────┐
-│ ACE plugin (this repo)                                          │
-│                                                                 │
-│ skill (e.g. app-screenshot-capture)                             │
-│   └─ MCP atom: mobile_run_recipe(...)                           │
-│       └─ capability-map.ts: route → CLOUD backend (when env set)│
-│           └─ mcp/mobile/backends/cloud.ts                       │
-│               POST https://ace-web/api/mobile/run-recipe        │
-└────────────────────────────────────────────────────────────────┘
-                              │ HTTPS, ACE_WEB_PAT_TOKEN
-                              ▼
-┌────────────────────────────────────────────────────────────────┐
-│ ace-web                                                         │
-│                                                                 │
-│ Express routes: /api/mobile/{ensure-running, install-apk,       │
-│   run-recipe, save-snapshot, load-snapshot, capture-ui-dump,    │
-│   stop, status}                                                 │
-│   └─ EmulatorController                                         │
-│       ├─ AWS SDK: ec2.startInstances / stopInstances            │
-│       ├─ SSH (ssh2): adb / maestro / shell over SSH             │
-│       └─ S3 download: artifacts (screenshots/PNGs) → presigned  │
-│           URLs returned to caller, who fetches + uploads to     │
-│           Drive (Drive auth stays in the plugin, not ace-web).  │
-│                                                                 │
-│ Auto-stop layers:                                               │
-│   1. finally{} block in EmulatorController                      │
-│   2. In-VM `shutdown -h +10` after each run + EC2's             │
-│      InstanceInitiatedShutdownBehavior=stop                     │
-│   3. CloudWatch alarm: CPU<5% for 30min → stop instance         │
-└────────────────────────────────────────────────────────────────┘
-                              │ EC2 API + SSH
-                              ▼
-┌────────────────────────────────────────────────────────────────┐
-│ AWS                                                             │
-│                                                                 │
-│ EC2 m8i.xlarge (4 vCPU / 16 GiB), nested virt enabled           │
-│   AMI: ace-mobile-emulator-vN (pre-baked)                       │
-│     - Ubuntu 24.04 + KVM (kvm-ok passes)                        │
-│     - Android SDK + system-images;android-34;google_apis;x86_64 │
-│     - AVD `ACE_Pixel_API_34` pre-created                        │
-│     - CommCare 2.62.0 APK at /opt/ace/apks/commcare.apk         │
-│     - Maestro CLI at /opt/maestro                               │
-│     - Snapshot `registered-test-user` pre-saved                 │
-│   30 GB gp3 root volume                                         │
-│   Tags: auto-stop=true, owner=ace-web-mobile-poc                │
-│   Security group: SSH from ace-web's egress IP only             │
-│                                                                 │
-│ S3 bucket: ace-mobile-artifacts-<env>                           │
-│   PUT from EC2 instance role; GET via presigned URL             │
-│                                                                 │
-│ CloudWatch alarm: CPU < 5% / 30min → ec2:StopInstances          │
-└────────────────────────────────────────────────────────────────┘
-```
-
-## HTTP API contract (ace-web ↔ MCP)
-
-All endpoints are POST, JSON, auth via `Authorization: Bearer <ACE_WEB_PAT_TOKEN>`. Idempotent where noted. Errors return `{error: {code, message}}` with HTTP 4xx/5xx.
-
-| Path | Body | Returns | Idempotent |
-|---|---|---|---|
-| `/api/mobile/ensure-running` | `{}` | `{instance_id, state, public_dns, started_at}` | yes — no-op if already running |
-| `/api/mobile/install-apk` | `{apk_url}` (presigned URL or `built-in:commcare-2.62.0`) | `{package_name, version}` | no — `adb install -r` |
-| `/api/mobile/run-recipe` | `{recipe_yaml: string, env: {[k]: string}, screenshot_prefix?: string}` | `{exit_code, stdout, stderr, artifacts: [{name, presigned_url, content_type}]}` | no |
-| `/api/mobile/save-snapshot` | `{name}` | `{name, saved_at}` | no |
-| `/api/mobile/load-snapshot` | `{name}` | `{name, loaded_at}` | no |
-| `/api/mobile/capture-ui-dump` | `{}` | `{xml: string}` | no |
-| `/api/mobile/stop` | `{}` | `{instance_id, state, stopped_at}` | yes |
-| `/api/mobile/status` | — (GET) | `{instance_id, state, last_run_at, idle_for_seconds}` | yes |
-
-**Recipe transport:** `recipe_yaml` is the literal YAML string, not a path — the cloud runner has no shared filesystem with the caller. The MCP backend reads the local file and POSTs the content. Same for `apk_url`: caller pre-uploads APK to S3 (or refers to the built-in pre-staged one) and passes a URL.
-
-**Artifacts:** `run-recipe` returns presigned URLs that expire in 1 hour. The MCP backend downloads each artifact and writes it to its expected local destination (typically `ACE/<opp>/screenshots/<step>.png` via Drive). ace-web does not know about Drive.
-
-## AMI contents (pre-baked, versioned)
-
-Pinned versions live in `ace-web`'s `infra/mobile-ami/Packerfile` (or equivalent). v1 ships:
-
-- Ubuntu 24.04 LTS (x86_64)
-- `qemu-kvm`, `libvirt-daemon-system`, kernel modules loaded; `kvm-ok` passes
-- OpenJDK 17
-- Android command-line tools, `platform-tools` (adb), `emulator`, `system-images;android-34;google_apis;x86_64`
-- AVD: `ACE_Pixel_API_34`, Pixel 7 hardware profile, `hw.camera.front=emulated`, 4 GB RAM, 6 GB internal storage
-- Maestro CLI 2.5.x at `/opt/maestro/bin/maestro`
-- CommCare APK 2.62.0 at `/opt/ace/apks/commcare.apk` (md5 in AMI metadata)
-- Pre-saved AVD snapshot `registered-test-user`: emulator booted, CommCare installed, ACE test user registered + logged in, Connect mobile-app handoff completed
-- systemd unit `ace-mobile-runner.service`: starts emulator with `-no-window -gpu swiftshader_indirect -no-snapshot-save -snapshot registered-test-user` on boot
-- SSH: ed25519 host keys; instance role allows S3 PutObject scoped to `ace-mobile-artifacts-*`
-
-**Refresh cadence:** rebuild AMI when CommCare APK rev'd or test-user identity rotated. Manual `npm run build:ami` from `ace-web`. AMI version recorded in `ace-web` config + surfaced in `/status` response.
-
-## Auto-stop layers (independent, all three required)
-
-1. **Application-level `finally`.** Every Express handler that calls `ec2.startInstances` registers an `afterRun` callback that calls `/api/mobile/stop`. Wrapped in `try/finally` so exceptions in the run path still trigger stop.
-
-2. **In-VM shutdown.** `ace-mobile-runner.service` watches for an idle marker file (`/var/run/ace-mobile/last-activity`); if no activity for 10 minutes, runs `sudo shutdown -h now`. EC2 launch template sets `InstanceInitiatedShutdownBehavior=stop` so this stops the instance, doesn't terminate it. Independent of any ace-web call — survives ace-web crash.
-
-3. **CloudWatch alarm.** `CPUUtilization < 5% for 30 consecutive minutes` → `arn:aws:automate:<region>:ec2:stop`. Catches the case where (1) and (2) both failed (e.g., a runaway emulator pegging CPU low but not actually doing work). ~$0.10/mo.
-
-POC success bar: kill the ace-web process mid-run; verify the instance still stops within 40 minutes.
-
-## MCP `CLOUD` backend (this repo, ~80 lines)
-
-`mcp/mobile/capability-map.ts`:
-
-```ts
-export type Backend = 'MAESTRO' | 'AVD' | 'COMPOSITE' | 'CLOUD';
-```
-
-`mcp/mobile/backends/cloud.ts` (new): typed HTTP client. Reads `ACE_WEB_BASE_URL` + `ACE_WEB_PAT_TOKEN` from env. Exposes the same shape as `avd.ts` so the dispatcher swap is mechanical.
-
-Routing rule: the dispatcher picks `CLOUD` when env var `ACE_MOBILE_BACKEND=cloud` is set, otherwise falls back to current AVD/Maestro behavior. POC default stays `AVD`. No skill changes.
-
-Atoms NOT routed to cloud in v1 (deliberately):
-- `register_test_user` — the registered state is *baked into the AMI snapshot*, so this becomes a no-op on cloud. The MCP atom returns success immediately when backend=cloud and the AMI advertises a pre-baked test user.
-- `list_avds`, `stop_avd` (the AVD-CLI ones) — local-only.
-- `generate_recipes_from_app_summary` — runs in-MCP, not on the device; backend-independent.
-
-## Success criteria
-
-A POC is "done" when **all five** of these pass in one session:
-
-1. `aws ec2 describe-instances --filters Name=tag:owner,Values=ace-web-mobile-poc` shows `state=stopped` from a cold start. Run a smoke recipe; instance starts, recipe runs, instance returns to `stopped` within 2 min of recipe finish.
-2. `mobile_run_recipe` invoked from this plugin (env: `ACE_MOBILE_BACKEND=cloud`) returns the same shape as the AVD backend; one PNG lands in Drive `ACE/_poc/screenshots/`.
-3. Recipe execution wall-clock under 90s for a known-fast recipe (e.g., `connect-login` against the pre-baked snapshot).
-4. Kill ace-web mid-run (`SIGKILL`); instance stops within 40min via layer 2 or 3.
-5. Cost telemetry: a 10-run smoke loop costs <$0.50 of EC2 + S3 (verify via Cost Explorer or instance hours × rate).
-
-## Open questions
-
-- **APK source of truth.** AMI bakes 2.62.0; Connect mobile is on a faster release cadence. Do we accept a stale APK between rebuilds, or do we add an `install-apk` "fetch latest from Connect releases" path? POC: stale-is-fine; rebuild on demand.
-- **Snapshot rot.** The pre-baked `registered-test-user` snapshot embeds a registered ConnectID session for a `+7426`-prefixed demo phone (Connect-id skips OTP entirely for that range — no human-in-the-loop OTP entry needed at bake time). Sessions can still expire server-side. POC: ignore until it bites; v2 might include a `refresh-snapshot` admin endpoint that re-runs the demo registration in-place.
-- **Drive vs S3 for artifacts.** Current local backend writes PNGs straight to a local screenshotDir; the plugin orchestrates Drive upload. Cloud needs S3 as transit. Worth keeping S3 as the *only* sink and having ace-web push to Drive directly? POC: no — keep Drive auth in the plugin, pass through S3.
-- **Concurrency.** Singleton EC2 means two simultaneous `mobile_run_recipe` calls serialize. POC: 503 the second caller. Real fix is a small instance pool with a queue; out of scope.
-- **AMI build pipeline.** Manual `packer build` for v1, GitHub Actions later. POC: manual.
-
-## Out-of-scope follow-ups (filed, not done)
-
-- Concurrent runs (queue + N-instance pool).
-- ARM64 / Graviton emulator path.
-- iOS Simulator on `mac1.metal`.
-- GCP variant (kept warm as a comparison; revisit if AWS C8i nested virt has surprises in production).
-- Multi-tenant: one cloud emulator service serving multiple ACE installs.
diff --git a/docs/superpowers/specs/2026-05-11-mobile-cloud-runner-api-gaps.md b/docs/superpowers/specs/2026-05-11-mobile-cloud-runner-api-gaps.md
deleted file mode 100644
index 18f9579f..00000000
--- a/docs/superpowers/specs/2026-05-11-mobile-cloud-runner-api-gaps.md
+++ /dev/null
@@ -1,141 +0,0 @@
-# Mobile cloud runner — closing the API gaps
-
-**Status:** ✅ Shipped (2026-05-13). All five gaps implemented end-to-end.
-**Original draft:** 2026-05-11
-**Related:** [2026-05-09-mobile-cloud-runner-poc.md](2026-05-09-mobile-cloud-runner-poc.md) (parent design)
-**Author:** Jonathan + claude (audit pair)
-
-## Status (audit 2026-05-13)
-
-| Gap | Status | Evidence |
-|-----|--------|----------|
-| 1 — `install-apk` returns `version_code` | ✅ shipped | `mcp/mobile/backends/cloud.ts:308,313-317` reads `result.version_code ?? 0` (server-side ships since ace-web #307) |
-| 2 — `capture-ui-dump` returns parsed `elements[]` | ✅ shipped | `mcp/mobile/backends/cloud.ts:367-376` surfaces `result.elements ?? []` |
-| 3 — `run-recipe` returns structured `steps[]` | ✅ shipped | `mcp/mobile/types.ts:117-125` defines `StepResult`; `mcp/mobile/backends/cloud.ts:91-107,714-737` parses + narrows the `status` enum at the boundary |
-| 4 — `stop` busy guard with `force: boolean` | ✅ shipped | `mcp/mobile/backends/cloud.ts:200-211` passes `{force: true}` when caller opts in; ace-web returns 409 otherwise |
-| 5 — `register_test_user` no-op on cloud | ✅ shipped | `mcp/mobile/client.ts:729-739` short-circuits on `useCloud` — cold-boot path already registers the +7426 demo user |
-
-Original spec body retained below for historical context.
-
----
-
-## Why
-
-The POC shipped and is in production on `emdash/mobile-recipe-cwd-fix`. Audit of the ACE ↔ ace-web HTTP contract revealed a small set of gaps where ace-web has the information internally but isn't surfacing it to the ACE plugin. This spec captures the gaps, the fix per gap, and what stays explicitly deferred.
-
-**Out of scope** (intentionally deferred):
-- **Client-side APK upload via presigned PUT.** Standard Connect APKs are baked into AMI states (`/opt/ace/states.yaml`) and installed by `ensure-running`. Custom APK upload only matters for RC testing, which we don't have yet.
-- **Async `run-recipe` with polling / streaming.** Synchronous up to the 30-min SSM ceiling is fine until a real Phase 8 walkthrough hits that wall.
-- **`bootTimeMs` in `ensure-running` response.** Nice-to-have for monitoring; defer.
-
-## Gap inventory and fix per gap
-
-### Gap 1 — `install-apk` doesn't return `version_code`
-
-**Where:** `apps/mobile/controller.py:279-307` (`EmulatorController.install_apk`).
-
-**Today:** Parses `package_name` and `version` (= `versionName`) from `aapt dump badging`. Doesn't extract `versionCode`. ACE's `ApkInfo` requires `versionCode: number`; the cloud backend hardcodes `0` (`mcp/mobile/backends/cloud.ts:194`).
-
-**Why it matters:** Phase 8 `app-release` post-release verification checks installed CommCare version. `versionCode` is the monotonic integer; `versionName` is human-readable and can be ambiguous (`"2.62.0"` vs `"2.62.0-rc1"`).
-
-**Fix:**
-- Extend the `aapt dump badging` parser to also grep `versionCode='N'`.
-- Add `version_code: int` field to `InstallResult` (controller dataclass + view envelope).
-- Update `mcp/mobile/backends/cloud.ts` install_apk parser to consume `result.version_code` instead of hardcoding `0`.
-
-### Gap 2 — `capture-ui-dump` returns raw XML only
-
-**Where:** `apps/mobile/controller.py:460-474` (returns `result.stdout` as raw XML string).
-
-**Today:** Returns `{xml: <raw_uiautomator_dump>}`. Clients have to parse it themselves. ACE's cloud backend returns `{xml, elements: []}` — `elements` is hardcoded empty.
-
-**Why it matters:** Selector-based skills want `elements[]` to assert "this control exists" or "the value of textbox X is Y." Every client parsing the XML separately is duplicated work, and the local AVD backend already returns parsed elements.
-
-**Fix:**
-- Server-side: parse the XML once in the controller. Return `{xml, elements: [{id, text, class, bounds, clickable}]}`.
-- The local AVD backend (`mcp/mobile/backends/avd.ts:capture_ui_dump`) already does this parse; lift its logic to the controller (or re-parse on the ACE side until we know callers need it).
-- Update `mcp/mobile/backends/cloud.ts:246` to surface `result.elements` instead of `[]`.
-
-**Decision: parse server-side.** Reasoning: (a) every client needs the parse, so one canonical parser beats N; (b) keeps the cloud backend's response shape parallel to the AVD backend's; (c) the raw XML is large — clients that don't need it can ignore the field.
-
-### Gap 3 — `run-recipe` returns no structured step report
-
-**Where:** `apps/mobile/controller.py:309-369` (returns `RunResult{exit_code, stdout, stderr, artifacts[]}`).
-
-**Today:** Skills reconstruct step ordering from artifact filenames (`01_tap_button.png`, `02_assert_visible.png`). Fragile — depends on Maestro naming conventions and recipe authors not skipping numbers. No way to know which step failed without parsing stdout heuristics.
-
-**Why it matters:** Phase 8 walkthroughs need to surface "step 4 of 12 failed at assertVisible(text='Submit')" — currently impossible without scraping Maestro's stdout. Eval rubrics also want structured per-step status to assert "did the recipe reach step N."
-
-**Fix:**
-- Maestro's `--debug-output` directory writes structured artifacts: `commands-(json|html)`, `screenshots/`, `maestro.log`. The JSON is one record per executed command with `command`, `metadata`, `screenshot`.
-- Parse the JSON in the controller after `aws s3 cp` finishes. Build `steps: [{index, name, status: 'pass' | 'fail' | 'skipped', screenshot_name?, error?, duration_ms?}]`.
-- Add `steps: list[Step]` to `RunResult` alongside existing `artifacts[]`.
-- ACE `cloud.ts:CloudRunResult` gains `steps?: StepResult[]`; `RecipeRunResult` in `mcp/mobile/types.ts` similarly. Skills that don't need it ignore it; `app-screenshot-capture` becomes order-independent.
-
-**Open question:** Does Maestro emit a per-command JSON in `--debug-output`, or only an aggregate `commands.json`? The implementation PR should verify against an actual Maestro run before nailing down the shape. If only aggregate, the per-step shape may need to be derived; we may also need to fall back to parsing `maestro.log`. **Action:** implementation PR starts by running one recipe end-to-end against the staging instance and pasting an `ls -la /tmp/run-<id>/` in the PR description, so reviewers see the actual debug-output surface.
-
-### Gap 4 — `stop` has no busy guard
-
-**Where:** `apps/mobile/views.py:316-331` (`stop`) and `apps/mobile/controller.py:227-236`.
-
-**Today:** Stop endpoint deliberately bypasses the singleton lock to allow aborting a hung recipe. Means an accidental stop call mid-run kills a legitimate flow with no warning.
-
-**Why it matters:** As we wire more skills to the cloud, accidental concurrent calls become more likely. The "abort hung recipe" path is rare and intentional; the "two skills accidentally racing" path is common and silent.
-
-**Fix:**
-- Add `force: bool = false` to the stop request body (new `StopSerializer`).
-- View logic:
-  - Read singleton state (without acquiring).
-  - If busy and `force=false`: return `singleton-busy` error (HTTP 409) with `{current_owner}`.
-  - Otherwise: existing behavior — call controller.stop().
-- ACE `mcp/mobile/backends/cloud.ts:stopAvd` passes through a `{force?: boolean}` option.
-
-### Gap 5 — `register_test_user` is unhandled on cloud
-
-**Where:** `mcp/mobile/client.ts:registerTestUser` routes to `avd.registerTestUser` even when `useCloud=true`; `CloudBackend` has no such method, so the call throws.
-
-**Today:** Spec (`2026-05-09-mobile-cloud-runner-poc.md:152`) says the cold-boot path on cloud already runs the +7426 demo registration recipes — `register_test_user` is meant to be a no-op on cloud. Implementation never landed.
-
-**Fix:** In `mcp/mobile/client.ts:registerTestUser`, short-circuit when `useCloud`:
-
-```ts
-registerTestUser(avdName: string): Promise<TestUserInfo> {
-  if (this.useCloud) {
-    // Cold-boot path on ace-web already registers the +7426 demo user
-    // before /run/ace-mobile/ready is touched; this atom is a no-op.
-    return Promise.resolve({ phone: '+7426...', registered: true, source: 'cloud-baked' });
-  }
-  return this.avd.registerTestUser(avdName);
-}
-```
-
-Pin the exact pre-baked phone number from `infra/mobile-ami/scripts/`. Add a unit test that verifies the no-op shape.
-
-## Sequencing
-
-| Order | PR | Repo | Independent? |
-|------|----|----|------|
-| 1 | Gap 5 (register no-op) | ACE | Yes |
-| 2 | Gap 1 (version_code) | ace-web | Yes |
-| 3 | Gap 2 (parsed elements) | ace-web | Yes |
-| 4 | Gap 4 (stop busy guard) | ace-web | Yes |
-| 5 | Gap 3 (structured steps) | ace-web | Yes — biggest |
-| 6 | ACE-side type updates | ACE | Blocked by 2 + 5 |
-
-PRs 1–4 can ship in parallel. PR 6 lands after the ace-web PRs are merged + deployed.
-
-## Verification
-
-Per PR:
-- Unit test in `apps/mobile/tests/` for shape (request validation + envelope).
-- Integration test (where applicable) hits the staging instance via `MOBILE_INTEGRATION=1`.
-
-End-to-end:
-- Re-run `app-screenshot-capture` on the staging cloud instance after PR 6 lands; confirm screenshots line up with `steps[]` and `versionCode` populates.
-
-## Non-goals
-
-- New endpoints (everything fits in existing routes).
-- Client-side APK upload (deferred).
-- Async / job-queue refactor (deferred).
-- Changing the `{data, error}` envelope (additive only).
diff --git a/docs/superpowers/specs/2026-05-15-ace-sweep-atom-contracts.md b/docs/superpowers/specs/2026-05-15-ace-sweep-atom-contracts.md
deleted file mode 100644
index 5e72cc35..00000000
--- a/docs/superpowers/specs/2026-05-15-ace-sweep-atom-contracts.md
+++ /dev/null
@@ -1,186 +0,0 @@
-# Sweep atom contracts — research findings
-
-**Status:** research complete; ready to implement
-**Date:** 2026-05-15
-**Parent spec:** `2026-05-15-ace-sweep-design.md`
-**Phasing:** referenced as "follow-up atoms" in `2026-05-15-ace-sweep-pr1-foundation-and-drive.md`
-
-PR 1 (`/ace:sweep` foundation + Drive) and PR 2 (per-system sweep skills in report-only mode) shipped. Each "report-only" item in the coverage matrix can be promoted to "auto-execute" by shipping a focused atom-building PR. This doc captures the exact upstream contract for each atom so an implementer can build it without re-research.
-
-## Decision matrix (per atom)
-
-| Atom | Transport | Backend file | Auth | Method | Endpoint |
-|---|---|---|---|---|---|
-| `connect_delete_unaccepted_flw_invites` | HTML form POST | `mcp/connect/backends/playwright.ts` | session cookie | POST | `/a/<org_slug>/opportunity/<opp_id>/delete_invites/` |
-| `ocs_archive_chatbot` | HTML form POST | `mcp/ocs/backends/playwright.ts` | session cookie + CSRF | POST | `/a/<team_slug>/chatbots/<pk>/delete/` |
-| `ocs_archive_collection` | HTTP DELETE method | `mcp/ocs/backends/playwright.ts` | session cookie + CSRF | DELETE | `/a/<team_slug>/documents/collection/<pk>/delete/` |
-| `ocs_archive_pipeline` | HTTP DELETE method | `mcp/ocs/backends/playwright.ts` | session cookie + CSRF | DELETE | `/a/<team_slug>/pipelines/<pk>/delete/` |
-| `ocs_delete_collection_file` | HTML form POST | `mcp/ocs/backends/playwright.ts` | session cookie + CSRF | POST | `/a/<team_slug>/documents/collections/<pk>/files/<file_id>/delete` |
-| `commcare_list_apps` | JSON REST | `mcp/connect/backends/commcare.ts` | session cookie (TaskPie `allow_session_auth=True`) | GET | `/a/<domain>/api/v0.4/application/` |
-| `commcare_delete_app` | HTML form POST | `mcp/connect/backends/commcare.ts` | session cookie + CSRF | POST | `/a/<domain>/apps/delete_app/<app_id>/` |
-| `labs_delete_record` | REST in local-tool proxy | `mcp/connect-labs-server.ts` | OAuth2 Bearer (LABS_MCP_TOKEN) | POST | `/export/labs_record/` body: `[{id}]` |
-
-The `labs_delete_record` atom is the only one that doesn't fit cleanly in an existing MCP server file — the labs MCP is a stdio JSON-RPC proxy. It needs **local-tool routing**: intercept `tools/list` to append our local tool, intercept `tools/call` for `labs_delete_record` and make the REST call directly, forward everything else.
-
-## Contract details (per atom)
-
-### `connect_delete_unaccepted_flw_invites`
-
-- **URL pattern (urls.py):** `path("<slug:opp_id>/delete_invites/", views.delete_user_invites, name="delete_user_invites")`
-- **Full URL:** `/a/<org_slug>/opportunity/<opp_id>/delete_invites/` (`<opp_id>` is the opportunity UUID slug)
-- **Method:** POST
-- **CSRF:** view is `@csrf_exempt` — token not required, but safe to send for compatibility
-- **Form body:** `user_invite_ids=<int>&user_invite_ids=<int>...` (Django `getlist("user_invite_ids")`; integer auto-PKs from the `UserInvite` model)
-- **Permissions:** `@org_member_required` + `@opportunity_required`
-- **Behavior:** filters to `id__in=invite_ids` AND `opportunity=request.opportunity` AND excludes `status=accepted`. Hard-deletes matched rows plus their `OpportunityAccess` records.
-- **Response on success:** HTTP 200 with `HX-Redirect` header pointing to worker_list
-- **Response on empty list:** HTTP 400 (`HttpResponseBadRequest`)
-- **List source:** `connect_list_invites` already exposes these — each invite's `id` is the integer this atom expects.
-
-**Implementation outline:**
-```typescript
-// playwright.ts — direct POST, no CSRF scrape needed
-deleteUnacceptedFlwInvites: ConnectClient['deleteUnacceptedFlwInvites'] = async ({
-  organization_slug, opportunity_id, user_invite_ids,
-}) => {
-  if (user_invite_ids.length === 0) {
-    return { deleted: 0 };  // no-op; don't even call the endpoint
-  }
-  const urlPath = `/a/${organization_slug}/opportunity/${opportunity_id}/delete_invites/`;
-  const form: Record<string, string | number> = {};
-  user_invite_ids.forEach((id, i) => { form[`user_invite_ids_${i}`] = id; });
-  // ⚠️ Django expects repeated key, not indexed — use page.request.post's `multipart` or build query body manually
-  const res = await this.request.post(urlPath, { form, headers: { Referer: `${this.opts.baseUrl}${urlPath}` } });
-  if (res.status() !== 200) throw await httpErrorFor(res, urlPath, 'POST');
-  return { deleted: user_invite_ids.length };
-};
-```
-
-⚠️ Open question: Playwright's `request.post({form: ...})` may not support repeated keys; verify with a test or use `data: 'user_invite_ids=1&user_invite_ids=2'` (URL-encoded string body).
-
-### `ocs_archive_chatbot` / `_collection` / `_pipeline` / `_collection_file`
-
-All four follow the same pattern but vary in HTTP method:
-
-| Atom | Method | Trailing slash | Success status |
-|---|---|---|---|
-| `ocs_archive_chatbot` | POST | yes | 302 (HTMX `HX-Redirect`) |
-| `ocs_archive_collection` | DELETE | yes | 200 (empty body) |
-| `ocs_archive_pipeline` | DELETE | yes | 200 (empty body) |
-| `ocs_delete_collection_file` | POST | no (!) | 200 (empty body) |
-
-- **CSRF:** all four require token; scrape from a GET to the resource's detail page (e.g. `/a/<slug>/chatbots/<pk>/`).
-- **Permissions:** `LoginAndTeamRequiredMixin` + `PermissionRequiredMixin(permission_required=...)`.
-- **Soft-delete:** chatbot/collection/pipeline set `is_archived=True` via `.archive()`. Collection file is HARD-deleted via async task.
-
-**View source references:**
-- chatbot: `apps/experiments/views/experiment.py:archive_chatbot` — `@require_POST` + `@login_and_team_required` + `@permission_required("experiments.delete_experiment")`
-- collection: `apps/documents/views.py:DeleteCollection` — `View.delete()` method, calls `collection.archive()`
-- pipeline: `apps/pipelines/views.py:DeletePipeline` — `View.delete()` method, calls `pipeline.archive()`
-- collection file: `apps/documents/views.py:delete_collection_file_view` — `@require_POST` + `@transaction.atomic()`
-
-### `commcare_list_apps`
-
-- **JSON REST.** Reuses the PlaywrightSession cookie jar (session cookie path) since the same auth covers `commcare_delete_app`.
-- **URL:** `GET /a/<domain>/api/v0.4/application/` (TaskPie resource defined in `corehq/apps/api/resources/v0_4.py:ApplicationResource`). The unscoped `/api/v0.4/application/?domain=<domain>` form returns 404 from Django URL routing even though TaskPie accepts a `domain` query param — the resource is mounted only under the `/a/<domain>/` prefix.
-- **Response (JSON):** `{ objects: [{ id, name, version, is_released, built_on, modules, versions }], meta: {...} }`
-- **Auth:** `LoginAndDomainAuthentication(allow_session_auth=True)` — session cookies work. API key (`Authorization: ApiKey ...`) also works on this resource if needed.
-- **For sweep:** we only need `id`, `name`, `doc_type`. The domain is fixed by the URL path.
-
-### `commcare_delete_app`
-
-- **HTML form POST.** Cannot use API key — requires session cookie + CSRF (`@require_can_edit_apps` + Django middleware).
-- **URL:** `POST /a/<domain>/apps/delete_app/<app_id>/`
-- **CSRF flow (mirror `commcare_make_build`):**
-  1. GET `/a/<domain>/apps/` to populate `csrftoken` cookie.
-  2. Extract token from cookie jar.
-  3. POST `/a/<domain>/apps/delete_app/<app_id>/` with `X-CSRFToken` header.
-- **Form body:** empty (app_id comes from URL).
-- **Decorators:** `@no_conflict_require_POST` + `@require_can_edit_apps`.
-- **Response on success:** HTTP 302 redirect to domain dashboard (`HttpResponseRedirect(reverse(DomainDashboardView.urlname, args=[domain]))`).
-- **Soft-delete:** sets `Application.doc_type` to `Application-Deleted`, creates `DeleteApplicationRecord` for undo. Restore via `undo_delete_app/<record_id>/`.
-
-### `labs_delete_record`
-
-- **REST with OAuth2 Bearer** — same token (`LABS_MCP_TOKEN`) the proxy already uses for `/mcp/`, but hits a different endpoint.
-- **URL:** `POST /export/labs_record/` (NB: HTTP method is POST despite "delete" semantics; the view is a `ListCreateAPIView` with a custom `delete()` method — but the dispatching is by POST verb with the operation implied by the call shape)
-- **Body:** `[{"id": <int>}]` — array of `{id}` objects (single-object form also accepted; the view normalizes).
-- **Auth header:** `Authorization: Bearer ${LABS_MCP_TOKEN}` (standard `oauth2_provider` + `TokenHasScope` with `required_scopes = ["export"]`).
-- **No type discriminator needed** — lookup is by primary key alone (`LabsRecord.objects.filter(pk__in=ids).delete()`).
-- **Permissions:** OAuth2 "export" scope + record-scope check (`_check_edit_permissions()` walks opportunity_id/program_id/organization_id from the request data, but for delete the scope check is on the record being deleted — non-owner-with-scope can delete; hard-delete, no audit trail).
-- **Response on success:** HTTP 200, empty body.
-
-**Proxy implementation:**
-The labs MCP proxy (`mcp/connect-labs-server.ts`) currently forwards every JSON-RPC frame unchanged. To add `labs_delete_record` as a LOCAL tool:
-
-1. Define the tool schema once at module scope:
-   ```typescript
-   const LOCAL_TOOLS = [{
-     name: 'labs_delete_record',
-     description: 'Hard-delete a LabsRecord by primary key. Covers solicitations, funds, reviews, and responses (all backed by the same LabsRecord table; type discriminator not required).',
-     inputSchema: {
-       type: 'object',
-       properties: { id: { type: 'integer' } },
-       required: ['id'],
-     },
-   }];
-   ```
-2. In the frame handler:
-   - If `method === 'tools/list'`: forward upstream as usual, then in the response merge `LOCAL_TOOLS` into `result.tools`.
-   - If `method === 'tools/call'` and `params.name === 'labs_delete_record'`: do NOT forward. Make a direct REST call:
-     ```typescript
-     const res = await fetch(`${LABS_BASE_URL}/export/labs_record/`, {
-       method: 'POST',
-       headers: {
-         'Authorization': `Bearer ${token}`,
-         'Content-Type': 'application/json',
-       },
-       body: JSON.stringify([{ id: params.arguments.id }]),
-     });
-     ```
-     Return an MCP `tools/call` result frame.
-   - Otherwise: forward unchanged.
-3. `LABS_BASE_URL` defaults to `https://labs.connect.dimagi.com` (strip the `/mcp/` suffix from `LABS_MCP_URL` if set).
-
-## Per-PR file inventory (8 PRs, or N bundled-PR variants)
-
-Each atom PR touches these files:
-
-**Single Connect atom (`delete_unaccepted_flw_invites`):**
-- `mcp/connect/capability-map.ts` — add capability
-- `mcp/connect/client.ts` — add method signature
-- `mcp/connect/backends/playwright.ts` — implementation
-- `mcp/connect/backends/composite.ts` — passthrough to playwright
-- `mcp/connect/backends/rest.ts` — stub
-- `mcp/connect-server.ts` — register MCP tool
-- `test/mcp/connect/unit/playwright-fallbacks.test.ts` — add test cases (or new file)
-- `skills/sweep-connect/SKILL.md` — promote FLW invites from report-only to auto-delete
-- `agents/sweep.md` — update coverage matrix
-
-**Each OCS archive atom** follows the same 9-file pattern in `mcp/ocs/`. The 4 archive atoms could ship as one PR (all touch the same files; differ only in URL/method).
-
-**HQ atoms (list_apps + delete_app):**
-- `mcp/connect/backends/commcare.ts` — both implementations
-- `mcp/connect-server.ts` — register both MCP tools
-- `test/mcp/connect/unit/` — new test files
-- `skills/sweep-hq/SKILL.md` — promote from stub to operational
-- `agents/sweep.md` — update coverage matrix
-
-**Labs atom:**
-- `mcp/connect-labs-server.ts` — extend with local-tool routing
-- `test/mcp/connect-labs/` — new test file (does this dir exist? check)
-- `skills/sweep-labs/SKILL.md` — promote LabsRecord types from report-only to auto-delete
-- `agents/sweep.md` — update coverage matrix
-
-## Recommended ship order
-
-1. **`connect_delete_unaccepted_flw_invites`** — smallest, well-researched, demonstrates the pattern.
-2. **`labs_delete_record`** — single-file proxy extension; high value (covers 4 product types with one atom).
-3. **`commcare_delete_app` + `commcare_list_apps`** — promotes HQ from stub to operational.
-4. **4 OCS archive atoms** — bundle into one PR since they all touch the same OCS MCP files and follow the same pattern.
-
-## Out of scope
-
-- `connect_delete_opportunity` — no upstream view exists. Would require a Django PR to commcare-connect first.
-- HQ build / multimedia delete — no upstream support at all. Permanent gap; document in sweep report.
-- Connect program / payment unit delete — no upstream support. Permanent gap; admin-UI link only.
diff --git a/docs/superpowers/specs/2026-05-15-ace-sweep-design.md b/docs/superpowers/specs/2026-05-15-ace-sweep-design.md
deleted file mode 100644
index 2f699cb1..00000000
--- a/docs/superpowers/specs/2026-05-15-ace-sweep-design.md
+++ /dev/null
@@ -1,171 +0,0 @@
-# `/ace:sweep` — Cross-System Orphan Cleanup
-
-**Status:** design • **Date:** 2026-05-15 • **Owner:** jjackson
-
-## Problem
-
-Every `/ace:run` creates artifacts across many systems: Drive folders, Connect programs/opportunities/payment-units, OCS chatbots/collections/sessions, CommCare HQ apps + builds, connect-labs workflows/pipelines/synthetic opps. Most have no in-line teardown.
-
-Jon's current habit is to delete the Drive folder when an opp is done, which leaves the target-system artifacts orphaned — they've lost their reference back to ACE, but they remain alive in Connect, OCS, HQ, and labs. Orphan accumulation degrades each system's signal-to-noise and burns through quotas (OCS chatbots, HQ app slots).
-
-The goal: a **repeatable sweeping pass** that finds and removes orphaned artifacts ACE has created, run one system at a time, with human triage.
-
-## Non-goals
-
-- **Not** real-time / per-run cleanup. A later phase could add `/ace:closeout`-driven teardown; that's separate.
-- **Not** authoritative deletion of non-ACE artifacts. ACE-likely is a fingerprint judgment, not a guarantee — humans confirm.
-- **Not** historical recovery. Trashed Drive items survive 30 days; deleted Connect/OCS/HQ items are gone.
-
-## Core mechanism: live-set diff
-
-The safety mechanism is structural, not pattern-based:
-
-1. **Live set** = every external identifier still referenced by an opp visible in Drive. Walk `ACE/<opp>/`; for each opp, parse `opp.yaml` + every `runs/<run-id>/run_state.yaml` and harvest IDs into one index.
-2. **System inventory** = everything ACE's credentials can see in a target system.
-3. **Orphans** = (system inventory) − (live set).
-4. **ACE-fingerprint score** is applied *only* to orphans, to help the human triage which of them ACE plausibly created vs. ones that pre-date ACE / belong to a different agent / are real human-created artifacts.
-
-Crucially: deleting the Drive folder is exactly what marks a thing as "abandoned" — it's the user's existing signal. The sweep operationalizes that signal.
-
-## Architecture
-
-### New components
-
-```
-agents/
-  sweep.md                      # procedure doc; orchestrates live-set → probe → triage → execute
-
-skills/
-  sweep-live-set/SKILL.md       # Drive walk → live-set.yaml
-  sweep-drive/SKILL.md          # Drive sweep (and validates live-set extractor)
-  sweep-connect/SKILL.md        # Connect sweep (PR 2)
-  sweep-ocs/SKILL.md            # OCS sweep (PR 3)
-  sweep-hq/SKILL.md             # CommCare HQ sweep (PR 4)
-  sweep-labs/SKILL.md           # connect-labs sweep (PR 5)
-
-commands/
-  sweep.md                      # /ace:sweep [system]
-
-lib/
-  sweep-fingerprint.ts          # ACE-fingerprint scoring rules (shared)
-  sweep-report.ts               # markdown table renderer
-```
-
-`sweep.md` must be a procedure doc (not a subagent) because it dispatches sub-skills.
-
-### Per-system skill contract
-
-Each `sweep-<system>` skill:
-
-1. Reads `live-set.yaml`.
-2. Lists everything in its system via existing `list_*` atoms.
-3. Diffs → orphan candidates.
-4. Scores each orphan via `sweep-fingerprint.ts`:
-   - **high** — name matches a known ACE pattern (e.g. `CRISPR-*`, golden-template clone signature) AND created in an ACE activity window.
-   - **medium** — one of the two signals.
-   - **low** — neither, but visible to ACE's account.
-5. Emits `runs/sweep-<timestamp>/<system>-orphans.md` (the triage report) and `<system>-delete-plan.yaml` (initially empty; populated as user approves chunks).
-6. Executor pass: reads `delete-plan.yaml`, calls auto-delete atoms where available, or emits a manual punch list for systems without delete atoms.
-
-### Capability matrix (per product, per system)
-
-Per-system research (2026-05-15, reading upstream source on GitHub) produced the following. The pattern is consistent across systems: most products have **no REST DELETE API**, but **do have HTML/web-UI POST endpoints** that we can drive from a new Playwright-backed atom — the same architectural split ACE already uses for `connect-*` and `ocs-*` writes.
-
-| System | Product | Existing ACE atom | Upstream mechanism | Backend | Plan |
-|--------|---------|-------------------|--------------------|---------|------|
-| Drive | folder / doc / sheet / slide | `drive_trash_file` | `files.update(trashed:true)` | REST | ✅ use existing |
-| connect-labs | workflow | `workflow_delete` | REST DELETE | REST | ✅ use existing |
-| connect-labs | pipeline | `pipeline_delete` | REST DELETE | REST | ✅ use existing |
-| connect-labs | synthetic opp | `synthetic_disable` | `enabled=False` | REST | ✅ use existing (soft) |
-| connect-labs | solicitation | — | `LabsRecordDataView.DELETE` ({id: pk}) | REST | **new atom** `labs_delete_record(type, id)` (generic, covers 4 products) |
-| connect-labs | fund | — | same generic endpoint | REST | covered by above |
-| connect-labs | review | — | same generic endpoint | REST | covered by above |
-| connect-labs | response | — | same generic endpoint | REST | covered by above |
-| connect-labs | workflow run | — | Django ORM only? | unclear | research as part of PR 5 |
-| Connect | opportunity | — | internal `delete_opportunity()` cascading helper, HTML view; also `active=False` toggle | Playwright (POST) | **new atom** `connect_delete_opportunity` |
-| Connect | program | — | no delete, no inactive | — | ❌ upstream gap; report-only |
-| Connect | payment unit | — | no delete, no inactive | — | ❌ upstream gap; report-only |
-| Connect | LLO invite (program application) | `connect_accept_program_application` | status mutation only | — | ❌ no delete; status update only |
-| Connect | FLW invite | — | HTML view `delete_user_invites/`, unaccepted-only | Playwright (POST) | **new atom** `connect_delete_unaccepted_flw_invites` |
-| OCS | chatbot (experiment) | — | `/team/<slug>/chatbots/<id>/delete/` → `experiment.archive()` (sets `is_archived=True`) | Playwright (POST) | **new atom** `ocs_archive_chatbot` |
-| OCS | collection | — | `/team/<slug>/collections/<id>/delete/` → `collection.archive()` + async file cleanup | Playwright (POST) | **new atom** `ocs_archive_collection` |
-| OCS | pipeline | — | `/team/<slug>/pipelines/<id>/delete/` → `pipeline.archive()` | Playwright (POST) | **new atom** `ocs_archive_pipeline` |
-| OCS | session | `ocs_end_session` | `POST /api/sessions/<id>/end_experiment_session/` (REST!) | REST | ✅ use existing |
-| OCS | source material / collection file | — | `/team/<slug>/collections/<id>/files/<file_id>/delete` (hard delete via async task) | Playwright (POST) | **new atom** `ocs_delete_collection_file` |
-| OCS | chatbot version | — | no independent delete; cascades from chatbot archive | — | covered by `ocs_archive_chatbot` |
-| CommCare HQ | application | — | `POST /a/{domain}/apps/delete_app/{app_id}/` → soft-delete (`doc_type` → `Application-Deleted`), restorable via `undo_delete_app` | Playwright (POST) | **new atom** `commcare_delete_app` |
-| CommCare HQ | build | — | no delete, only `is_released` toggle | — | ❌ upstream gap (builds immutable); report-only |
-| CommCare HQ | multimedia | — | no scalable delete; only logo removal | — | ❌ upstream gap; report-only |
-| Nova | app (blueprint DB) | `mcp__nova__delete_app` | local DB | REST | ✅ local to Nova, runs as part of sweep but doesn't touch HQ |
-
-**New atoms to build (one per system PR):**
-
-- **connect-labs (PR 5):** `labs_delete_record(type, id)` — one atom covers solicitation/fund/review/response via the generic `LabsRecordDataView.DELETE`.
-- **Connect (PR 2):** `connect_delete_opportunity`, `connect_delete_unaccepted_flw_invites` — Playwright POSTs to existing HTML views.
-- **OCS (PR 3):** `ocs_archive_chatbot`, `ocs_archive_collection`, `ocs_archive_pipeline`, `ocs_delete_collection_file` — Playwright POSTs to existing archive views.
-- **CommCare HQ (PR 4):** `commcare_delete_app` — Playwright POST to `delete_app` HQ view. Lives alongside the other `commcare_*` atoms in `mcp/connect/backends/commcare.ts`.
-
-**Documented upstream gaps (report-only — sweep lists them, human can't delete):**
-
-- Connect: programs, payment units, LLO invites (status mutations only)
-- CommCare HQ: builds, multimedia
-- These get a separate "upstream-blocked" section in the sweep report. File upstream issues to track each gap so we can revisit when an API delete lands.
-
-**Architectural note:** Every new atom follows the existing Playwright-write-CSRF-form pattern in `mcp/{connect,ocs}/backends/playwright.ts` — fetch the page, scrape CSRF token, POST the form. The executor calls the atom by canonical name; backend choice is invisible to the sweep skills.
-
-### ACE-fingerprint heuristics (initial set)
-
-- **Connect program/opportunity:** name matches `^CRISPR-.*`, or description contains `ACE-generated`, or organization_name is one of ACE's known seed orgs.
-- **OCS chatbot:** cloned from `OCS_GOLDEN_TEMPLATE_ID` (chatbot exposes parent on `ocs_get_chatbot`); name starts with `ACE-` or matches opp-name shape.
-- **OCS collection:** ditto, plus structural — single-collection chatbots only.
-- **HQ app:** name contains "Learn" or "Deliver" AND domain is `connect-ace-prod`, project space owned by `ace@dimagi-ai.com`.
-- **labs workflow/pipeline:** created_by is ACE service account; opportunity_ids reference Connect IDs in live-set or orphans.
-
-Each heuristic is a separate function in `sweep-fingerprint.ts` so they can be tuned independently as we observe real orphan distributions.
-
-## UX
-
-```
-$ /ace:sweep
-Which system? drive | connect | ocs | hq | labs | all
-
-$ /ace:sweep drive
-[1/3] Building live set from Drive...
-      Found 14 active opps, 47 active runs, 312 referenced IDs.
-
-[2/3] Listing Drive root ACE/...
-      Found 89 folders. 14 active, 75 orphan candidates.
-
-[3/3] Triage report → ACE/_sweep/2026-05-15T18-22Z/drive-orphans.md
-      high   42  (will trash on approval)
-      medium 21  (review individually)
-      low    12  (likely human-created — skipping by default)
-
-Approve high-confidence batch (42 items)? [y/N]
-```
-
-Per-system runs are independent; you can sweep Drive today and Connect next week.
-
-## Phasing
-
-Research pass complete (2026-05-15). All per-system findings codified in the matrix above. Implementation phasing:
-
-- **PR 1:** `sweep-live-set` + `sweep-drive` + `sweep.md` procedure doc + `/ace:sweep` command + `sweep-fingerprint.ts` skeleton + `sweep-report.ts`. End-to-end works for Drive. No new MCP atoms.
-- **PR 2 (Connect):** `sweep-connect` skill + new Playwright atoms `connect_delete_opportunity`, `connect_delete_unaccepted_flw_invites`. Programs/payment-units/program-applications surface in sweep report as upstream-blocked.
-- **PR 3 (OCS):** `sweep-ocs` skill + new Playwright atoms `ocs_archive_chatbot`, `ocs_archive_collection`, `ocs_archive_pipeline`, `ocs_delete_collection_file`. Sessions auto-end via existing `ocs_end_session`.
-- **PR 4 (HQ):** `sweep-hq` skill + new Playwright atom `commcare_delete_app` in `mcp/connect/backends/commcare.ts`. Builds and multimedia surface as upstream-blocked.
-- **PR 5 (labs):** `sweep-labs` skill + new REST atom `labs_delete_record(type, id)` (generic, covers solicitation/fund/review/response). Workflows/pipelines/synthetic auto-delete via existing atoms.
-
-Each per-system PR is independent after PR 1. Upstream-blocked items each get an issue filed against the relevant Dimagi repo so we can revisit when an API delete is added.
-
-## Open questions
-
-- **Drive trash vs delete.** MVP uses `drive_trash_file` (30-day recovery). If we ever want hard delete (e.g. quota pressure), add a `--purge` flag that calls `drive.files.delete()` directly on items already in trash.
-- **Sweep report retention.** Sweep reports themselves accumulate under `ACE/_sweep/<timestamp>/`. We should set a TTL — proposal: sweep reports older than 90 days are auto-trashed by the next sweep run.
-- **Cross-run live-set caching.** Building the live set requires walking every opp's runs. For ~100 opps this is fine. If it gets slow, cache last build with an ETag-style invalidation. Not MVP.
-
-## What this design explicitly avoids
-
-- **A cleanup phase in `/ace:run`.** That's a different problem (real-time teardown). Sweep is for the backlog.
-- **A keep-list / tombstone marker.** Live-set diff already gives us "what's still referenced"; tombstones would be redundant.
-- **Auto-deletion of high-confidence items without human approval.** Even high-confidence orphans go through the approval gate. Safety > speed.
diff --git a/docs/superpowers/specs/2026-05-21-work-order-skill-design.md b/docs/superpowers/specs/2026-05-21-work-order-skill-design.md
deleted file mode 100644
index 3a25da87..00000000
--- a/docs/superpowers/specs/2026-05-21-work-order-skill-design.md
+++ /dev/null
@@ -1,233 +0,0 @@
-# Work-Order Skill — Phase 1 Design
-
-**Date:** 2026-05-21
-**Status:** Approved (design phase)
-**Phase:** 1 (`idea-to-design`)
-**Skills introduced:** `pdd-to-work-order`, `pdd-to-work-order-qa`, `pdd-to-work-order-eval`
-
-## Summary
-
-Phase 1 currently produces the PDD (`idea-to-pdd.md`) and a structured `decisions.yaml` log of load-bearing default-decisions. When an opportunity is going straight to a pre-known LLO without a competitive selection, Dimagi also needs a contractual **Work Order** — a signable document derived from the same source-of-truth (the PDD and the decisions log).
-
-This skill adds that artifact. Generic by default — partner identity is a placeholder unless an LLO is supplied as input. Phase 8 still publishes a solicitation; the work order is parallel, not a replacement.
-
-## Motivation
-
-- Today, work orders are authored manually outside the lifecycle. Numerics (rate, FLW count, geographic coverage, period of performance, NTE budget) get re-derived by the human from the PDD with no traceability.
-- The `decisions.yaml` + `decisions-render` pattern already gives us a structured audit surface for every load-bearing choice. The work order is a natural consumer of that surface.
-- Reference example: `[Shared] DFHF - RDT Work Order #3` (Malaria RDT POC Sampling Pilot) — sections, payment schedule, RACI table, ethics clauses, signature blocks.
-
-## Scope
-
-In scope:
-- New producer skill `pdd-to-work-order` and its `*-qa` / `*-eval` companions.
-- Integration into the `idea-to-design` agent as Step 2 (after PDD QA+eval).
-- New `wo-*` rows appended to `decisions.yaml` for work-order-specific load-bearing fields.
-- New `phases.design.products.work_order` write-back to `run_state.yaml` (typed handoff: `{title, file_id}`).
-- Archetype branching across `atomic-visit`, `focus-group`, `multi-stage`.
-
-Out of scope:
-- Replacing Phase 8 solicitation. Solicitation still runs; work order is parallel.
-- Legal review automation. The skill produces a draft; humans review and sign.
-- Live signature workflow / e-signature integration.
-- Reading the work order from any downstream phase. Phase 2+ continue to read the PDD; the work order is a human-facing artifact only.
-
-## Skill: `pdd-to-work-order`
-
-### Identity
-- **Phase:** `1-design`
-- **Skill name:** `pdd-to-work-order`
-- **Mode behavior:** runs in both `auto` and `review` modes; review mode pauses for approval after the gdoc is written.
-- **Trigger:** auto in `/ace:run`, every run, after `idea-to-pdd` ships and passes QA. Also invokable via `/ace:step pdd-to-work-order <opp>/<run-id>`.
-
-### Inputs
-| Source | Artifact | Used for |
-|---|---|---|
-| Phase 1 | `1-design/idea-to-pdd.md` | scope, deliverables, timeline, target population, success metrics, evidence model |
-| Phase 1 | `decisions.yaml` | load-bearing values (rate, FLW count, language, etc.) |
-| Run root | `inputs-manifest.yaml` (read at run start by orchestrator) | optional reference for partner identity if it was supplied |
-| Operator (optional) | `--llo <slug>` flag | overrides partner-name placeholder |
-
-All reads use parallel `drive_read_file`; context is trusted across steps per the agent's read-redundancy rules.
-
-### Products
-- `ACE/<opp>/runs/<run-id>/1-design/pdd-to-work-order.gdoc` — the contract, formatted as a clean Google Doc.
-- `run_state.yaml.phases.design.products.work_order` — `{title, file_id}` typed handoff. This skill is the sole writer.
-- `ACE/<opp>/runs/<run-id>/decisions.yaml` — appended `wo-*` rows (merge-only, never overwrites existing rows).
-
-### Decisions log convention
-
-The skill follows the existing bar criterion (load-bearing AND maps to a known surface). Work-order-specific rows use the `wo-` prefix to avoid collisions with future Phase 8 solicitation rows. Rows from earlier in Phase 1 (e.g. `payment-rate`, `flw-count`, `working-language`, `budget-plausibility`, `candidate-llo-roster`) are read as-is — never duplicated, never renamed.
-
-Common `wo-*` rows:
-
-| ID | Question | Map to surface |
-|---|---|---|
-| `wo-number` | Sequence number for this WO under the MSA | Header (placeholder if unknown) |
-| `wo-period-of-performance` | Start + end dates | Header + Timeline section |
-| `wo-total-not-to-exceed-usd` | Total NTE budget cap | Payment Terms section |
-| `wo-payment-schedule-split` | Milestone payment percentages (e.g., 40/60) | Payment Schedule sub-table |
-| `wo-mobilization-advance-pct` | Mobilization advance % of cap | Payment Schedule row 1 |
-| `wo-reporting-cadence` | Frequency of progress reports (default: weekly) | Reporting sub-section |
-| `wo-ethics-scope` | Operational-only vs patient-level | Ethics section |
-| `wo-data-storage-region` | Server region for data storage (default: US) | Data Handling section |
-
-Rows go in with `status: open` when the AI's inference is a best-guess (e.g., dates inferred from Timeline section), `status: applied` when directly derivable from existing decisions or PDD.
-
-### Process
-
-1. **Read inputs in parallel** — PDD + `decisions.yaml` + run-folder context. Trust context across subsequent steps.
-2. **Determine archetype** from PDD frontmatter. Branches Section content (see "Section template" below).
-3. **Resolve contractual fields.** For each work-order field:
-   - (a) If a relevant `decisions.yaml` row exists from an earlier skill, use it.
-   - (b) If inferable from PDD body (Timeline → period of performance; Success Metrics + Budget → NTE; etc.), use that and emit a `wo-*` row capturing the inference.
-   - (c) If unknowable (partner name absent, WO# unknown, MSA date unknown), insert a bracketed placeholder like `[Partner Name]` and emit a `wo-*` row with `status: open` + `notes` telling the human what to fill in.
-4. **Append `wo-*` rows to `decisions.yaml`** via `update_yaml_file` with merge-only semantics. Never overwrite existing rows.
-5. **Render work-order template** to a Google Doc:
-   - `docs_copy_template` from a stored template (file_id in `.env` as `WORK_ORDER_TEMPLATE_ID`, fallback path documented in the skill).
-   - `docs_batch_update` with token replacements (`{{wo_number}}`, `{{title}}`, `{{period_of_performance}}`, `{{scope_body}}`, `{{payment_schedule_table}}`, `{{roles_raci_table}}`, `{{signature_dimagi}}`, `{{signature_partner}}`, etc.).
-   - Sections that branch on archetype receive archetype-specific token bodies generated by the skill.
-6. **Write `run_state.yaml.phases.design.products.work_order`** — `{title, file_id}` via `update_yaml_file` with `merge: 'two-level'`.
-7. **Invoke `decisions-render`** so the human-readable `decisions.gdoc` refreshes with the new `wo-*` rows.
-
-### Section template
-
-Fixed structure mirroring the malaria example, with archetype-aware sub-content:
-
-1. **Header** — WO Number, WO Date (today, ISO), Title (from PDD H1), Period of Performance.
-2. **Background** — synthesized from PDD's Problem Statement + Intervention Design. References the downstream consumer if `named-downstream-consumer` decision is named (e.g., GiveWell RFI).
-3. **Scope of Work** — archetype-branched:
-   - `atomic-visit`: per-visit data capture, photo standardization, GPS, sample handling.
-   - `focus-group`: per-session facilitation, attestation form submission, gdoc write-up obligations.
-   - `multi-stage`: per-stage sub-section with each stage's archetype-shaped scope.
-4. **Geographic Coverage** — from PDD Target Population. If the PDD names states/regions, use them; otherwise `[Geographic Coverage — Partner to propose]`.
-5. **Deliverables and Verification**:
-   - 5.1 Primary Deliverable (target count + verification criteria from PDD Evidence Model).
-   - 5.2 Definition of a Verified Unit (Layer A criteria from Evidence Model).
-   - 5.3 Reporting Deliverables (weekly + end-of-pilot; `wo-reporting-cadence`).
-6. **Timeline and Milestones** — from PDD Timeline, week-by-week table.
-7. **Payment Terms**:
-   - 7.1 Total Not-to-Exceed (from `wo-total-not-to-exceed-usd`).
-   - 7.2 Payment Schedule (table from `wo-payment-schedule-split` + `wo-mobilization-advance-pct`; per-unit rate from existing `payment-rate` / `per-session-rate` rows).
-   - "Dimagi will pay only for verified units" clause.
-8. **Roles and Responsibilities** — archetype-derived RACI table (Dimagi vs Partner) covering protocol design, app configuration, FLW recruitment/training, field ops, permissions, transport, verification, reporting.
-9. **Permissions, Ethics, Compliance** — template clauses; ethics scope from `wo-ethics-scope`; private-retail engagement clauses included only if PDD scope touches private retail.
-10. **Data Handling** — standard Dimagi clauses; Data Subjects + Personal Information pre-filled from PDD's data-subject treatment.
-11. **Signatures** — Dimagi block prefilled (Lucina Tse, COO, address); partner block left blank for `[Partner Name]`, `[Title]`, `[Date]`, `[Address for correspondence]`.
-12. **Annexures** — Annexure A pointer to the PDD (file_id from `phases.design.products.pdd`). Placeholder for opp-specific annexures (e.g., sampling protocols).
-
-### Archetype branching
-
-| Section | `atomic-visit` | `focus-group` | `multi-stage` |
-|---|---|---|---|
-| Scope of Work | per-visit, photos, GPS, samples | per-session, attestation form, gdoc | per-stage with each stage's archetype shape |
-| Verification | photo + GPS Layer A | attestation submission + gdoc receipt Layer A/B | per-stage verification criteria |
-| Payment unit | per visit | per session (facilitator + notetaker + training stipend) | per stage, may mix per-visit and per-session |
-| Roles RACI | FLW recruitment + supervision | Facilitator + notetaker recruitment + practice-session sign-off | per-stage RACI |
-| Ethics | operational data only | consent + audio recording protocol | stage-dependent |
-
-## Skill: `pdd-to-work-order-qa`
-
-Static structural checks. Uniform verdict shape with `failures[].auto_fix_hint` for the producer-retry loop.
-
-Checks:
-
-1. All required headings present (sections 1–12 above).
-2. Required `wo-*` rows present in `decisions.yaml`: `wo-number`, `wo-period-of-performance`, `wo-total-not-to-exceed-usd`, `wo-payment-schedule-split`.
-3. Period of Performance has start + end dates (or explicit placeholder text — not silently missing).
-4. Payment schedule percentages sum to 100%.
-5. Total NTE present (number or placeholder, not silently missing).
-6. Signature blocks present for both parties.
-7. Archetype-appropriate scope language:
-   - `atomic-visit`: scope references "per visit" or equivalent + photo + GPS.
-   - `focus-group`: scope references attestation form + gdoc.
-   - `multi-stage`: at least one per-stage subsection.
-8. No leaked AI scaffolding markers (`<<TBD>>`, `<<unclear>>`, `<<>>`).
-
-Output: `1-design/pdd-to-work-order-qa_result.yaml`. `verdict: fail` triggers the producer-retry loop with `auto_fix_hint` per failure.
-
-## Skill: `pdd-to-work-order-eval`
-
-LLM-as-judge, quality-only re-grade. Skipped if QA verdict is `incomplete`. A `verdict: fail` here does NOT halt the run on its own — `[BLOCKER]` concerns pause per the orchestrator's Per-Mode Pause Matrix.
-
-Dimensions:
-
-1. **Contractual clarity** — could a partner sign without follow-up questions on scope, deliverables, or payment?
-2. **PDD alignment** — do scope / deliverables / timeline / payment trace back to the PDD?
-3. **Decisions traceability** — do contractual numerics match their corresponding `decisions.yaml` rows?
-4. **Verification realism** — are "verified unit" criteria measurable on the Connect platform?
-5. **Archetype fit** — does the contract shape match the declared archetype?
-
-Per-dimension `pass | partial | fail`. Two or more non-pass → `verdict: fail` for the rubric. Eval verdict written to `1-design/pdd-to-work-order-eval_verdict.yaml` per `lib/verdict-schema.ts`.
-
-## Agent integration
-
-`agents/idea-to-design.md` gains a new Step 2 block after Step 1.5:
-
-```
-### Step 2: PDD → Work Order
-Invoke the `pdd-to-work-order` skill.
-- Inputs: PDD + decisions.yaml (already in subagent context from Step 1 — do NOT re-read).
-- Output: 1-design/pdd-to-work-order.gdoc, products.work_order in run_state.yaml,
-  appended wo-* rows in decisions.yaml.
-- Gate (review mode): present the work-order URL for approval.
-
-### Step 2.4: Work-Order QA (structural pass/fail)
-Invoke `pdd-to-work-order-qa`. Same producer-retry loop semantics as Step 1.4.
-
-### Step 2.5: Work-Order eval (independent quality re-grade)
-Unless --no-evals AND QA pass, invoke `pdd-to-work-order-eval`.
-- Skipped if QA verdict is incomplete (eval mirrors with verdict: incomplete).
-```
-
-Agent frontmatter `skills:` array grows by one entry:
-
-```yaml
-skills:
-  - { name: idea-to-pdd, has_judge: true, qa_skill: idea-to-pdd-qa, eval_skill: idea-to-pdd-eval }
-  - { name: pdd-to-work-order, has_judge: true, qa_skill: pdd-to-work-order-qa, eval_skill: pdd-to-work-order-eval }
-```
-
-Phase summary at completion lists both PDD and work-order URLs.
-
-## Write-back contract
-
-Existing `phases.design.products.pdd` block is unchanged. New sibling:
-
-```yaml
-phases:
-  design:
-    products:
-      pdd:
-        title: ...
-        description: ...
-        file_id: ...
-      work_order:
-        title: "Work Order — <PDD Title>"
-        file_id: <gdoc-id>
-```
-
-`pdd-to-work-order` is the sole writer of `products.work_order`.
-
-## Template artifact
-
-A Google Docs template lives at a known file_id (stored in `.env` as `WORK_ORDER_TEMPLATE_ID`, sourced from 1Password vault). Template content mirrors the malaria example with token markers for replacement. The template file_id is per-deployment configuration, not committed in the repo. Bootstrapping the template is documented in `playbook/integrations/work-order-template.md` (or a similar location) — out of scope for this design doc.
-
-## Dry-run behavior
-
-When `--dry-run` is active:
-- Write the work-order gdoc as normal.
-- Skip emailing any admin-group summary if relevant.
-- State tracks as `dry-run-success`.
-
-## Resolved decisions
-
-- **Template provisioning:** bootstrapped via `scripts/bootstrap-work-order-template.ts` (mirrors `scripts/bootstrap-ocs-golden-template.ts`). One-time per deployment; writes the resulting template file_id to `.env` as `WORK_ORDER_TEMPLATE_ID`.
-- **Re-runs:** each invocation creates a NEW gdoc. Older work-order drafts remain in the run folder as `pdd-to-work-order.gdoc`, `pdd-to-work-order-2.gdoc`, etc. Sole writer of `products.work_order` in `run_state.yaml` updates the pointer to the latest one. New `wo-*` decision rows are still merge-only (no duplicates).
-- **Signature block:** Dimagi signatory is hardcoded to Lucina Tse, COO (with the Cambridge MA address from the malaria example) for now. Revisit if/when a second signatory is needed.
-
-## Non-goals
-
-- This is a Phase 1 artifact only. No downstream phase reads it.
-- No e-signature workflow. The gdoc is a draft for humans to download, route, and sign offline.
-- No automatic emailing of the WO to a partner. Phase 8 / Phase 9 handle LLO contact.
diff --git a/package.json b/package.json
index eddbfdad..b7ff3858 100644
--- a/package.json
+++ b/package.json
@@ -1,6 +1,6 @@
 {
   "name": "ace",
-  "version": "0.13.330",
+  "version": "0.13.331",
   "description": "AI Connect Engine - orchestrator for building Connect Opps using AI",
   "type": "module",
   "scripts": {
diff --git a/skills/README.md b/skills/README.md
index df256765..aa8a1c98 100644
--- a/skills/README.md
+++ b/skills/README.md
@@ -2,7 +2,7 @@
 
 This file is the contract for authoring SKILL.md files in `skills/`. Read it before adding a new skill or making non-trivial edits to an existing one. Existing skills are the source of truth — if this contract drifts from them, fix the contract or fix the skills, but they should agree.
 
-ACE skills are prompt-based capability definitions. Each one handles one step of the CRISPR-Connect process (see `docs/superpowers/specs/2026-04-01-ace-design.md`). Skills are stateless — they read from and write to the opportunity's Google Drive folder (`ACE/<opp-name>/`) and call MCP tools for external system access. The agents in `agents/` are what dispatch to skills.
+ACE skills are prompt-based capability definitions. Each one handles one step of the CRISPR-Connect process (see the repo-root `CLAUDE.md` for the phase pipeline and `agents/orchestrator-reference.md` for state schemas). Skills are stateless — they read from and write to the opportunity's Google Drive folder (`ACE/<opp-name>/`) and call MCP tools for external system access. The agents in `agents/` are what dispatch to skills.
 
 ## File location and naming
 
@@ -253,7 +253,7 @@ When `--dry-run` is active:
 - State tracks as `dry-run-success`
 ```
 
-See `docs/superpowers/specs/2026-04-01-ace-design.md` § "Testing and Dry-Run Strategy" for the full dry-run model.
+The shape above is the full dry-run model — there's no longer a separate design doc for it.
 
 ### `## Failure Modes` (when the skill has typed errors worth enumerating)
 
diff --git a/skills/upload-transcript/SKILL.md b/skills/upload-transcript/SKILL.md
index c1213a54..ff6e4d01 100644
--- a/skills/upload-transcript/SKILL.md
+++ b/skills/upload-transcript/SKILL.md
@@ -51,9 +51,7 @@ Optional:
 ## Payload fields
 
 When invoked from the orchestrator with both `<opp>` and `<run-id>` in
-context (the multi-run layout introduced 2026-05-02 — see
-`docs/superpowers/specs/2026-05-02-ace-run-multi-run-revival-design.md`),
-send BOTH:
+context (the multi-run layout introduced 2026-05-02), send BOTH:
 
 - `opp_slug`: the opp folder name (e.g. `turmeric`).
 - `opp_run_id`: the run-id (e.g. `20260502-1830`).