TASKS — v0.1 (Phases 3–7)

Atomic work units for the v0.1 autonomous development loop, spanning Phases 3 through 7 of docs/design/roadmap.md. Each task has acceptance criteria; the loop must only mark a task done when bun run check:ci passes AND the acceptance criteria are met.

If a task is ambiguous or a criterion cannot be met without changing an architecture doc, stop and notify the user via Telegram (see CLAUDE.md autonomous-mode rules).

Conventions:

[ ] = not started
[/] = in progress (at most one at any time)
[x] = done and pushed to dev

v0.1 ship criterion: a user can npm pack && npm i -g ./*.tgz && a2a-bridge init && a2a-bridge daemon start, then drive Claude Code end-to-end from Gemini CLI (A2A), OpenClaw (ACP via acpx), Zed (ACP), or VS Code (ACP). All loop-automatable release artifacts are present; only credential-gated steps (npm publish, marketplace submission) remain for the human operator.

Phase 2 baseline carried into v0.1: 223 pass / 0 fail / 19 E2E filtered (bun run check:ci, 27 files, 532 expect calls) as of commit ed50f4c. Each phase footer below records the new baseline once that phase lands.

Phase 3 — Verification and delegation patterns

Phase 4 — RoomRouter and TaskLog

Phase 5 — ACP inbound (multi-client reach)

P5.1 — Add @agentclientprotocol/sdk dependency. Acceptance: bun add @agentclientprotocol/sdk; pinned version in package.json and bun.lock. bun run typecheck and bun run lint:deps green. No imports yet.
P5.2 — Scaffold runtime-daemon/inbound/acp/ layout. Acceptance: new files acp/index.ts (stub AcpInboundService implements IInboundService), acp/connection.ts (placeholder for the AgentSideConnection glue). inbound-factory.ts gains an "acp-stdio" kind that constructs AcpInboundService. lint:deps green; no behavior yet.
P5.3 — Implement the ACP initialize + newSession handlers. Acceptance: AcpInboundService.start(stream) opens an AgentSideConnection over the supplied stdio pair, advertises the agent's protocolVersion and minimum capabilities, and responds to newSession with a session id derived from the request. Unit tests use the SDK's in-memory pair to verify the handshake.
P5.4 — Bridge ACP prompt → ClaudeCodeGateway. Acceptance: incoming prompt calls extract the user text, forward via gateway.startTurn, and stream each chunk event back as a session/update notification. Final complete produces a terminal session/prompt response with stopReason: "end_turn". Errors map to stopReason: "refusal" with the message in the notification. Unit test mocks the gateway and asserts the wire sequence.
P5.5 — Wire ACP cancel to gateway turn cancel. Acceptance: ACP cancel calls turn.cancel() on the active ClaudeCodeTurn. Pending prompt resolves with stopReason: "cancelled". Unit test covers the cancel path.
P5.6 — a2a-bridge acp CLI subcommand. Acceptance: src/cli/cli.ts recognizes acp as a subcommand that starts an AcpInboundService over process.stdin / process.stdout. Connects to the long-running daemon via the unix-socket control plane (auto-starting the daemon if missing, same heuristic the existing CLI uses). E2E test (filtered out of test:unit) spawns the subcommand and exercises the full ACP initialize → prompt → reply round trip via the SDK client.
P5.7 — README "Connect ACP clients" section. Acceptance: README gets a new section after "Connect Gemini CLI" with copy-paste config snippets for OpenClaw (acpx.config.agents), Zed (agent_servers), and VS Code's ACP plugin. Each snippet references a2a-bridge acp as the command. No other README sections touched.
P5.8 — SDK-level ACP integration test. Acceptance: new test in src/cli/acp-inbound.test.ts boots the ACP service against an in-memory stdio pair using the SDK's ClientSideConnection, sends a prompt, and asserts the client observes at least one streamed update plus a terminal response with stopReason: "end_turn". bun run check:ci green.

Phase 6 — Distribution and UX polish

Phase 7 — Release packaging

P7.1 — Upgrade .github/workflows/ci.yml. Acceptance: CI runs typecheck, lint:deps, test, build:plugin, AND scripts/smoke-tarball.sh on every PR. Matrix on Ubuntu and macOS. Caches bun deps. README badge points at the workflow.
P7.2 — Add .github/workflows/release.yml. Acceptance: tag-triggered workflow runs the full check, builds the tarball, attaches it to a draft GitHub release, and surfaces an npm-publish step gated on a manual production environment approval. Does NOT publish unattended.
P7.3 — ACP registry submission package. Acceptance: release/acp-registry/agent.json + icon.svg conforming to the public ACP registry schema. README at release/acp-registry/README.md explains the PR process the user follows to submit.
P7.4 — Claude Code marketplace submission package. Acceptance: release/marketplace/SUBMIT.md lists every artifact Anthropic requires (description, screenshots paths, plugin manifest validation steps) plus the submission URL. All referenced artifacts exist in the repo or have placeholder paths flagged for the user.
P7.5 — docs/release/publish.md runbook. Acceptance: step-by-step guide covering: bump check, CHANGELOG finalization, tag creation, GitHub Actions release run, manual npm publish (with --otp reminder), marketplace submission, ACP registry PR, post-release smoke test against the published package. Total length under 400 lines.
P7.6 — scripts/check-release-ready.sh. Acceptance: shell script (Bun-friendly) that asserts: version alignment across all manifests, CHANGELOG has an entry for the current version, bun run check:ci green, tarball builds and installs, all required release/ artifacts present. Exits non-zero on any failure. README references the script in the release workflow section.

Phase 8 — Real ACP → Claude Code routing (no mock)

Why this block lands before the first public 0.1.0 release. The v0.1 ACP inbound currently resolves prompt against an in-process EchoGateway — OpenClaw / Zed / VS Code users see Echo: <text> instead of a real Claude Code reply. Before a Join skill is useful (Phase 9) the a2a-bridge acp subprocess must actually reach the attached Claude Code session through the daemon. Unit tests may still use fakes; the CLI default path must never short-circuit to a mock or echo reply.

Plugin↔CC contract reference. The a2a-bridge plugin is itself a Channel (MCP claude/channel capability). As Phase 8 wires ACP turns through the plugin, the plugin must keep emitting notifications/claude/channel with identifier-safe meta keys ([a-z0-9_]+ only — hyphens and other non-word chars are silently dropped by Claude Code), keep its reply tool registered via ListToolsRequestSchema / CallToolRequestSchema, and keep an instructions string telling Claude how to route replies back. Source of truth: https://code.claude.com/docs/en/channels-reference. Full production reference: references/claude-plugins-official/external_plugins/telegram/server.ts (capability declaration at L369–396; permission_request handler at L405–430; reply / react / edit tool schemas at L432–504). Any Phase 8 change that alters plugin↔CC wire shape must be audited against both.

P8.1 — Control-plane wire format for ACP turns. Acceptance: new message variants in transport/ (or the daemon's control-message types) covering acp_turn_start, acp_turn_chunk, acp_turn_complete, acp_turn_error, and acp_turn_cancel. Typed on both ends; round-trip unit tests. Any meta / context fields on these frames that are eventually forwarded to notifications/claude/channel use identifier-safe keys only ([a-z0-9_]+); unit test rejects hyphenated or otherwise non-identifier keys at the boundary so CC never silently drops them.
P8.2 — Daemon-side handler for ACP turns. Acceptance: when an attached client sends acp_turn_start, the daemon calls inboundGateway.startTurn(text) on the shared DaemonClaudeCodeGateway and pipes the resulting chunk / complete / error events back to that client as the matching control-plane frames. One ACP connection can own at most one in-flight turn; acp_turn_cancel fires turn.cancel(). Unit test drives the handler with a stub gateway and asserts the frame sequence.
P8.2a — Permission-relay policy for ACP-triggered turns. Acceptance: decide and document how the plugin's notifications/claude/channel/permission_request handler behaves when the in-flight turn originated from an ACP client (ACP has no "chat to reply in" analogue for the five-letter verdict dance Channels specifies). Pick one and write the rationale into docs/design/architecture.md: (a) auto-allow, gated by a daemon-side policy flag such as A2A_BRIDGE_ACP_AUTO_PERMISSION; (b) bridge to ACP session/request_permission (if the SDK exposes it) so the ACP client is prompted; (c) auto-deny, turn ends with stopReason: "refusal". Integration test exercises a turn that triggers a tool-use permission prompt and asserts the chosen behavior end-to-end. References: Channels §Relay permission prompts; telegram reference handler at references/claude-plugins-official/external_plugins/telegram/server.ts:405-430.
P8.3 — DaemonProxyGateway in runtime-daemon/inbound/acp/. Acceptance: new class implementing ClaudeCodeGateway that opens a control-plane WS connection to the daemon and uses the Phase-8.1 wire format to relay turns. startTurn(text) returns a ClaudeCodeTurn whose chunk / complete / error events come straight from the WS frames. Reconnect logic mirrors the plugin-side DaemonClient. The chunk events that eventually propagate to the plugin's notifications/claude/channel emitter must round-trip without lossy key renaming — meta keys stay identifier-safe end-to-end.
P8.4 — Wire runAcp() onto DaemonProxyGateway (no echo fallback). Acceptance: src/cli/acp.ts default stdio path constructs DaemonProxyGateway after lifecycle.ensureRunning() resolves. When the daemon is unreachable the subcommand fails loudly (non-zero exit + friendly error helper from errors.ts) rather than falling back to echo. EchoGateway stays in the codebase but only test files import it; a lint:deps rule enforces this. Update acp-cli.test.ts to assert the end-to-end SDK round-trip lands a daemon-originated reply, not an echo.
P8.5 — End-to-end integration test with a stub Claude Code channel. Acceptance: new test that boots the real daemon, attaches a stub CC channel (via the existing DaemonClient seam) that replies with deterministic text, spawns a2a-bridge acp as a child process, drives it via the ACP SDK, and asserts the returned session/update text equals the stub CC reply verbatim. Runs under check:ci; covers the full real wire end-to-end.
P8.6 — Update smoke-e2e.sh to exercise the real ACP → CC path. Acceptance: the ACP half of scripts/smoke-e2e.sh no longer runs with A2A_BRIDGE_INBOUND_ECHO=1; it attaches a stub CC client to the daemon and asserts the smoke ACP prompt returns the stub's reply (not Echo: ...). The env-var hook stays but is documented as a test/debug only knob.
P8.7 — Fix hardcoded agentInfo.version: "0.0.1" in src/runtime-daemon/inbound/acp/index.ts. Acceptance: the ACP initialize response advertises the real package version (via the same JSON-import trick src/cli/cli.ts uses). Unit test asserts the response's agentInfo.version matches package.json. The same fix applies plugin-side: the MCP Server({ name, version }, ...) constructor in src/runtime-plugin/bridge.ts (or wherever the plugin instantiates its Server) must pull name and version from package.json rather than hardcoding — this matches the Channels-reference example and the telegram reference (server.ts:370: { name: 'telegram', version: '1.0.0' } is explicit and maintained in package.json). Unit test covers both the ACP initialize response and the plugin's advertised Server info.
P8.8 — Documentation sweep: no more "post-v0.1" caveats on ACP → CC. Acceptance: CHANGELOG.md's ## [0.1.0] block describes the real routing; README.md's Connect-OpenClaw / Zed / VS Code sections drop the echo-gateway warning; docs/guides/cookbook.md gains a fourth pattern example if natural; docs/design/architecture.md's ACP subsection promotes the DaemonProxyGateway from "planned" to "implemented."

Phase 9 — Join skill + first public release

Why this block caps v0.1. Once ACP → CC is real (Phase 8), the user can hand a single URL to Claude Code AND OpenClaw and each AI self-installs its side of the bridge. This is the developer-facing payoff of the whole v0.1 body of work.

P9.1 — Cut a draft GitHub release with the tarball. Acceptance: tag v0.1.0 pushed to GitHub; release.yml runs green on the matrix; the resulting draft release at github.com/firstintent/a2a-bridge/releases/tag/v0.1.0 carries the tarball. Release stays as a draft (do NOT publish-npm yet); the tarball URL is what the skill will reference.
P9.2 — docs/join.md — cross-bridge join skill. Acceptance: a single self-contained Markdown document a user can hand to either Claude Code or OpenClaw via "Read and follow it." The document detects the host environment (by asking the host AI) and runs one of two installers:
- Claude Code side: npm i -g <tarball-URL> → a2a-bridge init → a2a-bridge daemon start → report bearer token + control port back to the user.
- OpenClaw side: npm i -g <tarball-URL> → verify a2a-bridge --version → register the ACP agent in acpx.config.agents → restart acpx → test the bridge by sending a one-shot prompt and asserting the reply is not an echo. Unit / smoke tests for the parsed instructions if feasible; manual verification notes otherwise.
P9.3 — README "Join the bridge" section. Acceptance: new top-level README section (between "Configure" and "Connect Gemini CLI") showing the single-line skill invoke for each side:
```
Claude Code:   Read <skill-url> and follow it.
OpenClaw:      Read <skill-url> and follow it.
```
Short two-sentence intro on what happens after both sides run the skill. Links to docs/join.md for the full text.
P9.4 — End-to-end manual verification of the cross-bridge loop. Acceptance: on the maintainer's machine, execute the skill against a live Claude Code session and a live OpenClaw session, then drive a non-trivial prompt through OpenClaw and record the full reply (must originate from Claude Code, not echo). Save the transcript under docs/release/verified-joins/<date>.md (or similar) as evidence for the first publish.
P9.5 — Bump + CHANGELOG close for the first publish. Acceptance: CHANGELOG.md's ## [0.1.0] header flips from — Unreleased to the release date. docs/release/publish.md remains the runbook; the maintainer takes over from there for npm publish, the marketplace form, and the ACP registry PR.

Phase 10 — Multi-target routing (v0.2)

Ship the kind:id target model from docs/design/multi-target-routing.md. v0.1 hardcodes one CC slot and one Codex peer; v0.2 lets one daemon front multiple Claude Code workspaces and multiple peer instances, selected by --target kind:id.

Phase footers (filled by the loop)

Phase 3 baseline: 240 pass / 0 fail / 19 E2E filtered, 28 test files, 579 expect calls (on P3.9 close; P3.0 sanity-check started at 223 pass).
Phase 4 baseline: 293 pass / 0 fail / 19 E2E filtered, 35 test files, 739 expect calls (on P4.12 close; Phase 4 opened with the 240-pass baseline carried from Phase 3).
Phase 5 baseline: 304 pass / 0 fail / 20 E2E filtered, 38 test files, 768 expect calls (on P5.8 close; Phase 5 opened with the 293-pass baseline carried from Phase 4).
Phase 6 baseline: 339 pass / 0 fail / 20 E2E filtered, 42 test files, 868 expect calls (on P6.10 close; Phase 6 opened with the 304-pass baseline carried from Phase 5).
Phase 7 baseline: 339 pass / 0 fail / 20 E2E filtered, 42 test files, 868 expect calls (on P7.6 close; Phase 7 opened with the 339-pass baseline carried from Phase 6). v0.1 complete — every P3–P7 task is [x]; remaining release steps (npm publish, marketplace form, ACP registry PR) require human credentials per ROADMAP and are documented in docs/release/publish.md.
Phase 8 baseline: 384 pass / 0 fail / 20 E2E filtered, 47 test files, 935 expect calls (on P8.8 close; Phase 8 opened with the 339-pass baseline carried from Phase 7). Real ACP → Claude Code routing via DaemonProxyGateway lands end-to-end; a2a-bridge acp no longer falls back to the echo gateway in production. Permission-relay policy documented in docs/design/architecture.md §"Permission-relay policy for ACP-originated turns" (option b: bridge to ACP session/request_permission); daemon-side bridge mechanism landed — plugin-side MCP handler + subprocess-side AgentSideConnection bridge remain as follow-ups.
Phase 9 baseline: 384 pass / 0 fail / 20 E2E filtered, 47 test files, 935 expect calls (on P9.5 close; no test-count delta vs. Phase 8 — Phase 9 shipped the v0.1.0 draft release, docs/join.md cross-bridge skill, README "Join the bridge" section, and the CHANGELOG release-date flip. P9.4 scaffold landed but the live-systems manual verification transcript is the maintainer's before the final publish). v0.1.0 tag pushed to GitHub; draft release at <github.com/firstintent/a2a-bridge/releases/tag/v0.1.0> carries the tarball. Remaining human-gated work: publish the GitHub draft, run npm publish, submit the Claude Code marketplace entry, and open the ACP registry PR — all steps enumerated in docs/release/publish.md.

v0.1 starting baseline

Carry-over from Phase 2 (ed50f4c): 223 pass / 0 fail / 19 E2E filtered, 27 test files, 532 expect calls.
lint:deps: 0 violations (84 modules / 213 deps).

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

TASKS — v0.1 (Phases 3–7)

Phase 3 — Verification and delegation patterns

Phase 4 — RoomRouter and TaskLog

Phase 5 — ACP inbound (multi-client reach)

Phase 6 — Distribution and UX polish

Phase 7 — Release packaging

Phase 8 — Real ACP → Claude Code routing (no mock)

Phase 9 — Join skill + first public release

Phase 10 — Multi-target routing (v0.2)

Phase footers (filled by the loop)

v0.1 starting baseline

FilesExpand file tree

TASKS.md

Latest commit

History

TASKS.md

File metadata and controls

TASKS — v0.1 (Phases 3–7)

Phase 3 — Verification and delegation patterns

Phase 4 — RoomRouter and TaskLog

Phase 5 — ACP inbound (multi-client reach)

Phase 6 — Distribution and UX polish

Phase 7 — Release packaging

Phase 8 — Real ACP → Claude Code routing (no mock)

Phase 9 — Join skill + first public release

Phase 10 — Multi-target routing (v0.2)

Phase footers (filled by the loop)

v0.1 starting baseline