Open
No due date
Last updated May 25, 2026

Make long-running agentic work interruptible and recoverable across every supported provider: DeepSeek as the reference, plus the additive hosts (Hugging Face Inference Providers, OpenRouter, Novita, Fireworks, SGLang, vLLM, Ollama, SiliconFlow) for which the harness aims to be best-in-class. Performance SLOs and the Homebrew tap rename land in the same window. Plan / Agent / YOLO remain the three execution modes; --model auto remains model routing, not a mode.

In scope

  • Control plane. Pause / redirect / cancel / resume on a running turn. Synchronous tool cancellation (#1791). Crash recovery for in-flight turns. Restore points / undo / checkpoints — the workbench can roll back to a prior state.
  • Provider parity sweep. Control-plane semantics are tested against every supported provider so that they behave identically. DeepSeek is the reference; the additive hosts must match it. Hugging Face Inference Providers is included in this sweep as preparatory work for its first-class promotion in v0.8.47: cancel/resume must work against Qwen/*, deepseek-ai/*, and meta-llama/* model IDs via the HF router. The sweep also covers an inbox-zero pass over the hostability / long-session / Windows regressions re-targeted from v0.8.41 and the v0.8.43 redistribution: #1835, #1737, #1690, #1679, #1651, #1596, #1531, #1791, #1806, #1472, #1786, #1425, #1827, #1732, #765.
  • Performance SLOs. Cold startup p50 < 500 ms, tool dispatch p50 < 100 ms, and no synchronous I/O on the UI thread. Deliverables: instrumented tracing spans and a /timing slash command. Treat root causes, not symptoms.
  • Homebrew tap rename. Hmbown/homebrew-deepseek-tui → Hmbown/homebrew-codewhale, with a redirect/alias kept.
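
The control-plane item above can be pictured as a small state machine over a single turn. The sketch below is illustrative only: TurnState, TurnControl, and every method name are hypothetical and not taken from the project's code. It assumes a turn is a queue of steps and that a checkpoint is simply a copy of the completed and remaining steps.

```python
from enum import Enum, auto


class TurnState(Enum):
    RUNNING = auto()
    PAUSED = auto()
    CANCELLED = auto()
    DONE = auto()


class TurnControl:
    """Hypothetical controller for one running turn (not the project's API)."""

    def __init__(self, steps):
        self.steps = list(steps)   # remaining work items for this turn
        self.completed = []        # work already done
        self.state = TurnState.RUNNING

    def pause(self):
        if self.state is TurnState.RUNNING:
            self.state = TurnState.PAUSED

    def resume(self):
        if self.state is TurnState.PAUSED:
            self.state = TurnState.RUNNING

    def cancel(self):
        # Cancellation is allowed from either live state and is terminal.
        if self.state in (TurnState.RUNNING, TurnState.PAUSED):
            self.state = TurnState.CANCELLED

    def redirect(self, new_steps):
        # Replace the remaining plan while keeping completed work.
        if self.state in (TurnState.RUNNING, TurnState.PAUSED):
            self.steps = list(new_steps)

    def checkpoint(self):
        # Copy both lists so later mutation cannot corrupt the snapshot.
        return (list(self.completed), list(self.steps))

    def restore(self, snapshot):
        # Roll back to a prior state; come back paused so nothing runs
        # until the user explicitly resumes.
        completed, steps = snapshot
        self.completed = list(completed)
        self.steps = list(steps)
        self.state = TurnState.PAUSED

    def step(self):
        """Run one step if allowed; return True if a step ran."""
        if self.state is not TurnState.RUNNING:
            return False
        if not self.steps:
            self.state = TurnState.DONE
            return False
        self.completed.append(self.steps.pop(0))
        if not self.steps:
            self.state = TurnState.DONE
        return True
```

In this sketch a paused turn refuses to run steps, redirect swaps only the remaining plan, and restore returns to a snapshot in the paused state. A parity sweep could drive the same sequence of calls against each provider and compare the resulting states with the reference.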

Out of scope

  • New execution modes. Plan / Agent / YOLO stay as the three execution modes; --model auto stays as model routing, not a mode. "Auto mode" as a fourth execution mode is explicitly ruled out — the three-mode model is the project's contract.
  • New providers beyond the HF Inference Providers parity inclusion. (HF as a config-block first-class provider lands in v0.8.47; v0.8.45 only tests parity through the existing OpenAI-compatible path.)
  • New OAuth flows.
  • ~/.deepseek/ config path rename.
  • Tool surface expansion (defer to v0.8.46).
  • Verifier work (defer to v0.8.46 preview; Goal-mode-specific Fin wakeup lives in v0.8.43).
  • Model Lab workset implementation (post-v0.9.0; see #1977). The Serving Workset (vLLM / SGLang / TGI / llama.cpp / Ollama recipes) is downstream of this milestone's provider parity work.

Definition of done

  • A 30+ minute session can be paused, redirected, and resumed without state loss against DeepSeek (reference) plus all additive providers, including HF Inference Providers.
  • /timing shows cold-startup p50 < 500 ms on macOS / Linux / Windows reference machines.
  • Every v0.8.41-origin or v0.8.43-redistributed control-plane / hostability issue is fixed, deferred-with-cause, or closed-as-unreproducible.
  • brew install hmbown/codewhale/codewhale works against the renamed tap.
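
The p50 targets above are medians, so a gate on them can be checked with a few lines. This is a minimal sketch under stated assumptions: the budget table copies the two numeric SLOs from this milestone, while SLO_P50_MS, p50_ms, meets_slo, and time_call_ms are hypothetical names and say nothing about how /timing is implemented. A real cold-startup measurement would also need to launch a fresh process per sample, which in-process timing cannot capture.

```python
import statistics
import time

# Budgets taken from the milestone text, in milliseconds.
SLO_P50_MS = {"cold_startup": 500.0, "tool_dispatch": 100.0}


def p50_ms(samples_ms):
    """Median (50th percentile) of latency samples in milliseconds."""
    return statistics.median(samples_ms)


def meets_slo(name, samples_ms):
    """True when the p50 of the samples is strictly below the budget."""
    return p50_ms(samples_ms) < SLO_P50_MS[name]


def time_call_ms(fn, runs=5):
    """Time fn() `runs` times with a monotonic clock; return per-run ms."""
    samples = []
    for _ in range(runs):
        start = time.perf_counter()
        fn()
        samples.append((time.perf_counter() - start) * 1000.0)
    return samples
```

Because the gate uses the median, a single slow outlier (for example one 900 ms startup among five runs) does not fail the check on its own; tail latency would need a separate p95 or p99 budget, which this milestone does not define.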

Release gate

  • Parity gates green.
  • CHANGELOG.md [0.8.45] entry calls out cancel/resume by provider and perf SLOs.
  • README provider matrix updated; the modes section continues to document Plan / Agent / YOLO (three modes), with --model auto covered separately as model routing.
96% complete
