Open
May 25, 2026
No due date

Make long-running agentic work interruptible and recoverable across every supported provider: DeepSeek as the reference, plus the additive hosts (Hugging Face Inference Providers, OpenRouter, Novita, Fireworks, SGLang, vLLM, Ollama, SiliconFlow) that the harness aims to be best-in-class for. Performance SLOs and the Homebrew tap rename land in the same window. Plan / Agent / YOLO remain the three execution modes; --model auto remains model routing, not a mode.
In scope
- Control plane. Pause / redirect / cancel / resume on a running turn. Synchronous tool cancellation (#1791). Crash recovery for in-flight turns. Restore points / undo / checkpoints — the workbench can roll back to a prior state.
- Provider parity sweep. Control-plane semantics tested against every supported provider so they behave identically. DeepSeek is the reference; the additive hosts must match. Hugging Face Inference Providers gets included in this sweep as preparatory work for its first-class promotion in v0.8.47: cancel/resume must work against `Qwen/*`, `deepseek-ai/*`, and `meta-llama/*` model IDs via the HF router. Inbox-zero pass over hostability / long-session / Windows regressions re-targeted from v0.8.41 + the v0.8.43 redistribution: #1835, #1737, #1690, #1679, #1651, #1596, #1531, #1791, #1806, #1472, #1786, #1425, #1827, #1732, #765.
- Performance SLOs. Cold startup p50 < 500 ms, no synchronous I/O on the UI thread, tool dispatch p50 < 100 ms. `tracing` spans instrumented; `/timing` slash command. Treat root causes, not symptoms.
- Homebrew tap rename. `Hmbown/homebrew-deepseek-tui` → `Hmbown/homebrew-codewhale`, redirect/alias kept.
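The control-plane item above can be read as a small state machine over a running turn. The following is a minimal sketch of that reading, not the harness's implementation: the `Turn` and `TurnState` names, the rule that redirect requires a paused turn, and the shape of a checkpoint are all assumptions made for illustration.

```python
from dataclasses import dataclass, field
from enum import Enum


class TurnState(Enum):
    RUNNING = "running"
    PAUSED = "paused"
    CANCELLED = "cancelled"


@dataclass
class Turn:
    """Hypothetical in-flight turn; not the harness's real type."""

    instruction: str
    state: TurnState = TurnState.RUNNING
    steps: list[str] = field(default_factory=list)
    checkpoints: list[tuple[str, list[str]]] = field(default_factory=list)

    def record(self, step: str) -> None:
        # Work may only progress while the turn is running.
        if self.state is not TurnState.RUNNING:
            raise RuntimeError(f"cannot record a step while {self.state.value}")
        self.steps.append(step)

    def checkpoint(self) -> int:
        # Snapshot the instruction and a copy of the steps; return a restore-point id.
        self.checkpoints.append((self.instruction, list(self.steps)))
        return len(self.checkpoints) - 1

    def pause(self) -> None:
        if self.state is TurnState.RUNNING:
            self.state = TurnState.PAUSED

    def redirect(self, instruction: str) -> None:
        # Assumption of this sketch: redirect is only valid on a paused turn.
        if self.state is not TurnState.PAUSED:
            raise RuntimeError("redirect requires a paused turn")
        self.instruction = instruction

    def resume(self) -> None:
        # A cancelled turn stays cancelled; only a paused turn resumes.
        if self.state is TurnState.PAUSED:
            self.state = TurnState.RUNNING

    def cancel(self) -> None:
        self.state = TurnState.CANCELLED

    def restore(self, checkpoint_id: int) -> None:
        # Roll back to a prior restore point without discarding later checkpoints.
        instruction, steps = self.checkpoints[checkpoint_id]
        self.instruction = instruction
        self.steps = list(steps)
```

In this sketch, cancel is terminal and restore copies the snapshot, so rolling back twice to the same restore point yields the same state each time.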
Out of scope
- New execution modes. Plan / Agent / YOLO stay as the three execution modes; `--model auto` stays as model routing, not a mode. "Auto mode" as a fourth execution mode is explicitly ruled out: the three-mode model is the project's contract.
- New providers beyond the HF Inference Providers parity inclusion. (HF as a config-block first-class provider lands in v0.8.47; v0.8.45 only tests parity through the existing OpenAI-compatible path.)
- New OAuth flows.
- `~/.deepseek/` config path rename.
- Tool surface expansion (defer to v0.8.46).
- Verifier work (defer to v0.8.46 preview; Goal-mode-specific Fin wakeup lives in v0.8.43).
- Model Lab workset implementation (post-v0.9.0; see #1977). The Serving Workset (vLLM / SGLang / TGI / llama.cpp / Ollama recipes) is downstream of this milestone's provider parity work.
Definition of done
- A 30+ minute session can be paused, redirected, and resumed without state loss against DeepSeek (reference) plus all additive providers, including HF Inference Providers.
- `/timing` shows cold-startup p50 < 500 ms on macOS / Linux / Windows reference machines.
- Every v0.8.41-origin or v0.8.43-redistributed control-plane / hostability issue is fixed, deferred-with-cause, or closed-as-unreproducible.
- `brew install hmbown/codewhale/codewhale` works against the renamed tap.
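The p50 targets are medians, so a single slow run does not by itself fail the SLO. The following sketch shows how a set of timing samples could be checked against the two p50 thresholds stated in this milestone. The `SLO_P50_MS` table, the function names, and the sample values are assumptions for illustration; they do not describe what `/timing` actually emits.

```python
from statistics import median

# p50 thresholds taken from the milestone text, in milliseconds.
SLO_P50_MS = {"cold_startup": 500.0, "tool_dispatch": 100.0}


def p50(samples_ms):
    """Return the median of a non-empty list of timings in milliseconds."""
    if not samples_ms:
        raise ValueError("need at least one sample")
    return median(samples_ms)


def check_slos(measurements):
    """Map each measured metric to True if its p50 is strictly under its threshold."""
    return {
        name: p50(samples) < SLO_P50_MS[name]
        for name, samples in measurements.items()
    }


# Hypothetical samples: one 900 ms outlier does not move the cold-startup median.
results = check_slos(
    {
        "cold_startup": [420, 480, 510, 450, 900],
        "tool_dispatch": [80, 120, 130],
    }
)
print(results)
```

With these made-up samples the cold-startup median is 480 ms, which passes, and the tool-dispatch median is 120 ms, which fails.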
Release gate
- Parity gates green.
- `CHANGELOG.md` `[0.8.45]` entry calls out cancel/resume by provider and perf SLOs.
- README provider matrix updated; the modes section continues to document Plan / Agent / YOLO (three modes), with `--model auto` covered separately as model routing.
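One way to picture a parity gate is to replay the same control-plane scenario against each provider and compare the observed state trace with the reference provider's trace. The sketch below does that with stub providers. The `parity_report` function, the stubs, and the trace format are hypothetical; a real gate would drive the harness against live or recorded provider sessions.

```python
from typing import Callable

Scenario = list[str]
Provider = Callable[[Scenario], list[str]]


def well_behaved(scenario: Scenario) -> list[str]:
    """Stub provider that honors pause, resume, and cancel."""
    trace, state = [], "running"
    for command in scenario:
        if command == "pause" and state == "running":
            state = "paused"
        elif command == "resume" and state == "paused":
            state = "running"
        elif command == "cancel":
            state = "cancelled"
        trace.append(state)
    return trace


def ignores_pause(scenario: Scenario) -> list[str]:
    """Deliberately broken stub: keeps running when asked to pause."""
    return ["cancelled" if command == "cancel" else "running" for command in scenario]


def parity_report(
    reference: str, providers: dict[str, Provider], scenario: Scenario
) -> dict[str, bool]:
    """Return, per non-reference provider, whether its trace matches the reference."""
    expected = providers[reference](scenario)
    return {
        name: run(scenario) == expected
        for name, run in providers.items()
        if name != reference
    }


# Provider labels are placeholders; the stubs stand in for real adapters.
report = parity_report(
    "deepseek",
    {"deepseek": well_behaved, "host_a": well_behaved, "host_b": ignores_pause},
    ["pause", "resume", "cancel"],
)
print(report)
```

Under this reading the gate is green only when every entry in the report is true.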
96% complete
- Status: Open (in progress). Hmbown/CodeWhale #2049