GitHub

Make long-running agentic work interruptible and recoverable across every supported provider — DeepSeek as reference, plus the additive hosts (Hugging Face Inference Providers, OpenRouter, Novita, Fireworks, SGLang, vLLM, Ollama, SiliconFlow) that the harness aims to be best-in-class for. Performance SLOs and the Homebrew tap rename land in the same window. Plan / Agent / YOLO remain the three execution modes; --model auto remains model routing, not a mode.

In scope

Control plane. Pause / redirect / cancel / resume on a running turn. Synchronous tool cancellation (#1791). Crash recovery for in-flight turns. Restore points / undo / checkpoints — the workbench can roll back to a prior state.
Provider parity sweep. Control-plane semantics tested against every supported provider so they behave identically. DeepSeek is the reference; the additive hosts must match. Hugging Face Inference Providers gets included in this sweep as preparatory work for its first-class promotion in v0.8.47 — cancel/resume must work against Qwen/*, deepseek-ai/*, and meta-llama/* model IDs via the HF router. Inbox-zero pass over hostability / long-session / Windows regressions re-targeted from v0.8.41 + the v0.8.43 redistribution: #1835, #1737, #1690, #1679, #1651, #1596, #1531, #1791, #1806, #1472, #1786, #1425, #1827, #1732, #765.
Performance SLOs. Cold startup p50 < 500 ms, no synchronous I/O on the UI thread, tool dispatch p50 < 100 ms. tracing spans instrumented; /timing slash command. Treat root causes, not symptoms.
Homebrew tap rename. Hmbown/homebrew-deepseek-tui → Hmbown/homebrew-codewhale, redirect/alias kept.

Out of scope

New execution modes. Plan / Agent / YOLO stay as the three execution modes; --model auto stays as model routing, not a mode. "Auto mode" as a fourth execution mode is explicitly ruled out — the three-mode model is the project's contract.
New providers beyond the HF Inference Providers parity inclusion. (HF as a config-block first-class provider lands in v0.8.47; v0.8.45 only tests parity through the existing OpenAI-compatible path.)
New OAuth flows.
~/.deepseek/ config path rename.
Tool surface expansion (defer to v0.8.46).
Verifier work (defer to v0.8.46 preview; Goal-mode-specific Fin wakeup lives in v0.8.43).
Model Lab workset implementation (post-v0.9.0; see #1977). The Serving Workset (vLLM / SGLang / TGI / llama.cpp / Ollama recipes) is downstream of this milestone's provider parity work.

Definition of done

A 30+ minute session can be paused, redirected, and resumed without state loss against DeepSeek (reference) plus all additive providers, including HF Inference Providers.
/timing shows cold-startup p50 < 500 ms on macOS / Linux / Windows reference machines.
Every v0.8.41-origin or v0.8.43-redistributed control-plane / hostability issue is fixed, deferred-with-cause, or closed-as-unreproducible.
brew install hmbown/codewhale/codewhale works against the renamed tap.

Release gate

Parity gates green.
CHANGELOG.md [0.8.45] entry calls out cancel/resume by provider and perf SLOs.
README provider matrix updated; the modes section continues to document Plan / Agent / YOLO (three modes), with --model auto covered separately as model routing.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

v0.8.45

In scope

Out of scope

Definition of done

Release gate

chore(release): prepare v0.8.45 flash release

v0.8.45

In scope

Out of scope

Definition of done

Release gate

List view

chore(release): prepare v0.8.45 flash release