Skip to content

steve-8000/amaze

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

92 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

AMAZE rock scout mascot raising a signal hand

Amaze

Amaze is a compact, tool-grounded coding-agent runtime for verified repository work. The top-level agent stays small: it owns the goal, plans the work, delegates bounded slices, integrates results, and verifies acceptance criteria. Detailed reading and edits happen in scoped subagents with explicit contracts.

The current system is built around a scout-and-orchestrate architecture: tools and specialized agents gather evidence, a compact orchestrator makes decisions, Mission Control records the work, Nexus memory contributes non-authoritative prior context, and proposal/apply/rollback loops make learning and configuration changes auditable.

What is in this repository

  • packages/coding-agent — the amaze CLI, interactive runtime, tools, goal verifier, task/subagent orchestration, Mission Control, research, proposals, rules, Nexus memory, and session plumbing.
  • packages/ai — provider/model integration and streaming utilities used by the agent runtime.
  • packages/tui — terminal UI primitives used by interactive mode.
  • packages/agent, packages/utils, packages/stats, packages/natives — shared runtime, CLI, telemetry, and native helper packages.
  • packages/swarm-extension and packages/typescript-edit-benchmark — extension and benchmark/support packages.
  • python/rocky and python/rocky/web — Rocky Python service and web workspace.
  • .amaze/ — checked-in project settings, skills, commands, and rules used by this repository profile.
  • docs/ — current operator documentation plus historical implementation records. Start with docs/README.md.

Core architecture

Compact orchestrator and bounded subagents

The main agent is optimized to be a low-token orchestrator. It keeps the objective, acceptance criteria, todos, approvals, and integration state in view, then delegates detailed work to subagents such as task, quick_task, explore, plan, reviewer, oracle, source_scout, memory_scout, x_researcher, and visual_qa.

Non-trivial subagent work is passed through a structured contract: scope, success criteria, escalation behavior, and output requirements. Scope is enforced at mutation tools, so prompt text is not the only boundary. Contract verification distinguishes hard failures from uncertainty: onUncertainty: "ask-parent" lets the parent continue with the subagent output, while blocking contracts still stop on uncertainty.

Mission Control and Mission Inspector

Mission Control is the operator-facing read model for mission state. The CLI exposes read-only mission views:

bun run dev -- mission <list|show|stream|lanes|evidence|decision|verify|rollback>

The interactive TUI includes a Mission Control view with objective state, lane/evidence summaries, decisions, verification, proposals, and rollback status. Mission Inspector links a mission back to tool traces, artifacts, subagent details, and session files.

Research, evidence, and decisions

Research work is modeled as lanes and evidence rather than unstructured notes. Mission records can include lane runs, evidence cards, decisions, verification records, related events, and rollback anchors. The research and mission CLI surfaces are the operator path into that flow.

Nexus memory

Nexus is the active memory backend. It stores durable user/project/failure/workflow knowledge and indexes prior sessions for search. Memory is guidance, not authority: current user instructions and repository state override it.

Useful commands:

bun run dev -- memory doctor
bun run dev -- memory search <query>

memory://root can be read by the runtime as the active memory artifact root.

Proposal, apply, and rollback loop

Amaze can turn rules, metrics, objectives, and memory signals into learning proposals. Proposals are explicit records that can be listed, inspected, approved, rejected, diffed, applied, and rolled back.

bun run dev -- proposals <list|show|approve|reject|apply|rollback|diff>
bun run dev -- evolve <status|preview|proposals|inspect|approve|apply|rollback|simulate|doctor>

Settings proposals carry patches and rollback values; skill and rule proposals are applied through the learning apply path with snapshots/anchors where available.

Local/runtime model routing

Routing is local configuration, not hard-coded documentation. Project defaults live in .amaze/settings.json; package-level provider code lives under packages/ai; subagent model/thinking overrides are exercised by the task agent tests. The checked-in profile currently uses a compact main context, Nexus memory, prompt-cache prefix reuse, project-local skills/rules, and an optional local scout role.

Amaze supports a provider-agnostic local LLM scout role for cheap prepasses before expensive remote reasoning. Projects configure this through modelRoles.local_scout and localLlm.* settings; task routing resolves the role alias instead of depending on a concrete local model name. Bundled source_scout and memory_scout prompts can use pi/local_scout with structured outputs when local scouting is enabled, while explicit per-agent overrides and normal fallback routing remain available.

The local scout helpers live under packages/coding-agent/src/local-llm/ and cover config resolution, conservative evidence-bundle types/validation, stable prompt construction for prefix-cache reuse, local role/config health checks, and cache/token accounting helpers. See docs/models.md for configuration examples.

Commands

Root package scripts are the canonical local entry points:

Purpose Command
Install dependencies bun install
Link local development CLI/packages bun run install:dev
Run the CLI from source bun run dev
Show CLI stats bun run stats
Build workspaces bun run build
Build native package bun run build:native
Typecheck and Biome-check TypeScript workspaces bun run check:ts
Full check, including Rust lane bun run check
Run TypeScript tests bun run test:ts
Rerun failed TypeScript tests bun run test:ts:failed
Full tests, including Rust lane bun run test
Lint TypeScript and Rust lanes bun run lint
Format TypeScript and Rust lanes bun run fmt
Memory doctor bun run dev -- memory doctor
Mission Control CLI bun run dev -- mission list
Proposals CLI bun run dev -- proposals list

Additional root scripts cover CI release/build jobs, docs-index generation, model generation, Rocky service/docker workflows, Python tests, and session statistics. See package.json for the complete list.

Documentation

Use docs/README.md as the canonical docs map. It separates current/operator docs from historical phase records and design/archive material.

Key current docs include:

Historical implementation records are preserved under docs/Phase0 through docs/Phase7 and docs/analysis.

About

No description, website, or topics provided.

Resources

License

Security policy

Stars

Watchers

Forks

Packages

 
 
 

Contributors