- Universal kit for Software and AI coding agents: Claude Code, Cursor, GitHub Copilot, Antigravity/Gemini, Codex.
- Drop-in context kit that turns any AI coding agent into a governed engineering team for developing software.
- Based on Software Engineering & Architecture Practices and Procedures.
- Works for any project type, new or existing, any language, any team size.
- Write Once, Run Everywhere: SKILLS for Cursor (
.mdc), VS Code (folder/skill), Claude (.skill.md), Gemini. - The system features an experimental Meta-Cognitive Orchestrator "Brain" (Neurocognitive based architecture derived from Neuroscience, Cognitive Psychology, Epistemology, treating the Software System not as a machine, but as a Cognitive Entity), using Active Inference to plan, route, and optimize work.
- The system features a Multi-Agent Swarm "Loki" Engineering Team (30+ specialized agent roles for large projects), providing episodic and semantic memory, project history auditing and checkpoints.
- Experimental support for budget enforcement, tokens, hard stops, policy rules, cli tool gateway (via MCP server), audit tracing and logs, human escalation, and deterministic replay, with built-in rules for agents to select the best specialized skills/guides, proactively recommend necessary MCP servers, and default to continuous cost & budget optimizationβalways requiring human approval for expensive operations.
It contains:
- 140+ Skills (specialized capabilities)
- 70+ Templates (standardized documents)
- 40+ Guides (language & domain expertise)
- 30+ Personas (specialized roles)
- 40+ MCP servers (configuration and guides for AI tools)
- Brain Mode (meta-cognitive orchestration)
- Loki Mode (multi-agent swarm engineering personas team for large projects)
140+ Skills Β· 70+ Templates Β· 40+ Guides Β· 30+ Personas Β· 40+ MCPs Β· Loki / Brain Mode CLI
Full documentation: README_FULL.md Β· Full quick guide: QUICK_GUIDE.md Β· Quick commands: QUICK_COMMANDS.md Β· CLI reference: CLI_REFERENCE.md Β· MCP servers: MCP_CONFIGURATIONS.md Β· Platform Controls: PLATFORM_CONTROLS.md Β· Verification Guide: VERIFICATION_GUIDE.md
Online articles: Medium, Substack, TechRxiv
Example project where GABBE is used: Agentic AI Medical Imagery Diagnostic Helper
The init.py script is a Universal Skill Compiler. It generates the correct formats for your AI tools:
- Cursor: Generates
.cursor/rules/*.mdc(Optimized with Globs) - VS Code / Copilot: Generates
.github/skills/withconfig.json - Claude Code: Symlinks
.claude/skillsfor instant updates - Gemini: Wires up
.gemini/settings.json
- Run the Wizard
python3 scripts/init.py-
Verify Context
- Open
agents/AGENTS.mdand check theTech Stacksection and other [PLACEHOLDER] or Optional sections. - Open
agents/CONSTITUTION.mdand review project laws.
- Open
-
Feed the Mission
- The script generates
BOOTSTRAP_MISSION.md(orSETUP_MISSION.mdif dynamic setup is disabled) in your root. - Copy its content and paste it into your AI Agent's chat window.
- This aligns the agent with your project context immediately.
- The script generates
-
Git Tracking
- If you want to keep the initial structure of
agents/memory/andproject/in your repository but prevent Git from tracking the continuous autonomous modifications your agents will make to them locally, run:git ls-files agents/memory/ project/ | xargs git update-index --skip-worktree
- If you want to keep the initial structure of
Manual Setup:
cp -r GABBE/agents .
chmod +x agents/setup-context.sh && agents/setup-context.sh- Linux / macOS / WSL: Native support.
- Windows (Native):
- Use
python scripts/init.py(Symlinks automatically fallback to file copies if needed). - Use
agents/scripts/setup-context.ps1instead of.sh.
- Use
"Use business-case/strategy skills to validate exactly why we are building [description] and who it is for."
"Read AGENTS.md. I want to build [description]. Start with spec-writer skill."
Flow: Strategy β Spec β Design β Tasks β TDD Implementation β Security β Deploy
"Read AGENTS.md and agents/memory/PROJECT_STATE.md. Resume the project."
"Read AGENTS.md. Bug: [description]. Use debug skill with TDD."
Flow: Reproduce β Root Cause β Failing Test β Fix β Green β Regression Check
"Use tech-debt skill on [directory]. Then refactor the top-priority item."
"Run security-audit skill on the entire codebase."
"Run arch-review skill. Check for SOLID violations and coupling."
"Use the performant-nodejs skill to audit the current Node.js architecture for scalability bottlenecks and propose optimizations."
"Use the performant-laravel skill to audit the current Laravel architecture for scalability bottlenecks and propose optimizations."
"Use the performant-python skill to audit the current Python architecture for scalability bottlenecks and propose optimizations."
"Use the performant-go skill to audit the current Go architecture for scalability bottlenecks and propose optimizations."
"Use the performant-ai skill to audit the current AI/LLM architecture for latency and cost bottlenecks."
"Use the time-complexity skill to scan src/ for Big-O complexity hotspots and identify functions worse than O(n)."
"Act as a Principal Staff Engineer. Review the codebase in [directory] and generate a C4 system architecture diagram (Context and Container levels). Identify any bottlenecks and propose scaling strategies."
"Use the visual-whiteboarding skill. Connect to the Draw.io/Miro MCP and generate a visual spatial architecture diagram for the current microservice layout."
"Use the design-patterns and domain-model skills. We are building a [feature segment]. Propose the optimum architecture pattern (e.g. Event-driven, CQRS, Hexagonal) and define the core domain entities."
"Use the vibe-coding skill. Build a [component/page] using [framework]. I want it to feel [aesthetic, e.g. glassmorphism, cyberpunk, sleek corporate]. Include micro-animations and smooth transitions. Prioritize visual WOW over generic utility."
"Activate Brain Mode. Goal: [build X / migrate Y / solve Z]."
Uses Active Inference to plan, route between local/remote models, and learn from past outcomes.
Using Pure Agent Mode (No CLI):
"Activate
agents/skills/brain/loki-mode.skill.md. Goal: [build X]. Do not ask me for permission unless you hit a mandatory Human Approval Gate or a task requires True A2A Delegation."
Multi-agent swarm with 30+ specialized personas for projects >5 features or >20 files.
graph TD
%% Phase 1: Setup
subgraph Setup [1. Setup Phase]
Start([Start]) --> Init[Run init.py]
Init --> Mission[Feed BOOTSTRAP_MISSION.md or SETUP_MISSION.md]
Mission --> Config[Edit AGENTS.md]
end
%% Phase 2: Definition
subgraph Definition [2. Definition Phase]
Config --> Spec[Trigger: spec-writer.skill]
Spec --> PRD[Artifact: PRD.md]
PRD --> Review1{Human Approve?}
Review1 -- No --> Spec
end
%% Phase 3: Design
subgraph Design [3. Design Phase]
Review1 -- Yes --> Plan[Trigger: arch-design.skill]
Plan --> Arch[Artifact: PLAN.md + C4]
Arch --> Review2{Human Approve?}
Review2 -- No --> Plan
end
%% Phase 4: Execution
subgraph Execution [4. Execution Loop]
Review2 -- Yes --> Tasks[Trigger: Decompose project/TASKS.md]
Tasks --> LoopCheck{Tasks Remaining?}
LoopCheck -- Yes --> Pick[Pick Task]
Pick --> TDD[Trigger: tdd-cycle.skill]
TDD --> Red[Test Fails]
Red --> Green[Implement Pass]
Green --> Refactor[Refactor]
Refactor --> Verify[Audit Log]
Verify --> LoopCheck
end
%% Phase 5: Delivery
subgraph Delivery [5. Delivery Phase]
LoopCheck -- No --> Integrity[Trigger: integrity-check.skill]
Integrity --> Security[Trigger: security-audit.skill]
Security --> HumanRev{Human Review?}
HumanRev -- No --> Fix[Fix Issues]
Fix --> Integrity
HumanRev -- Yes --> Deploy[Deploy]
Deploy --> End([Done])
end
[START]
|
[INSTALL] python3 scripts/init.py -> Generates BOOTSTRAP_MISSION.md (or SETUP_MISSION.md)
|
[SETUP] Feed Mission to Agent -> Edit AGENTS.md (Stack/Rules)
|
[DEFINE] "Start new feature" -> spec-writer.skill -> PRD.md
| (Human Reviews & Approves PRD)
v
[DESIGN] Plan Architecture -> arch-design.skill -> PLAN.md + ADRs
| (Human Reviews & Approves Plan)
v
[TASKS] Decompose to project/TASKS.md (Atomic 15-min units)
|
+---> [IMPLEMENTATION LOOP] ----------------------------------+
| 1. Pick Task from project/TASKS.md |
| 2. Write Failing Test (Red) |
| 3. Write Code to Pass (Green) |
| 4. Refactor & Clean Up |
| 5. Verify (Tests + Lint) & Log to AUDIT_LOG.md |
| (Repeat until project/TASKS.md is empty) |
+-------------------------------------------------------------+
|
[VERIFY] integrity-check.skill -> security-audit.skill
| (Human Final Review)
v
[DEPLOY] Merge PR -> Staging -> Production -> [DONE]
How the pieces fit together to create a "Cognitive Entity".
graph TB
subgraph Human ["User (Steering Wheel)"]
H1[Strategy & Goals]
H2[Review & Approval]
end
subgraph Agent ["Agent / Brain (Engine)"]
B1[Active Inference Loop]
B2[Task Router]
end
subgraph Context ["Project Context"]
C1[AGENTS.md]
C2[CONSTITUTION.md]
end
subgraph Tools ["Capability Layer"]
S[120+ Skills]
T[60+ Templates]
G[30+ Guides]
end
subgraph Memory ["Memory System"]
M1[Working Memory]
M2["Episodic (Logs)"]
M3["Semantic (Facts)"]
end
H1 --> B1
C1 --> B1
B1 --> B2
B2 --> S
S --> T
S --> M2
M3 --> B1
S --> H2
H2 -- Feedback --> B1
[HUMAN USER]
| (Goal/Feedback)
v
+-------------------+ +------------------+
| AGENT BRAIN | <--- | PROJECT CONTEXT |
| (Active Inference)| | (AGENTS/Rules) |
+--------+----------+ +------------------+
|
v
[ROUTER & ORCHESTRATOR]
|
+--------+--------------------------+
| |
[SKILLS] (Function) [MEMORY] (Context)
| |
+-> [Coding] +-> [Episodic Logs]
+-> [Architecture] +-> [Semantic Facts]
+-> [Security] +-> [Continuity]
+-> [Ops / SRE] |
| |
v v
[TEMPLATES] (Structured Output) <---+
The "Golden Path" for every feature.
flowchart TD
S0[S0: Strategy] -->|Business Case| S1[S1: Specify]
subgraph Definition
S1 -->|PRD Draft| Ambiguity{Ambiguous?}
Ambiguity -- Yes --> Clarify[Clarify Questions]
Clarify --> S1
Ambiguity -- No --> S2[S2: Plan]
end
subgraph Design
S2 -->|PLAN.md + C4| Review1{Approved?}
Review1 -- No --> S2
Review1 -- Yes --> S3[S3: Tasks]
end
subgraph Execution
S3 -->|project/TASKS.md| Decomp{Task < 15m?}
Decomp -- No --> S3
Decomp -- Yes --> S4[S4: Implement]
S4 --> TDD[TDD Cycle]
TDD --> RARV[RARV: Reason/Act/Reflect/Verify]
RARV --> Audit[Audit Log]
end
Audit --> Done([Feature Complete])
0. STRATEGY
-> Why are we building this? (Value/ROI)
v
1. SPECIFY (S01)
-> spec-writer.skill -> PRD.md (EARS Syntax)
-> (Optional) visual-specs.skill -> Visual Spec Package (UI/Arch from scans)
-> Human Review & Approval
v
2. PLAN (S02)
-> arch-design.skill -> PLAN.md + C4 Diagrams
-> adr-writer.skill -> Architectural Decisions
v
3. DECOMPOSE (S03/S04)
-> project/TASKS.md -> Atomic steps (<15 mins each)
v
4. IMPLEMENT (S05)
-> One task at a time
-> TDD Loop: Red -> Green -> Refactor
-> RARV Loop: Reason -> Act -> Reflect -> Verify
v
5. VERIFY & SHIP (S06-S10)
-> Integrity Check -> Security Audit -> Deploy
6. COGNITIVE ORCHESTRATION & HEALING
-> gabbe brain activate -> Predict bottlenecks & route complexity
-> gabbe brain evolve -> Meta-optimize failing skills into Semantic Memory
-> gabbe brain heal -> Recover from environment/DB corruption
agents/
βββ AGENTS.md # Universal config (edit per project)
βββ CONSTITUTION.md # Immutable project law
βββ skills/ # 120+ .skill.md files
β βββ 00-index.md # Full skills registry
β βββ coding/ # tdd, review, debug, refactor, git...
β βββ architecture/ # arch-design, patterns, api-design...
β βββ security/ # audit, threat-model, privacy...
β βββ ops/ # sre, docker, k8s, deploy, cost...
β βββ product/ # spec-writer, req-elicitation...
β βββ core/ # research, self-heal, lifecycle...
β βββ data/ # data-engineering, db-migration
β βββ coordination/ # multi-agent-orch, agent-protocol
β βββ brain/ # active-inference, consciousness, memory...
βββ templates/ # 60+ fill-in-the-blank documents
β βββ 00-index.md # Full templates registry
β βββ coding/ # test plans, checklists, devcontainer
β βββ architecture/ # ADR, C4, domain model, integration
β βββ security/ # threat model, safety case, ethics
β βββ ops/ # incident, deploy, capacity, benchmark
β βββ product/ # PRD, spec, user story, business case
β βββ core/ # plan, tasks, audit log, traceability
β βββ coordination/ # agent profiles, swarm config
β βββ brain/ # inference loop, episodic memory, OODA
βββ guides/ # 30+ language & domain guides
βββ personas/ # 30+ specialized agent roles
βββ memory/ # Episodic + semantic + project state
βββ docs/ # Whitepapers & research
Sits above Loki. Decides how to execute, not just what to execute.
| Feature | Description |
|---|---|
| Active Inference | Predict β Act β Observe β Compare β Adapt loop |
| Cost Routing | Simple tasks β local free models, complex β remote SOTA |
| Episodic Memory | Recalls past project outcomes to avoid repeated mistakes |
| System 2 Thinking | Strategic planning before execution |
Guide: guides/js-ts-nodejs.md
Stack: Vitest, Zod, Prisma, Playwright, Hono
Config in AGENTS.md: test_cmd="npx vitest", lint_cmd="npx eslint ."
Guide: guides/go-lang.md
Stack: Echo/Gin, Ent, Testify, Testcontainers
Config in AGENTS.md: test_cmd="go test ./...", lint_cmd="golangci-lint run"
Guide: guides/php-laravel.md
Stack: DDD, Actions, Pest PHP, PHPStan L9, Enlightn
Config in AGENTS.md: test_cmd="vendor/bin/pest", lint_cmd="vendor/bin/pint"
Guide: guides/python-fastapi-ai.md
Stack: Pydantic, Pytest, Ruff, FastAPI
Config in AGENTS.md: test_cmd="pytest", lint_cmd="ruff check ."
| Phase | Gate | Key Artifact |
|---|---|---|
| S01 | Requirements | PRD_TEMPLATE.md (EARS syntax) |
| S02 | Design | ADR_TEMPLATE.md + C4 diagrams |
| S03 | Specification | SPEC_TEMPLATE.md + API contracts |
| S04 | Tasks | TASKS_TEMPLATE.md (15-min rule) |
| S05 | Implementation | TDD RedβGreenβRefactor + RARV |
| S06 | Testing | Unit >99% + integration + E2E |
| S07 | Security | SECURITY_CHECKLIST.md + audit |
| S08 | Review | Human code review |
| S09 | Staging | Smoke tests + benchmarks |
| S10 | Production | Rollback plan + monitoring |
| Category | Count | Key Skills |
|---|---|---|
| Coding | 10+ | tdd-cycle, debug, refactor, code-review, git-workflow |
| Architecture | 15+ | arch-design, microservices, systems-architecture, system-scalability, blockchain-dlt |
| Operations | 15+ | reliability-sre, production-health, dev-environments, cost-optimization |
| Security | 15+ | security-audit, secure-architecture, privacy-data-protection, api-security |
| Product | 10+ | spec-writer, req-elicitation, visual-specs, green-software |
| Core | 10+ | research, self-heal, knowledge-gap, meta-optimize |
| Data | 5+ | data-engineering, db-migration, semantic-web |
| Coordination | 5+ | multi-agent-orch, agent-protocol |
| Brain | 10+ | active-inference, consciousness-loop, cost-benefit-router |
| AI/Swarm | 5+ | multi-agent-systems, agent-communication, beyond-llms |
| Industry | 5+ | healthcare-fhir, telecom-networks, industrial-iot, global-standards, engineering-standards |
| Loki Modes | 2+ | brain-mode, loki-mode |
Full catalog: agents/skills/00-index.md (generated by init.py)
| Category | Count | Examples |
|---|---|---|
| Coding | 5+ | Clean Code Checklist, Test Plan, E2E Suite |
| Architecture | 10+ | ADR, C4, Domain Model, Scalability Plan, Smart Contract |
| Ops | 5+ | Incident Postmortem, Deploy Config, Capacity Plan, FinOps |
| Security | 5+ | Threat Model, Safety Case, Ethical Impact |
| Product | 10+ | PRD, Spec, User Story Map, Visual Spec, Green Software |
| Core | 5+ | Plan, Tasks, Audit Log, Project State, Continuity |
| Coordination | 5+ | Agent Profile, Swarm Config, Handshake, Voting Log |
| Brain | 5+ | Active Inference Loop, Episodic Memory, OODA Trace |
| Data | 5+ | Data Pipeline, Database Schema, OWL Ontology |
| Industry | 5+ | FHIR Interop, IoT Telemetry, Telecom API, Global SDLC Audit |
Full catalog: agents/templates/00-index.md (generated by init.py)
| Server | Purpose |
|---|---|
| Context-7 | Up-to-date SDK docs (prevents hallucination) |
| Sequential Thinking | Chain-of-thought reasoning |
| GitHub MCP | PR review, code search |
| PostgreSQL MCP | Live schema introspection |
| Playwright MCP | Browser automation / visual TDD |
| Brave Search | Authoritative web research |
| Time Complexity | Local Big-O static analysis via tree-sitter |
| Excalidraw | Programmatic Excalidraw diagram creation |
Config: templates/core/MCP_CONFIG_TEMPLATE.json Β· Full guide: MCP_CONFIGURATIONS.md
Task β Knowledge gap? β research.skill β Execute β Verify
β FAIL
self-heal.skill (max 5Γ)
β STILL FAIL
Human escalation report
| Layer | Location | Purpose |
|---|---|---|
| Project State | agents/memory/PROJECT_STATE.md |
Current SDLC phase |
| Audit Log | agents/memory/AUDIT_LOG.md |
Append-only decision history |
| Continuity | agents/memory/CONTINUITY.md |
Past failures (read every session) |
| Episodic | agents/memory/episodic/ |
Per-session decision logs |
| Semantic | agents/memory/semantic/ |
Crystallized project knowledge |
| Problem | Fix |
|---|---|
| Agent ignores AGENTS.md | Run setup-context.sh to create symlinks |
| Agent repeats mistakes | Check CONTINUITY.md β tell agent to read it |
| Tests pass immediately | False positive β test is broken, fix it first |
| Uses deprecated APIs | Activate Context-7 MCP |
| Session lost | "Use session-resume skill to load all memory" |
| Context too large | Use context_cost: low skills or activate Loki Mode |
| Stack | Guide |
|---|---|
| JS/TS/Node.js | guides/languages/js-ts-nodejs.md |
| Node.js Advanced & TS | guides/languages/nodejs-advanced.md |
| Go | guides/languages/go-lang.md |
| PHP/Laravel | guides/languages/php-laravel.md |
| Python/FastAPI | guides/languages/python-fastapi-ai.md |
| SQL/NoSQL | guides/data/sql-nosql.md |
| Microservices | guides/architecture/microservices.md |
| Architecture | guides/architecture/systems-architecture.md |
| Testing | guides/principles/testing-strategy.md |
| Design Patterns | guides/patterns/design-patterns.md |
| Agentic AI | guides/ai/ai-agentic.md |
| Multi-Agent Systems | guides/ai/multi-agent-systems.md |
| Security/Compliance | guides/ops/compliance-audit.md |
| API Security | guides/security/api-security.md |
| Cryptography | guides/security/cryptography-standards.md |
| Data Protection | guides/security/privacy-data-protection.md |
| Secure Arch | guides/security/secure-architecture.md |
| Threat Modeling | guides/security/threat-modeling.md |
| DevOps/Environments | guides/ops/dev-environments.md |
| Developer Workflow | guides/ops/dev-workflow.md |
| Legacy/COBOL | guides/principles/legacy-tech.md |
| Future Tech 2030 | guides/principles/future-tech.md |
| C-Sharp / .NET | guides/languages/c-sharp.md |
| Self-Healing | guides/ai/self-healing-summary.md |
| Time Complexity | guides/patterns/time-complexity-analysis.md |
| Troubleshooting | guides/ops/troubleshooting-guide.md |
| Enterprise Migration | guides/patterns/enterprise-migration-scenario.md |
All 120+ skills in the GABBE kit feature a heavily enforced "Security & Guardrails" section that binds agents to the project's CONSTITUTION.md. The 3-layer security constraints include:
- Skill Security: Tool-specific protection (e.g., preventing command injection or sandbox escapes).
- System Integration Security: Safe external integration (e.g., verifying boundary enforcement or ensuring test coverage).
- LLM/Agent Guardrails: Protection from AI-specific failures (e.g., hallucinated metrics, confirmation bias, or prompt injection).
Agents are explicitly configured to Fail-Closedβthey must wait for human approval rather than bypassing a guardrail.
GABBE has also an experimental helper, Zero-Dependency CLI (gabbe) for a "Hybrid Mode" (Markdown files and a SQLite database) and launching different commands. It is a work-in-progress and you can do without it, only with the rest of the kit.
- Python 3.8+
- LLM API Key: For Brain/Route features, set
GABBE_API_KEY(OpenAI-compatible).
Environment Variables (full reference in CLI_REFERENCE.md):
| Variable | Default | Description |
|---|---|---|
GABBE_API_URL |
https://api.openai.com/v1/chat/completions |
OpenAI-compatible endpoint |
GABBE_API_KEY |
(required for LLM features) | Bearer token for the LLM API |
GABBE_API_MODEL |
gpt-4o |
Model name sent in API requests |
GABBE_LLM_TEMPERATURE |
0.7 |
Sampling temperature (0.0β1.0) |
GABBE_LLM_TIMEOUT |
30 |
HTTP timeout in seconds |
GABBE_LLM_MAX_RETRIES |
3 |
Number of LLM retry attempts on transient errors |
GABBE_LOG_LEVEL |
INFO |
Logging verbosity (DEBUG, INFO, WARNING, ERROR) |
GABBE_ROUTE_THRESHOLD |
50 |
Complexity score above which prompts route REMOTE |
GABBE_MAX_COST_USD |
5.0 |
Maximum cost (USD) budget per run |
GABBE_MAX_TOKENS_PER_RUN |
100000 |
Maximum token limit per run |
GABBE_MAX_TOOL_CALLS_PER_RUN |
50 |
Maximum tool calls allowed per run |
GABBE_MAX_ITERATIONS |
25 |
Maximum active-inference iterations per run |
GABBE_MAX_WALL_TIME |
300 |
Maximum wall-clock time (seconds) per run |
GABBE_MAX_RECURSION_DEPTH |
5 |
Maximum agent recursion depth |
GABBE_MAX_RETRIES_PER_TOOL |
3 |
Maximum retries for a single tool call |
GABBE_POLICY_FILE |
project/policies.yml |
Path to the YAML policy file |
GABBE_ESCALATION_MODE |
cli |
Escalation mode: cli, file, or silent |
GABBE_SUBPROCESS_TIMEOUT |
300 |
Timeout (seconds) for verify sub-commands |
GABBE_OTEL_ENABLED |
false |
Enable OpenTelemetry tracing |
The CLI is a Python package.
# 1. Install locally (Recommended)
pip install -e .
# 2. Verify installation
gabbe --help| Command | Description |
|---|---|
gabbe init |
Initialize the SQLite Database (Run this after python scripts/init.py). |
gabbe sync |
Hybrid Sync: Bidirectional sync between project/TASKS.md and SQLite DB. |
gabbe verify |
Enforcer: programmable integrity check (files, tests, lint). |
gabbe status |
Dashboard: Visualizes project phase and task progress. |
gabbe brain |
Meta-Cognition: Activates Active Inference loop or Evolutionary Prompt Optimization (Requires API Key). |
gabbe route |
Cost Router: Arbitrates between Local and Remote LLMs based on task complexity (Requires API Key). |
gabbe forecast |
Strategic Forecast: Projects remaining work cost and tokens based on historical run data. |
gabbe serve-mcp |
MCP Gateway: Zero-dependency JSON-RPC Model Context Protocol server for standalone agents to access tools safely. |
gabbe runs |
Run History: List recent agent runs with status, cost, and timestamps. |
gabbe audit <run-id> |
Audit Trace: Display structured span-level trace for a past run (--format json|table). |
gabbe replay <run-id> |
Deterministic Replay: Replay a past run from its checkpoints (--from-step N). |
gabbe resume <run-id> |
Escalation Resume: Approve or reject pending escalations for a paused run. |
The experimental gabbe CLI supports a platform control layer. It covers budget enforcement, cost and token controls, hard stops, policy rules, the tool gateway, audit tracing, human escalation, and deterministic replay. Detailed documentation is available in PLATFORM_CONTROLS.md.
GABBE uses a Hybrid Architecture where agents and humans interact via Markdown, but the system of record is SQLite.
graph TD
subgraph User["User (Legacy Flow)"]
Edit[Edit project/TASKS.md]
end
subgraph CLI["GABBE CLI (pip installed)"]
Sync[gabbe sync]
Verify[gabbe verify]
Brain[gabbe brain]
Router[gabbe route]
Forecast[gabbe forecast]
MCP[gabbe serve-mcp]
end
subgraph Storage["Hybrid Memory"]
MD[Markdown Files]
DB[(SQLite state.db)]
end
User -->|Manual Edits| MD
MD <-->|Bi-Directional| Sync
Sync <--> DB
Brain -->|Read/Write| DB
Verify -->|Check| MD
Verify -->|Check| DB
Forecast -->|Analyze| DB
MCP -->|Write Telemetry| DB
# 1. Generate Context Configs
python3 scripts/init.py
# 2. Initialize Database
gabbe init# Check status
gabbe status
# Sync tasks (manual edits)
gabbe sync
# Optimize a skill (Requires GABBE_API_KEY)
gabbe brain evolve --skill tdd-cyclegabbe verifyΒ© 2026 Andrei Nicolae Besleaga. This work is licensed CC BY-SA 4.0
