Godmode — Autonomous AI Coding Agent for Claude Code, Cursor, Codex & Gemini

Your AI writes code. Godmode makes it write great code — then proves it.

The open-source autonomous coding agent that turns AI assistants into engineering systems. 134 skills for building, testing, optimizing, securing, and shipping software — with iterative optimization, parallel multi-agent execution, automatic rollback, failure memory, Karpathy-style authoring discipline, pre-commit discard audit, and a four-layer token optimization stack that cuts routing context by ~90%. Plugin for Claude Code, Cursor, Codex, Gemini CLI, and OpenCode.

TL;DR: Install with `claude plugin install godmode`. Say `/godmode optimize my API` and walk away. It measures, iterates, keeps improvements, reverts failures, and stops when the goal is met or the iteration budget runs out.

See It In Action

Performance Optimization — 847ms to 198ms, fully autonomous

$ /godmode:optimize
Goal: Reduce API response time
Iterations: 20

  BASELINE    847ms
  ROUND 1     554ms  KEPT  (-34.6%)  -- added index on category_id
  ROUND 2     382ms  KEPT  (-31.0%)  -- enabled gzip compression
  ROUND 3     276ms  KEPT  (-27.7%)  -- eager loading for posts
  ROUND 4     290ms  REVERTED        -- batch loader (guard failed)
  ROUND 5     226ms  KEPT  (-18.1%)  -- connection pool to 20
  ROUND 6     198ms  KEPT  (-12.4%)  -- Redis response cache

  === 847ms --> 198ms (76.6% improvement) ===
  Keeps: 5 | Discards: 1
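
Each percentage in the transcript is measured against the last kept result, not the original baseline, which is why round 5 is compared with round 3's 276ms after round 4 was reverted. The following is a minimal sketch of that arithmetic, assuming a lower-is-better metric; `summarize` and its arguments are illustrative names, not part of Godmode.

```python
def summarize(baseline, rounds):
    """Return (per-round change vs. the last kept value, total improvement %)."""
    best = baseline
    changes = []
    for value, kept in rounds:
        if kept:
            changes.append(round((value - best) / best * 100, 1))
            best = value  # a kept round becomes the new reference point
        else:
            changes.append(None)  # reverted: the reference point is unchanged
    total = round((baseline - best) / baseline * 100, 1)
    return changes, total


# (milliseconds, kept?) pairs taken from the transcript above.
rounds = [(554, True), (382, True), (276, True),
          (290, False), (226, True), (198, True)]
changes, total = summarize(847, rounds)
print(changes)
print(total)  # 76.6
```

The total reproduces the 76.6% headline figure, and the reverted round contributes `None` because it never becomes a reference point.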

Multi-Agent Build — 4 agents, isolated worktrees, clean merge

$ /godmode:build
Goal: Add user authentication system

  PLAN        Decomposed into 4 parallel tasks
  AGENT 1     [worktree] Auth middleware + JWT tokens        DONE
  AGENT 2     [worktree] User model + password hashing       DONE
  AGENT 3     [worktree] Login/register API endpoints         DONE
  AGENT 4     [worktree] Integration tests for auth flow      DONE
  MERGE       Sequential merge + test after each              ALL PASS
  REVIEW      4-agent code review (security, perf, style)     APPROVED

  === 4 tasks | 4 agents | 12 files | 47 tests passing ===

Security Audit — STRIDE + OWASP + red team

$ /godmode:secure
Target: src/api/

  RECON       Mapped 23 endpoints, 4 auth flows, 2 data stores
  STRIDE      6 threat categories analyzed
  OWASP       Top 10 checklist applied
  RED TEAM    4 personas: script kiddie, insider, APT, researcher

  CRITICAL  1   SQL injection in /api/search (parameterize query)
  HIGH      2   Missing rate limit on /api/login, weak CORS policy
  MEDIUM    3   Verbose error messages, missing CSP header, session fixation
  LOW       1   Server version disclosed in headers

  7 findings with fix code + verification commands

Every finding comes with code evidence, severity, a concrete fix, and a command to verify the fix works.

Quick Start

# Install (Claude Code)
claude plugin install godmode

# Or install for other platforms
bash adapters/cursor/install.sh
bash adapters/gemini/install.sh
bash adapters/opencode/install.sh

# Use specific skills
/godmode:optimize   # Autonomous performance iteration
/godmode:build      # Build with parallel agents
/godmode:secure     # Security audit
/godmode:ship       # Pre-flight + deploy + verify

# Or just describe what you want
/godmode make this API faster       # --> routes to optimize
/godmode fix the failing tests      # --> routes to fix
/godmode build a rate limiter       # --> routes to think --> plan --> build

Godmode auto-detects what you need and routes to the right skill.

What Is Godmode?

Godmode is an open-source plugin that adds autonomous coding capabilities to AI assistants like Claude Code, Cursor, Codex, Gemini CLI, and OpenCode. Instead of generating code once and hoping it works, Godmode runs a disciplined engineering loop: measure, modify, verify, keep or revert, repeat — until the goal is met.

It ships with 134 expert skills across 13 domains (performance optimization, security auditing, TDD, deployment, database tuning, and more), 7 specialized subagents that work in parallel, a failure memory system that learns from every discarded change, and a built-in authoring-discipline prelude that prevents the most common LLM coding mistakes before they hit the repo.

How Godmode Compares to Plain AI Coding

| Problem | Godmode's answer |
| --- | --- |
| AI generates code once and stops | Autonomous iteration loops that run until the goal is met |
| No way to know if a change helped | Every change is benchmarked against a mechanical metric |
| Bad changes stay in the codebase | Automatic rollback (`git reset`) on any regression |
| Complex tasks bottleneck on one agent | Up to 5 parallel agents in isolated git worktrees |
| AI repeats the same mistakes | Failure memory classifies and logs every failed approach |
| Crashes lose all progress | Session state persists and auto-resumes |
| Improvements might be noise | Variance testing and generalization gates prevent overfitting |
| AI silently picks one interpretation | Karpathy prelude: state assumptions, surface alternatives, emit `NEEDS_CONTEXT` on ambiguity |
| AI adds "while we're here" refactors | Line-trace rule + pre-commit discard audit surgically drops drift hunks |
| Long autonomous loops burn tokens | Terse mode, stdio patterns, Progressive Disclosure routing: ~90% routing context reduction, 40-60% emit reduction |
| Agents silently guess missing dispatch fields | DispatchContext schema validation: a missing field returns `BLOCKED: invalid_dispatch` |

How It Works

Every iterative skill follows the same disciplined loop:

1. REVIEW   -- read state, logs, git history
2. IDEATE   -- pick the next change (informed by past failures)
3. MODIFY   -- make ONE atomic change, commit before verify
4. VERIFY   -- run guard (tests + lint + build must all pass)
5. DECIDE   -- improved --> KEEP. Worse --> DISCARD (git reset)
6. LOG      -- append to .godmode/<skill>-results.tsv
7. REPEAT   -- until goal met or iteration budget exhausted

No human approval needed between iterations. Every experiment is committed to git. Every discard is classified and remembered.
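
The loop above can be sketched in a few lines. This is a simplified illustration under stated assumptions, not Godmode's implementation: the metric is lower-is-better, and the `apply_change` / `undo_change` callbacks stand in for the git commit and `git reset` steps. All function names and the toy `state` dictionary are hypothetical.

```python
def run_loop(measure, propose, apply_change, undo_change, guard, goal, budget):
    """Keep-or-discard loop for a lower-is-better metric."""
    best = measure()
    log = []
    for round_no in range(1, budget + 1):
        if best <= goal:                 # REPEAT: stop once the goal is met
            break
        change = propose(log)            # IDEATE: informed by earlier outcomes
        apply_change(change)             # MODIFY: one atomic change
        passed = guard()                 # VERIFY: tests + lint + build
        value = measure() if passed else None
        if passed and value < best:      # DECIDE
            best, status = value, "KEEP"
        else:
            undo_change(change)
            status = "DISCARD"
        log.append((round_no, change, value, status))  # LOG
    return best, log


# Toy stand-in for a real project: each "change" shifts latency by a fixed amount.
state = {"latency_ms": 100}
effects = [-30, +10, -20, -5]            # hypothetical effect of each idea

best, log = run_loop(
    measure=lambda: state["latency_ms"],
    propose=lambda log: effects[len(log)],
    apply_change=lambda d: state.update(latency_ms=state["latency_ms"] + d),
    undo_change=lambda d: state.update(latency_ms=state["latency_ms"] - d),
    guard=lambda: True,
    goal=50,
    budget=4,
)
print(best, [status for *_, status in log])
```

The second idea makes latency worse, so it is undone and the next round starts from the last kept state. The loop exits before spending its fourth iteration because the goal has already been reached.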

The Full Pipeline

THINK --> PLAN --> BUILD --> TEST --> FIX --> OPTIMIZE --> SECURE --> SHIP

Godmode detects which phase you're in and routes to the right skill. Skills chain automatically — optimize finds an issue, triggers fix, re-optimizes, then ships.

Failure Intelligence

Every discard is classified into one of 8 failure types and logged to .godmode/<skill>-failures.tsv. On 3+ consecutive failures, the agent writes a reflective diagnosis analyzing the pattern before trying again. Lessons persist in .godmode/lessons.md across sessions.
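
As a rough sketch of the bookkeeping this implies: one TSV row per discard, plus a check for three or more consecutive discards. The column layout and the `guard_failed` label are assumptions made for illustration; the source does not list the 8 failure types or the file schema.

```python
import csv
import io

FAILURE_THRESHOLD = 3  # "3+ consecutive failures" from the text above


def trailing_discards(statuses):
    """Count how many of the most recent rounds ended in DISCARD."""
    count = 0
    for status in reversed(statuses):
        if status != "DISCARD":
            break
        count += 1
    return count


def needs_reflection(statuses):
    """True when the agent should pause and write a diagnosis."""
    return trailing_discards(statuses) >= FAILURE_THRESHOLD


def log_failure(stream, round_no, failure_type, note):
    """Append one tab-separated row (hypothetical columns)."""
    writer = csv.writer(stream, delimiter="\t", lineterminator="\n")
    writer.writerow([round_no, failure_type, note])


buffer = io.StringIO()  # stands in for .godmode/<skill>-failures.tsv
log_failure(buffer, 4, "guard_failed", "batch loader broke a test")
print(buffer.getvalue(), end="")
print(needs_reflection(["KEEP", "DISCARD", "DISCARD", "DISCARD"]))
```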

Multi-Agent Execution

Complex tasks are decomposed and run in parallel:

Round 1:  Agent 1 [worktree] --\
          Agent 2 [worktree] ---+-- merge + test
          Agent 3 [worktree] --/
Round 2:  Agent 4 [worktree] --\
          Agent 5 [worktree] ---+-- merge + test

Max 5 agents per round. Each gets its own git worktree. Merge sequentially, test after each merge.
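
A minimal sketch of the two rules stated above: the per-round cap, and merging sequentially with a test run after each merge. It chunks tasks purely by the cap, whereas a real planner presumably also respects task dependencies. Stopping at the first failing merge is an assumption, and `merge` and `tests_pass` are hypothetical callbacks standing in for git and the test suite.

```python
def plan_rounds(tasks, max_agents=5):
    """Split tasks into rounds of at most max_agents each."""
    return [tasks[i:i + max_agents] for i in range(0, len(tasks), max_agents)]


def merge_sequentially(branches, merge, tests_pass):
    """Merge one branch at a time, testing after each merge.

    Returns (merged_branches, first_failing_branch_or_None).
    """
    merged = []
    for branch in branches:
        merge(branch)
        if not tests_pass():
            return merged, branch
        merged.append(branch)
    return merged, None


tasks = ["task-%d" % n for n in range(1, 8)]
rounds = plan_rounds(tasks)
print([len(r) for r in rounds])  # [5, 2]

# Toy trunk: pretend the tests start failing once task-3 is merged.
trunk = []
merged, failed = merge_sequentially(
    rounds[0], trunk.append, lambda: "task-3" not in trunk
)
print(merged, failed)
```

Testing after every merge, rather than once at the end of the round, is what makes the failing branch identifiable.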

What Fires by Default

Every /godmode:* invocation — and every natural-language request that routes to a pipeline skill — fires the full learning stack automatically. No flags, no opt-in. See SKILL.md §14 Default Activations for the authoritative list.

Authoring discipline (Karpathy family)

- Principles prelude: every agent reads `skills/principles/SKILL.md` before the first Edit. Four rules: Think Before Coding (state assumptions, surface alternatives), Simplicity First (pre-MODIFY strike for single-use helpers and unrequested configurability), Surgical Changes (line-trace rule), Goal-Driven Execution (success criterion is a shell command exiting zero).
- Pre-commit discard audit: builder, tester, and optimizer agents classify every hunk in `git diff --cached` before committing. `line_scope_drift` hunks (formatting churn, "while we're here" refactors, adjacent improvements) are surgically dropped via `git restore -p --staged`. Spec: `docs/discard-audit.md`.
- DispatchContext schema: all 7 subagents validate their input at dispatch time. A missing required field returns `BLOCKED: invalid_dispatch`. See AGENTS.md § DispatchContext Schema.
- Discard cost hierarchy: Cost-0 (pre-MODIFY strike) / Cost-1 (pre-commit audit) / Cost-2 (post-commit revert). Cost-2 discards that should have been caught earlier are logged as `escaped_discard` feedback to `.godmode/lessons.md`.
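
The dispatch-time check can be pictured as a simple required-field gate. In the sketch below the field names are hypothetical, since the real schema lives in AGENTS.md and is not reproduced here. Only the `BLOCKED: invalid_dispatch` status string comes from the text above.

```python
# Hypothetical required fields; the actual DispatchContext schema is in AGENTS.md.
REQUIRED_FIELDS = ("task", "scope", "success_command")


def validate_dispatch(context):
    """Return (status, missing_fields) instead of guessing absent values."""
    missing = [name for name in REQUIRED_FIELDS if not context.get(name)]
    if missing:
        return "BLOCKED: invalid_dispatch", missing
    return "OK", []


print(validate_dispatch({"task": "add rate limiter", "scope": "src/api/"}))
print(validate_dispatch({
    "task": "add rate limiter",
    "scope": "src/api/",
    "success_command": "npm test",
}))
```

Failing closed at dispatch time is the cheapest place to catch the problem: nothing has been edited yet, so there is nothing to discard.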

Token optimization (four-layer stack)

- Progressive Disclosure routing: the orchestrator reads only Tier 1 of each skill file (~20 lines) to match triggers. ~2,700 lines to route vs ~27,000 for full reads. ~90% routing-time context reduction.
- Stdio input-side compression: `skills/stdio/SKILL.md` documents 13 canonical command patterns (`git log` → `git log --oneline -20`, `cat` → `wc -l`, `ls -la` → `ls -1`, etc.) that every agent prefers. Pairs with rtk for shell-hook level enforcement.
- Terse output-side compression: `skills/terse/SKILL.md` auto-activates from round 2 onward. Compresses round summaries, status lines, and agent reports by 40-60%. TSVs, code, errors, commit messages, and the final summary stay verbose. Opt out with `/godmode:terse off` or `GODMODE_TERSE=0`.
- Token observability: `skills/tokens/SKILL.md` logs per-round input/output token counts to `.godmode/token-log.tsv` using a reproducible chars/4 heuristic. Answers "is my loop getting cheaper or more expensive?" with one awk one-liner. Opt out with `GODMODE_TOKENS=0`.
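
Two of the figures above are easy to check by hand: 134 skills at roughly 20 Tier 1 lines each is about 2,700 lines, and 2,700 against 27,000 is a 90% reduction. The sketch below works through that arithmetic and shows one way a chars/4 estimate could be logged per round. The TSV column order and the use of integer division are assumptions; the source only names the heuristic.

```python
import io


def estimate_tokens(text):
    """chars/4 heuristic: deterministic, so runs are comparable."""
    return len(text) // 4


def log_round(stream, round_no, prompt, reply):
    """Write one row: round, estimated input tokens, estimated output tokens."""
    stream.write("%d\t%d\t%d\n" % (
        round_no, estimate_tokens(prompt), estimate_tokens(reply)))


def reduction_percent(before, after):
    """Percentage saved when going from `before` to `after`."""
    return round((1 - after / before) * 100)


log = io.StringIO()  # stands in for .godmode/token-log.tsv
log_round(log, 1, "x" * 4000, "y" * 800)
print(log.getvalue(), end="")

print(134 * 20)                        # 2680, i.e. roughly 2,700 lines
print(reduction_percent(27000, 2700))  # 90
```

The heuristic is not an exact tokenizer count. Its value is that the same input always yields the same number, so round-over-round trends are meaningful.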

Coordination and research

- Named coordination patterns: every plan declares its outermost pattern from `docs/coordination-patterns.md`: Pipeline, Fan-out/Fan-in, Expert Pool, Producer-Reviewer, Supervisor, or Hierarchical Delegation. Plans without a declared pattern return `BLOCKED: invalid_plan`.
- Research auto-dispatch: before routing to `think` on any non-trivial task (mentions external lib/framework, >5 file scope, no prior `.godmode/research.md`), the orchestrator auto-dispatches `skills/research/SKILL.md` to gather prior art. Skip for trivial fixes or with `--no-research`.
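
The pattern requirement amounts to a membership check against the six names listed above. The sketch assumes a plan is a dictionary with a `pattern` key, which is a hypothetical shape chosen for illustration. The pattern names and the `BLOCKED: invalid_plan` status are taken from the text.

```python
COORDINATION_PATTERNS = {
    "Pipeline",
    "Fan-out/Fan-in",
    "Expert Pool",
    "Producer-Reviewer",
    "Supervisor",
    "Hierarchical Delegation",
}


def validate_plan(plan):
    """Reject plans that do not declare one of the six named patterns."""
    if plan.get("pattern") not in COORDINATION_PATTERNS:
        return "BLOCKED: invalid_plan"
    return "OK"


print(validate_plan({"goal": "add auth", "pattern": "Fan-out/Fan-in"}))
print(validate_plan({"goal": "add auth"}))
```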

The 8 pipeline skills inherit by default

THINK → PLAN → BUILD → TEST → FIX → OPTIMIZE → SECURE → SHIP

Each of these skills has a Rule 0 in its Hard Rules section explicitly inheriting all of the above. No per-skill opt-in. One command runs the whole stack.

Subagents (7)

| Agent | Role |
| --- | --- |
| planner | Decomposes goals into parallel tasks |
| builder | Implements tasks with TDD in isolated worktrees |
| reviewer | Code review for correctness, security, performance, style |
| optimizer | Autonomous measure --> modify --> verify loop |
| explorer | Read-only codebase reconnaissance |
| security | STRIDE + OWASP audit with 4 adversarial personas |
| tester | TDD test generation, RED-GREEN-REFACTOR |

Skills (134)

Godmode includes 134 skills across 13 domains. Each skill encodes a real engineering workflow — not just instructions, but a complete protocol with verification steps.

| Domain | Count | Highlights |
| --- | --- | --- |
| Core Workflow | 15 | `godmode` `think` `plan` `build` `test` `review` `optimize` `debug` `fix` `ship` `verify` |
| Discipline & Context | 8 | `principles` `terse` `stdio` `tokens` `research` `bench` `team` `tutorial` (new) |
| Architecture & Design | 10 | `architect` `rfc` `ddd` `pattern` `schema` `distributed` `scale` `migration` |
| API & Backend | 14 | `api` `graphql` `grpc` `orm` `cache` `queue` `event` `realtime` `webhook` |
| Frameworks | 12 | `react` `nextjs` `vue` `svelte` `node` `fastapi` `django` `rails` `spring` |
| Security & Compliance | 8 | `secure` `auth` `rbac` `pentest` `devsecops` `comply` |
| Testing | 7 | `e2e` `integration` `loadtest` `perf` `webperf` |
| DevOps & Infra | 16 | `k8s` `docker` `cicd` `ghactions` `infra` `observe` `resilience` |
| Frontend & UI | 9 | `ui` `a11y` `seo` `mobile` `designsystem` `responsive` |
| Databases | 3 | `postgres` `redis` `nosql` |
| AI & ML | 5 | `ml` `mlops` `rag` `prompt` `eval` |
| Developer Experience | 13 | `docs` `refactor` `git` `pr` `monorepo` `changelog` `slo` |
| Integrations | 14 | `i18n` `pay` `cli` `agent` `feature` `chaos` `experiment` |

The 8 Discipline & Context skills (shipped across Phases 0–E)

| Skill | What it does |
| --- | --- |
| `principles` | Karpathy authoring-discipline prelude. Four rules every agent reads before the first Edit. Imported via `@./` so every adapter gets it automatically. |
| `terse` | Output compression for long autonomous loops. Auto-activates from round 2. 40-60% token reduction on emit side. TSVs, code, errors, commits, final summary stay verbose. |
| `stdio` | 13 canonical command patterns for minimizing tool-output token waste. Godmode-native alternative/complement to rtk. |
| `tokens` | Per-round token-budget observability. Logs input/output counts to `.godmode/token-log.tsv`. Answers "is my loop getting cheaper over time?" |
| `research` | Prior-art gathering phase. Auto-dispatched before `think` on non-trivial tasks. Uses the `explorer` subagent. Writes `.godmode/research.md`. |
| `bench` | Formal benchmark harness. 3-arm eval (baseline + variants) with N-run variance recovery. Writes `.godmode/bench-results.tsv`. |
| `team` | Team bundle primitive. Compose existing skills into named sequences via YAML bundles in `.godmode/teams/<name>.yaml`. Uses one of the 6 coordination patterns. |
| `tutorial` | Day-0 walkthrough. 7 steps, ≤5 minutes, first `/godmode:optimize` run. |

Full skill reference: skills/

Platforms

| Platform | Agents | Setup |
| --- | --- | --- |
| Claude Code | Parallel (worktrees) | `claude plugin install godmode` |
| Codex | Native agents | Clone + use `.codex/` config |
| Cursor | Background agents | `bash adapters/cursor/install.sh` |
| Gemini CLI | Sequential | `bash adapters/gemini/install.sh` |
| OpenCode | Sequential | `bash adapters/opencode/install.sh` |

All 134 skills work on every platform. Parallel agent skills automatically degrade to sequential on platforms without native agent dispatch. The authoring-discipline prelude, Progressive Disclosure routing, and pre-commit discard audit all reach every adapter — Claude Code via SKILL.md, Gemini and OpenCode via their respective entry files importing @./skills/principles/SKILL.md.

Verify your installation: bash adapters/<platform>/verify.sh

Philosophy

Discipline before speed. Every change is measured. Bad changes are reverted.

Evidence before claims. "Looks good" is rejected. Numeric proof is required.

Git is memory. Every experiment is committed. Every revert is in the log.

Keep or discard. Binary decisions only. No maybes.

Learn from failure. Every discard is classified and remembered.

Simplicity first. Complex changes that marginally improve metrics are discarded. The pre-MODIFY checklist strikes speculative code before it's written.

Think before coding. If two interpretations exist, present them — never silently pick. Emit NEEDS_CONTEXT on ambiguity.

Surgical changes only. Every semantically changed line must trace directly to the user's request. Adjacent "improvements" are line_scope_drift — dropped at pre-commit audit.

Goal-driven execution. Success is a shell command that exits zero. Subjective criteria ("works well," "looks good," "is faster") are vibes — reject them before coding.

Token budget is a first-class metric. Input-side (stdio), routing (Progressive Disclosure), output-side (terse), and observability (tokens) stack multiplicatively. Every round logs its context cost.

Contributing

Every skill is a Markdown file. If you can write clear instructions, you can add a skill.

See CONTRIBUTING.md for the complete guide.

Frequently Asked Questions

How does Godmode differ from Copilot, Cursor, or other AI coding tools?

Godmode is not a replacement for these tools — it's a plugin that makes them better. Copilot and Cursor generate code in a single pass. Godmode adds autonomous iteration: it measures results, keeps improvements, reverts failures, and repeats until the goal is met. It works inside Cursor and Claude Code, not instead of them.

Does Godmode work with any programming language?

Yes. Godmode skills are language-agnostic — they define engineering workflows (test, measure, verify), not language-specific syntax. If your AI assistant supports a language, Godmode's skills work with it. Framework-specific skills (React, Django, Rails, etc.) provide additional specialized guidance.

Can I use Godmode for free?

Godmode itself is free and open source (MIT license). You need a working installation of one of the supported AI coding tools (Claude Code, Cursor, Codex, Gemini CLI, or OpenCode), which may have their own pricing.

What does "autonomous" mean? Does it run without human input?

Yes. Once you set a goal and a metric (e.g., "reduce API latency, measured by curl -w '%{time_total}'"), Godmode runs the optimization loop autonomously — modifying code, verifying results, keeping improvements, reverting failures — without asking for approval between iterations. You can set an iteration limit or let it run until interrupted.

Is it safe? Can it break my code?

Every change is committed to git before verification. If a change makes things worse, it's automatically reverted with git reset. Your codebase never stays in a broken state. Guard commands (tests, linting, build) must pass for any change to be kept.

How do I add my own skills?

Every skill is a Markdown file. Copy an existing skill, modify it, and drop it in the `skills/` directory. See CONTRIBUTING.md for the full guide. New skills should follow the Progressive Disclosure convention: `## Activate When` immediately after the frontmatter (Tier 1), then workflow and hard rules (Tier 2), then an optional marker before examples and error recovery.

What is the authoring-discipline prelude?

A set of four behavioral rules based on Andrej Karpathy's observations on common LLM coding mistakes, shipped as `skills/principles/SKILL.md`. Every agent reads it before the first Edit on every task: (1) Think Before Coding: state assumptions, never silently pick; (2) Simplicity First: run the pre-MODIFY checklist, strike speculative code; (3) Surgical Changes: follow the line-trace rule, adjacent improvements are `line_scope_drift`; (4) Goal-Driven Execution: success is a shell command exiting zero. The rules are enforced mechanically by the pre-commit discard audit in `agents/builder.md`, `agents/tester.md`, and `agents/optimizer.md`.

How does Godmode save tokens on long autonomous loops?

Four layers of compression that stack multiplicatively:

- Progressive Disclosure: the orchestrator reads only Tier 1 (~20 lines) of each skill when routing. ~2,700 lines to route vs ~27,000 for full reads. ~90% routing-time context reduction.
- Stdio input-side compression: 13 canonical command patterns (`git log --oneline -20` instead of `git log`, `wc -l` instead of `cat`, etc.). Godmode's native convention; pairs with rtk for shell-hook enforcement.
- Terse output-side compression: auto-activates from round 2. Round summaries and agent reports compress 40-60%. TSVs, code, errors, commits, and the final summary stay verbose.
- Token observability: per-round input/output counts logged to `.godmode/token-log.tsv` so you can actually measure whether the other three are helping.

Opt out selectively: /godmode:terse off, GODMODE_TOKENS=0. Principles prelude and Progressive Disclosure have no opt-out — they're mechanical gates.

What does "Default Activations" mean?

Every improvement shipped in Phases 0–E fires automatically on every /godmode:* invocation. No flags. No opt-in. The 8 pipeline skills (think, plan, build, test, fix, optimize, secure, ship) each inherit the full stack via a Rule 0 in their Hard Rules section that references SKILL.md §14 Default Activations. A first-time user running /godmode make my API faster on a fresh repo gets: research auto-dispatched (if non-trivial), principles prelude read by every agent, pre-commit audit dropping drift hunks before commit, terse mode activating at round 2, token logging at every round, Progressive Disclosure routing at ~90% context reduction, and a plan with a declared coordination pattern. Every one of those was opt-in before Phase E.

License

MIT -- see LICENSE.

Discipline before speed. Evidence before claims. Git is memory.

Install | Docs | FAQ | Troubleshooting | Discuss
