SafePaw Gateway

The security perimeter for self-hosted AI assistants.

Free, open-source security gateway for self-hosted AI backends — secure defaults, documented threat model, 534 tests.

No cloud. No accounts. No telemetry. We don't collect your data — at all. You can scan the repo to verify; see docs/NO-DATA-COLLECTION.md. Just a gateway, a wizard, and 534 tests.

SafePaw is a standalone security program developed by BeautifulPlanet. It is NOT a fork of OpenPaw and has no affiliation with Flexasaurusrex.

What is this?

Most self-hosted AI backends ship with security features that are opt-in and scattered across docs. Auth needs manual setup. Rate limiting needs configuration. SSRF and brute-force protection require knowing they exist first. Most people skip it and run exposed.

SafePaw makes the secure path the easy path. It wraps OpenClaw (or any HTTP backend) in a hardened local perimeter — a Go reverse-proxy gateway with auth, rate limiting, and prompt-injection scanning, plus a React wizard for setup, monitoring, and admin — and ships with secure defaults on: auth enforced, rate limiting active, brute-force protection on, output scanning running. Everything runs in Docker Compose on your machine with a single command. Nothing phones home.

Think of it as the deadbolt kit for your AI stack — pre-configured, pre-tested, secure out of the box. Not because your backend is broken. Because hardening it yourself takes expertise most people don't have and time nobody wants to spend.

Who needs this?

You are…	Your problem	SafePaw gives you
Tinkerer who just got OpenClaw running	It's exposed on a port with no auth, no rate limiting, no scanning	Gateway with HMAC auth, brute-force protection, 14-pattern prompt-injection scanner
Small-team admin sharing an AI assistant	No audit trail, no MFA, users share one password	Wizard with session tokens, optional TOTP, per-action audit log
Security-conscious dev who reads threat models	You want proof, not promises	48 STRIDE threats documented, 534 tests, 7 fuzz targets, govulncheck in CI
Ops person who gets paged at 3 AM	Something broke, no runbook	6 incident playbooks, backup/restore procedures, Grafana dashboards

What you get

Layer	Highlights
Gateway	HMAC-SHA256 auth · Redis-backed revocation · per-IP rate limits · brute-force IP bans · 14-pattern body scanner · output scanner (XSS, secret leaks) · Prometheus metrics · structured JSON logging
Wizard	React 19 SPA embedded via `go:embed` · admin login with optional TOTP · prerequisite checks · live container dashboard · masked `.env` editing · audit log
Compose stack	5 services (wizard, gateway, OpenClaw, Redis, Postgres) · health checks · resource limits · only wizard + gateway exposed on `127.0.0.1`
Ops docs	STRIDE threat model (48 threats) · 6 incident runbooks · backup/restore · secret rotation · Grafana alerts
CI pipeline	Build · test (`-race`) · lint · gosec · govulncheck · fuzz seeds · coverage gates · Docker build

Quick start

New here? Try the demo in 3 steps: (1) Clone the repo, (2) run LAUNCH.bat (Windows) or ./LAUNCH.sh (Mac/Linux), (3) press 2 for Demo → browser opens at http://localhost:3000. Sign in with the password shown in the launcher window. No API keys or OpenClaw required. Full steps →

git clone https://github.com/beautifulplanet/SafePaw.git
cd SafePaw
# Windows: LAUNCH.bat  |  macOS/Linux: ./LAUNCH.sh  → press 2 for Demo
./LAUNCH.sh

Windows: Use LAUNCH.bat for a one-click menu (Full / Demo / Shut down / Show processes). Use STOP-SAFEPAW.bat for an emergency stop (can be shortcut on desktop).
macOS/Linux: Use LAUNCH.sh for the same menu (./LAUNCH.sh). See docs/VERIFY-LAUNCHER.md to verify.

First time testing or demoing? See docs/HOW-TO-TEST-AND-WHERE-WE-ARE.md for a minimal test path and what “working product” means.

That's it. The script checks Docker, generates secrets, picks a memory profile for your system, starts everything, and opens your browser at http://localhost:3000.

What happens

start.sh generates .env with secure random passwords (Redis, Postgres, auth secret, wizard password)
Detects RAM → sets SYSTEM_PROFILE (small / medium / large / very-large)
Runs docker compose up -d --build
Waits for health checks, prints your wizard password, opens the wizard

Prerequisites: Docker + Compose V2 · ports 3000 and 8080 free

Verify it works

curl -s http://localhost:3000/api/v1/health | jq .   # wizard
curl -s http://localhost:8080/health | jq .           # gateway

Open http://localhost:3000, sign in, check the dashboard. Full verification script: ./scripts/verify-deployment.sh.

Evidence

Every number in this README is provable. Run the commands yourself.

Claim	How to verify
534 Go tests (346 gateway + 188 wizard)	`grep -rE '^func Test' services/gateway --include='*.go'
7 fuzz targets	`make fuzz` or `grep -r "^func Fuzz" services/gateway/`
Gateway coverage 80.5% (CI gate: >65%)	`cd services/gateway && go test -coverprofile=c.out ./... && go tool cover -func=c.out \| grep total`
Wizard coverage 64.2% (CI gate: ≥60%)	Same command under `services/wizard`
48 STRIDE threats modeled	`grep -cE "^\| [A-Z][0-9]" THREAT-MODEL.md`
6 incident runbooks	`grep -c "^## INC-" RUNBOOK.md`
Zero CVEs in deps	`make vulncheck` (runs govulncheck on both services)
Stack runs entirely local	`docker compose ps` — no outbound connections except your LLM API key

How the pieces fit

Internet ─→ [Gateway :8080] ─→ [OpenClaw :18789]
                │                       │
            Auth · Rate Limit      AI Assistant
            Scanner · Metrics      Channels · LLM
                │
            [Redis :6379]          [Postgres :5432]
             Revocation              Config · Audit
             Brute-force

Browser ──→ [Wizard :3000]
             Setup · Dashboard
             .env · TOTP · Audit

Only the gateway and wizard are exposed on 127.0.0.1. OpenClaw, Redis, and Postgres live on an internal Docker network with no host ports. Full architecture with mermaid diagrams: docs/ARCHITECTURE.md.

Documentation

Everything deep lives in its own doc. The README is the routing table.

Architecture & design

Document	What's in it
docs/ARCHITECTURE.md	Full architecture — mermaid diagrams, middleware pipeline, auth flows, WebSocket proxy, scanning, deployment topology, technology decisions
docs/adr/	9 architecture decision records — HMAC over JWT, zero deps, Go choice, socket proxy, heuristic scanning, embedded UI, receipt ledger, CSRF, Codespaces routing

Security & threats

Document	What's in it
SECURITY.md	Defense-in-depth walkthrough, incident response, hardening checklist, MFA setup, logging for SIEM, vulnerability management, password recovery
THREAT-MODEL.md	STRIDE analysis — 48 threats across 6 categories, mitigations, residual risks, trust boundaries
docs/PENTEST-POLICY.md	Penetration testing scope, methodology, responsible disclosure

Operations

Document	What's in it
docs/PHONE-ACCESS.md	Let people use their phone with OpenClaw — expose gateway, auth, tokens, CORS, TLS
RUNBOOK.md	6 incident playbooks — token compromise, injection detected, gateway down, brute force, secret rotation, disk full
BACKUP-RECOVERY.md	Backup and restore for Postgres, Redis, Docker volumes, `.env`
docs/SECRETS-MIGRATION.md	Migration guide from env vars to Vault / external secrets

Development & configuration

Document	What's in it
docs/DEVELOPMENT.md	Configuration reference, build commands, testing guide, project structure, troubleshooting
CONTRIBUTING.md	Dev workflow, coding standards, PR process, pre-commit hooks
PITFALLS.md	Known gotchas and edge cases
CHANGELOG.md	Release history (Keep a Changelog format)

Compliance & policy

Document	What's in it
docs/COMPLIANCE.md	SOC 2 & GDPR control mapping with gap analysis
docs/PATCHING-POLICY.md	Dependency update SLAs, Dependabot workflow, freeze policy

Scope & delivery

Document	What's in it
docs/scope/SOW-001.md	Original statement of work — all items delivered
docs/scope/CO-001.md	Change order: per-endpoint rate limits — delivered
docs/scope/CO-002.md	Change order: Playwright E2E login tests — delivered
SCOPE-IMPROVEMENTS.md	Review feedback triage and improvement backlog

FAQ

Can I use this without OpenClaw? Yes. The gateway proxies to any HTTP backend — change PROXY_TARGET in .env. The wizard's container dashboard is OpenClaw-aware, but the gateway is generic.

How do I add MFA to the wizard? Set WIZARD_TOTP_SECRET in .env to a base32 secret. The login page will prompt for a TOTP code automatically. Details in SECURITY.md.

Is this production-ready? For localhost or VPN deployments, yes. For public-facing setups, enable TLS, set a strong AUTH_SECRET, and work through the hardening checklist in SECURITY.md.

How does the scanner work? Heuristic pattern-matching — 14 patterns for prompt injection, plus an output scanner for XSS and secret leaks. It's one layer of defense-in-depth, not a silver bullet. Design rationale: ADR-005.

Where's the config reference? docs/DEVELOPMENT.md has the full environment variable table. Quick version: copy .env.example and fill in your API keys.

What SafePaw does NOT cover

SafePaw is a transport-layer perimeter. It secures the path between clients and the AI backend. It does not sandbox what happens inside the backend or on the host.

Threat	SafePaw coverage
Unauthenticated access to the AI backend	✅ Blocked (HMAC auth + brute-force IP bans)
Credential stuffing / brute force	✅ Blocked (rate limiting + escalating bans)
Prompt injection in HTTP request bodies	✅ Flagged (14-pattern heuristic scanner)
XSS / secret leaks in HTTP responses	✅ Sanitized (output scanner)
WebSocket stream injection/exfiltration	⚠️ Logged only — not blocked (see note below)
Agent executing dangerous host tool calls	❌ Out of scope — transport layer only
Browser SSRF to cloud metadata endpoints	❌ Out of scope — host/browser sandbox layer
Arbitrary JS execution via browser tools	❌ Out of scope — backend sandbox responsibility
Plugin/hook filesystem integrity	❌ Out of scope — host-level controls

WebSocket note: Output sanitization (stripping XSS, secret leak patterns) applies to HTTP responses only. WebSocket streams are scanned and logged but passed through unmodified — modifying payload bytes without updating binary frame headers corrupts the stream. This is a documented design trade-off in services/gateway/middleware/output_scanner.go. Most real-time AI chat traffic flows over WebSocket. See ARCHITECTURE.md for details.

SafePaw secures the front door. It does not sandbox what happens inside the house. For host-level and agent-level sandboxing, configure your backend's sandbox mode and run it in Docker with restricted capabilities.

Design boundaries

SafePaw secures a single AI stack with depth, not breadth:

Transport-layer scope. Auth, rate limiting, scanning, and audit logging at the HTTP/WebSocket boundary. Host-level and agent-level controls are out of scope by design.
Scanning is heuristic. Pattern-based tripwires, not ML classifiers. Documented as a scope boundary in THREAT-MODEL.md.
Single-instance. Designed for indie and small-team deployments. Enterprise can layer WAF and external IdP on top.
Local-first. No cloud dependencies at runtime. Your data stays on your machine.
No data collection. No analytics, no tracking, no phone-home. We don't have your data because we never collect it. How to verify →

License

MIT — see LICENSE.

Acknowledgements

Built in collaboration with Claude (Opus) by Anthropic. Architecture decisions, security hardening, and all review by @beautifulplanet.

Name		Name	Last commit message	Last commit date
Latest commit History 206 Commits
.agents/workflows		.agents/workflows
.devcontainer		.devcontainer
.github		.github
docs		docs
monitoring		monitoring
scripts		scripts
services		services
shared		shared
.env.example		.env.example
.gitattributes		.gitattributes
.gitignore		.gitignore
BACKUP-RECOVERY.md		BACKUP-RECOVERY.md
CHANGELOG.md		CHANGELOG.md
CONTRIBUTING.md		CONTRIBUTING.md
LAUNCH.bat		LAUNCH.bat
LAUNCH.sh		LAUNCH.sh
LICENSE		LICENSE
Log-Launch.ps1		Log-Launch.ps1
Makefile		Makefile
PITFALLS.md		PITFALLS.md
README.md		README.md
RELEASE.md		RELEASE.md
RUNBOOK.md		RUNBOOK.md
SAFEGUARDS.md		SAFEGUARDS.md
SCOPE-IMPROVEMENTS.md		SCOPE-IMPROVEMENTS.md
SECURITY.md		SECURITY.md
SPEC-COST-MONITORING.md		SPEC-COST-MONITORING.md
START-DEMO.bat		START-DEMO.bat
START-DEMO.ps1		START-DEMO.ps1
STOP-SAFEPAW.bat		STOP-SAFEPAW.bat
THREAT-MODEL.md		THREAT-MODEL.md
create-shortcuts.bat		create-shortcuts.bat
docker-compose.demo.yml		docker-compose.demo.yml
docker-compose.yml		docker-compose.yml
go.work		go.work
run-local.ps1		run-local.ps1
start.bat		start.bat
start.sh		start.sh
status.bat		status.bat
status.sh		status.sh
stop.bat		stop.bat
stop.sh		stop.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

SafePaw Gateway

What is this?

Who needs this?

What you get

Quick start

What happens

Verify it works

Evidence

How the pieces fit

Documentation

Architecture & design

Security & threats

Operations

Development & configuration

Compliance & policy

Scope & delivery

FAQ

What SafePaw does NOT cover

Design boundaries

License

Acknowledgements

About

Uh oh!

Releases 1

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

SafePaw Gateway

What is this?

Who needs this?

What you get

Quick start

What happens

Verify it works

Evidence

How the pieces fit

Documentation

Architecture & design

Security & threats

Operations

Development & configuration

Compliance & policy

Scope & delivery

FAQ

What SafePaw does NOT cover

Design boundaries

License

Acknowledgements

About

Topics

Resources

License

Contributing

Security policy

Uh oh!

Stars

Watchers

Forks

Releases 1

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages