Add exploit development workflow and vector memory system design by valITino · Pull Request #77 · valITino/blhackbox

valITino · 2026-03-25T09:36:27Z

Summary

This PR introduces two major capabilities to BLHACKBOX inspired by PentAGI's architecture:

Exploit Development Skill (/exploit-dev) — A structured workflow for writing, testing, and iterating on custom exploit code
Vector Memory System Design — Comprehensive design document for semantic search across past findings and exploits using PostgreSQL + pgvector
Supporting Infrastructure — ExploitDB integration, loop detection hooks, and Neo4j knowledge graph auto-population

Key Changes

Exploit Development Workflow

New skill: /exploit-dev command for autonomous exploit development with write-test-iterate loops
Skill template (blhackbox/prompts/templates/exploit-dev.md) — 6-phase methodology: intelligence gathering, design, development, validation, payload variants, and results aggregation
Kali MCP enhancement: Added searchsploit tool integration with two new MCP tools:
- search_exploits() — Query ExploitDB with structured JSON results
- get_exploit_code() — Retrieve full source code of discovered exploits
Exploit templates — Quick-reference code patterns for HTTP exploits, SQL injection, and other common attack vectors

Vector Memory System (Design Document)

Design document (docs/vector-memory-design.md) — Complete architecture for semantic search across engagement history
6 MCP tools planned: recall_similar, store_finding, store_exploit, store_guide, get_memory_stats, auto_store_session
Database schema — PostgreSQL + pgvector with HNSW indexing for fast similarity search
Docker service — memory-mcp container behind --profile memory flag
Embedding strategy — Support for OpenAI, local Ollama, and future Anthropic embeddings
Integration points — Auto-import from aggregate_results, memory recall in skill templates, exploit storage in /exploit-dev

PentAGI Comparison & Roadmap

Analysis document (docs/pentagi-comparison.md) — Detailed comparison of PentAGI's 13-agent architecture vs BLHACKBOX's single-LLM approach
Optimization roadmap — P0-P3 priorities for adopting PentAGI patterns (exploit dev, vector memory, loop detection, model routing)
Barrier function pattern — Explanation of PentAGI's agent supervision mechanism

Supporting Infrastructure

Loop detection hook (.claude/hooks/loop-detector.sh) — Detects repeated identical tool calls and injects warnings to prevent infinite loops (inspired by PentAGI's Reflector agent)
Claude Code settings — Integrated loop detector as PreToolUse hook for MCP tools
Neo4j auto-population — Enhanced aggregate_results to auto-populate knowledge graph with hosts, services, vulnerabilities, and relationships
Knowledge graph helper (_populate_knowledge_graph()) — Structured extraction from AggregatedPayload into Neo4j nodes

Documentation Updates

Updated README.md to list /exploit-dev skill and mention searchsploit in Kali MCP tools
Updated DOCKER.md with new skill documentation
Added .env.example placeholders for future vector memory configuration

Implementation Notes

Exploit-dev skill is fully functional and ready for use; it leverages existing Kali MCP tools (run_kali_tool, capture_packets, follow_stream)
Vector memory system is documented but not yet implemented — marked as "Design (ready for implementation)" with a 4-phase migration path
Loop detector uses session-based tracking with configurable thresholds (blocks after 4 identical calls, warns at 3)
Neo4j integration is best-effort; failures don't block aggregate_results execution
All new code follows existing patterns: async/await, structured JSON responses, comprehensive error handling

Testing Recommendations

Test /exploit-dev skill against a known vulnerable service (e.g., DVWA, WebGoat)
Verify search_exploits and `

https://claude.ai/code/session_01E2e15egjZn2tbVSRPCHWpm

…dmap Detailed analysis of PentAGI's 13-agent delegation system ("simulated security firm") compared to BLHACKBOX's MCP-native architecture. Identifies 8 specific optimization opportunities prioritized by ROI, with P0 items being: exploit development skill (Coder agent equivalent) and vector memory system (pgvector for cross-session learning). Also documents areas where BLHACKBOX is already superior (70+ tools, WireMCP, Screenshot MCP, MCP standard, PoC rigor). https://claude.ai/code/session_01E2e15egjZn2tbVSRPCHWpm

Added findings from Go source code analysis (commit e97bbe5): - Barrier function pattern per agent (hack_result, code_result, etc.) - Tool isolation: each agent sees ONLY its allowed tools (enforced in Go) - Full orchestration flow: Generator → Primary Agent → sub-agent loops - Reflector catches text-only responses AND tool-limit approach - Memory system dual thresholds: 0.45 for broad recall, 0.75 for search - Retry constants: maxRetries=3, delay=5s - Key source file reference table for implementation guidance https://claude.ai/code/session_01E2e15egjZn2tbVSRPCHWpm

…inding - PentAGI uses 4-tier similarity thresholds: 0.45 general, 0.7 code, 0.75 search, 0.8 guides - Clarified PentAGI is custom Go framework, NOT LangChain/LangGraph - Added "Key Architectural Insights" section with 5 design principles - Noted tool isolation as security model (Primary Agent can't run commands) - Documented Coder→Pentester delegation flow for exploit development https://claude.ai/code/session_01E2e15egjZn2tbVSRPCHWpm

PentAGI recommends isolating worker execution on a separate server with Docker-in-Docker over TLS to prevent untrusted exploit code from running on the primary system. Worth considering for BLHACKBOX production deployments. https://claude.ai/code/session_01E2e15egjZn2tbVSRPCHWpm

Five features inspired by PentAGI's multi-agent architecture: 1. /exploit-dev skill (P0) — New Claude Code skill for custom exploit development with write-test-iterate loops. Mirrors PentAGI's Coder agent: research vuln, design exploit, write code, test in Kali container, iterate on failures, capture evidence. 2. Loop detection hook (P1) — PreToolUse hook that tracks MCP tool calls and detects repeated identical calls (threshold: 4). Blocks looping calls and warns at session milestones (50/80 calls). Mirrors PentAGI's Reflector agent. 3. Auto knowledge graph population (P1) — aggregate_results now auto-populates Neo4j with hosts, services, vulnerabilities, subdomains, and endpoints from AggregatedPayload. Previously only stored a flat session node. 4. search_exploits + get_exploit_code tools (P2) — Dedicated searchsploit MCP tools for Kali MCP server with structured JSON output. Mirrors PentAGI's Sploitus integration. 5. Vector memory system design doc (P0) — Architecture for pgvector- backed cross-session memory with 6 MCP tools and 4-tier similarity thresholds. Ready for implementation. All 187 existing tests pass. No breaking changes. https://claude.ai/code/session_01E2e15egjZn2tbVSRPCHWpm

Documentation (README.md, DOCKER.md): - Add /exploit-dev to skills table (10→11 skills) - Add "Advanced Features" section: exploit-dev, searchsploit tools, loop detection, auto knowledge graph - Update architecture diagram to include searchsploit - Update project structure with loop-detector.sh and exploit-dev - Update components table with searchsploit mention Neo4j auto-graph fixes: - Validate port range (1-65535) before creating port nodes - Add index to vuln identifier fallback to prevent node collisions - Strip and validate subdomains before creating domain nodes - Use proper type annotations (string forward refs) instead of Any - Upgrade Neo4j exception logging from DEBUG to INFO Template system: - Create blhackbox/prompts/templates/exploit-dev.md (MCP template) - Register exploit-dev in TEMPLATES dict (prompts/__init__.py) - Now 11 templates accessible via get_template/list_templates API Environment: - Add vector memory system vars to .env.example (commented out) - Update OpenAI key description to mention embedding use All 187 tests pass. https://claude.ai/code/session_01E2e15egjZn2tbVSRPCHWpm

- Revert type annotations to Any (ruff F821: types not imported at module scope) - Rewrite exploit-dev SKILL.md with specific tool calls, decision trees, iteration tracking, and PentAGI Coder patterns (search first, 5-iter limit) - All ruff checks pass, all 187 tests pass https://claude.ai/code/session_01E2e15egjZn2tbVSRPCHWpm

Applied three key improvements across all skills: 1. Tool discovery mandate — All skills now include "Query each MCP server's tool listing" in their pre-flight checklist 2. MCP Tool Quick Reference — Every skill now has a reference section with specific tool calls: - Exploitation skills: searchsploit, msfconsole search, capture_packets, extract_credentials, follow_stream, take_screenshot, take_element_screenshot, annotate_screenshot, /exploit-dev reference - Recon skills: passive DNS tools, passive OSINT tools - OSINT skill: passive-only tools with explicit "no active testing" note 3. Decision gates and specifics — vuln-assessment and quick-scan now have explicit exploitation decision flows (validate → exploit → if WAF blocks → try bypass → if needs custom code → /exploit-dev) Skills updated: full-pentest, full-attack-chain, quick-scan, vuln-assessment, web-app-assessment, network-infrastructure, api-security, bug-bounty, recon-deep, osint-gathering All ruff checks pass, all 187 tests pass. https://claude.ai/code/session_01E2e15egjZn2tbVSRPCHWpm

CVE-2026-4539 is a low-severity local ReDoS in pygments' AdlLexer (CVSS 3.3, local-only attack vector). No upstream patch exists yet. - Pin pygments>=2.19.2 in requirements.txt to auto-upgrade when fix lands - Add --ignore-vuln CVE-2026-4539 to CI pip-audit step with comment to remove when pygments patches - Reference: https://nvd.nist.gov/vuln/detail/CVE-2026-4539 https://claude.ai/code/session_01E2e15egjZn2tbVSRPCHWpm

claude added 9 commits March 25, 2026 08:26

valITino marked this pull request as ready for review March 25, 2026 12:27

valITino merged commit c1303bf into main Mar 25, 2026
2 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add exploit development workflow and vector memory system design#77

Add exploit development workflow and vector memory system design#77
valITino merged 9 commits intomainfrom
claude/analyze-pentagi-architecture-G3UT0

valITino commented Mar 25, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

valITino commented Mar 25, 2026

Summary

Key Changes

Exploit Development Workflow

Vector Memory System (Design Document)

PentAGI Comparison & Roadmap

Supporting Infrastructure

Documentation Updates

Implementation Notes

Testing Recommendations

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants