A curated list of autonomous improvement loops, research agents, and autoresearch-style systems inspired by Karpathy's autoresearch.
Updated Apr 24, 2026
Group Evolving Agents: Open-Ended Self-Improvement via Experience Sharing
SutroYaro — Sutro Group research workspace for energy-efficient AI training. Point any coding agent at the repo and it becomes a research agent. 34 experiments, eval environment, weekly catch-ups, multi-researcher workflow.
🤖 CodeForge AI: An autonomous multi-agent coding system powered by LangGraph for agentic software development and automated workflows. SOTA custom agentic GraphRag, shared-state memory, auto-model routing for cost optimization, and a range of custom tooling.
Faraday: An Autonomous Web Research Agent (LangGraph/Streamlit). 🕵️‍♀️ Investigates queries using dynamic tools (Tavily, Google, NewsAPI, etc.), gathers multi-source info, and synthesizes structured reports in a Streamlit UI. Features agentic workflow & source tracking.
LitReview Skill is an installable agent skill for end-to-end literature review generation. It helps agents conduct literature reviews with a well-designed and widely used review framework so the search process is broad, iterative, and less likely to miss relevant articles.
Foundation for an open strong-agent platform: controllers, operators, skills, A2A, runtime, and graph execution.
Lightweight Python CLI for the Exa API (Search, Contents, Find Similar, Answer, Research, Context) with JSON-first output, SSE streaming, and model-aware polling. LLM‑agnostic: integrate with OpenAI Agents SDK/Codex CLI or Claude tool use by invoking CLI commands, no MCP server required.
A Markdown protocol for AI agents that analyze public agendas instead of summarizing them badly.
Codex-native autonomous research loop: source-gated mentor council, submission advisor, small-step execution, and GitHub delivery.
🤖 Build and interact with Claude Agent using this Python SDK for seamless integration and efficient asynchronous querying.
An advanced agentic workflow implementation using LangGraph and LangChain, featuring iterative research, autonomous planning, and persistent state management for high-quality content generation.
A tool that uses AI agents to help with job applications.
Autonomous ML research loops for Claude Code with mechanical anti-fabrication guards.
Modular multi-agent AI architecture for deep research, long-context reasoning, and reliable execution.
Provider-agnostic multi-agent orchestration runtime with LangGraph, MCP tools, CLI, FastAPI, evidence capture, and citation-aware outputs.
Track public autoresearch use cases across industries with a curated list of repos, write-ups, and discussions
Organize genealogy research with structured AI prompts, vault templates, and workflows for source-backed family history work
Falsification-first research agent for trustworthy scientific writing.