Experiment Agent

A Claude Code skill for executing, monitoring, interpreting, and verifying experiments in academic research.

What It Does

Runs code experiments: executes scripts (Python, R, etc.), monitors for stalls and crashes in real time, and collects results
Manages human studies: plans protocols, checks IRB ethics requirements, tracks data collection progress
Interprets statistics: reads p-values, effect sizes, and CIs; checks for 11 types of statistical fallacies (Simpson's Paradox, survivorship bias, etc.)
Verifies reproducibility: re-runs experiments and compares results
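
To make one of the fallacy checks concrete, here is a minimal sketch of how a Simpson's Paradox reversal could be flagged. It is an illustration, not the skill's actual check: the function name `simpson_reversal`, the row layout, and the counts are all assumptions made for this example.

```python
from collections import defaultdict


def simpson_reversal(rows, group_a, group_b):
    """Return True when the pooled comparison of two groups points the
    opposite way from the comparison inside every stratum.

    rows: iterable of (stratum, group, successes, total) tuples.
    This layout is a hypothetical one chosen for the sketch.
    """
    pooled = defaultdict(lambda: [0, 0])   # group -> [successes, total]
    by_stratum = defaultdict(dict)         # stratum -> {group: rate}
    for stratum, group, successes, total in rows:
        pooled[group][0] += successes
        pooled[group][1] += total
        by_stratum[stratum][group] = successes / total

    pooled_diff = (pooled[group_a][0] / pooled[group_a][1]
                   - pooled[group_b][0] / pooled[group_b][1])
    stratum_diffs = [rates[group_a] - rates[group_b]
                     for rates in by_stratum.values()]
    # A reversal means every within-stratum difference has the opposite
    # sign to the pooled difference.
    return all(diff * pooled_diff < 0 for diff in stratum_diffs)


# Illustrative counts only: group A does better within each stratum,
# yet group B does better once the strata are pooled.
ROWS = [
    ("small", "A", 81, 87),
    ("small", "B", 234, 270),
    ("large", "A", 192, 263),
    ("large", "B", 55, 80),
]

if __name__ == "__main__":
    print(simpson_reversal(ROWS, "A", "B"))
```

A flag like this only describes the pattern in the data. Deciding which comparison is the right one for the research question stays with the researcher.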

Why It Exists

Lu et al. (2026, Nature) demonstrated an Experiment Progress Manager for autonomous AI research. This skill brings the same execute-and-monitor capability to human-in-the-loop academic workflows — without the risks of full automation.

Modes

| Mode | What It Does |
| --- | --- |
| `run` | Execute code + monitor process |
| `manage` | Plan + track human studies |
| `validate` | Statistical interpretation + reproducibility check |
| `plan` | Socratic dialogue to design experiments |
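
As a rough picture of the reproducibility half of `validate`, the sketch below compares the metrics from an original run with those from a re-run. The function name `compare_runs`, the metric names, and the tolerance defaults are assumptions for illustration; the skill's own comparison logic is described in SKILL.md, not here.

```python
import math


def compare_runs(original, rerun, rel_tol=1e-6, abs_tol=0.0):
    """Compare two {metric_name: value} dicts and label each metric.

    Returns a dict mapping each metric to "match", "differs", or
    "missing" (present in only one of the two runs).
    """
    report = {}
    for name in sorted(set(original) | set(rerun)):
        if name not in original or name not in rerun:
            report[name] = "missing"
        elif math.isclose(original[name], rerun[name],
                          rel_tol=rel_tol, abs_tol=abs_tol):
            report[name] = "match"
        else:
            report[name] = "differs"
    return report


if __name__ == "__main__":
    # Hypothetical metric values, not results from any real experiment.
    first = {"accuracy": 0.912, "loss": 0.31}
    second = {"accuracy": 0.912, "loss": 0.35}
    print(compare_runs(first, second))
```

A tolerance is used instead of exact equality because floating-point results can vary slightly across machines and library versions even when the experiment is reproducible.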

Quick Start

Clone this repo into your project or .claude/skills/
Start a Claude Code session
Try: "Run my analysis: Rscript analysis.R"

ARS Compatibility

This skill works independently. It also integrates optionally with Academic Research Skills (ARS):

Reads ARS Stage 1 output (RQ Brief, Methodology Blueprint) to pre-populate experiment design
Produces Material Passport-compatible output, including an explicit verification status, for ARS Stage 2 consumption
ARS requires zero modification: you carry outputs between the two sessions by hand

When to use with ARS

In the ARS pipeline, experiment-agent fits between Stage 1 (RESEARCH) and Stage 2 (WRITE):

ARS Stage 1 RESEARCH  →  you get RQ Brief + Methodology Blueprint
        ↓
  [pause ARS pipeline]
        ↓
  experiment-agent     →  plan → run/manage → validate → get analyzed or verified results
        ↓
  [resume ARS pipeline]
        ↓
ARS Stage 2 WRITE     →  write paper using your experiment results

Use experiment-agent when your research requires running experiments (code or human studies) before writing. If your paper is based purely on literature review or secondary data analysis, you don't need this skill; go directly from ARS Stage 1 to Stage 2.

How to load

Step 1: Clone this repo alongside your ARS project (or anywhere on your machine):

cd ~/Projects/HEEACT
git clone https://github.com/Imbad0202/experiment-agent.git

Step 2: When you need to run experiments, open a Claude Code session in the experiment-agent directory:

cd ~/Projects/HEEACT/experiment-agent
claude

Step 3: Paste the relevant ARS Stage 1 output (RQ Brief, Methodology Blueprint) into the session. The agent will auto-detect the ARS headings and pre-populate your experiment plan.

Step 4: After your experiments are done and validated, copy the output (which includes a Material Passport header and verification status) back into your ARS session to continue Stage 2.
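
The heading detection mentioned in Step 3 could look roughly like the sketch below. It assumes the pasted ARS output puts "RQ Brief" and "Methodology Blueprint" on their own lines, optionally prefixed with `#` markers or followed by a colon. That layout and the function name `split_ars_sections` are assumptions; the README does not specify the exact ARS format.

```python
ARS_HEADINGS = ("RQ Brief", "Methodology Blueprint")


def split_ars_sections(text):
    """Split pasted text into {heading: body} for the known ARS headings.

    Lines that appear before the first recognized heading are ignored.
    """
    sections = {}
    current = None
    for line in text.splitlines():
        # Normalize "## RQ Brief:" style lines down to "rq brief".
        label = line.strip().lstrip("#").strip().rstrip(":").strip().lower()
        match = next((h for h in ARS_HEADINGS if h.lower() == label), None)
        if match is not None:
            current = match
            sections[current] = []
        elif current is not None:
            sections[current].append(line)
    return {name: "\n".join(body).strip() for name, body in sections.items()}


if __name__ == "__main__":
    # Hypothetical pasted content, for illustration only.
    pasted = (
        "## RQ Brief\n"
        "Does X improve Y?\n"
        "\n"
        "## Methodology Blueprint\n"
        "Two-arm design.\n"
    )
    print(split_ars_sections(pasted))
```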

You can also add this skill to any project by symlinking it into .claude/skills/. See the Claude Code docs for skill installation.

Safety

Only executes commands you specify — never auto-generates or modifies your code
Never auto-retries crashed experiments
Never touches raw participant data
Statistical interpretation describes, never concludes
Full list: see SKILL.md Safety Rules
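
The first two rules can be pictured with a small sketch: run exactly the command the user gave, watch for a stall or crash, and report the outcome without retrying. This is not the skill's implementation. The function name `run_once`, the status labels, and the choice to terminate a stalled process (so the function can return) are assumptions made for the example.

```python
import subprocess
import sys
import threading
import time


def run_once(command, stall_timeout=60.0, poll_interval=0.2):
    """Run `command` (a list of arguments) a single time.

    Returns (status, output_lines), where status is "completed",
    "crashed" (non-zero exit code), or "stalled" (no output for longer
    than stall_timeout seconds). The command is never modified or retried.
    """
    proc = subprocess.Popen(command, stdout=subprocess.PIPE,
                            stderr=subprocess.STDOUT, text=True)
    lines = []
    last_output = [time.monotonic()]

    def reader():
        # Collect output and record when the process last produced any.
        for line in proc.stdout:
            lines.append(line.rstrip("\n"))
            last_output[0] = time.monotonic()

    thread = threading.Thread(target=reader, daemon=True)
    thread.start()

    while proc.poll() is None:
        if time.monotonic() - last_output[0] > stall_timeout:
            # Assumption of this sketch: stop the stalled process and report.
            proc.kill()
            proc.wait()
            thread.join(timeout=1.0)
            return "stalled", lines
        time.sleep(poll_interval)

    thread.join()
    status = "completed" if proc.returncode == 0 else "crashed"
    return status, lines


if __name__ == "__main__":
    print(run_once([sys.executable, "-c", "print('ok')"], stall_timeout=30.0))
```

Returning a status instead of looping on failure keeps the decision about what to do next (fix the script, re-run, abandon) with the user.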

License

CC-BY-NC 4.0

Author

Cheng-I Wu

Changelog

See CHANGELOG.md
