Skip to content

Melodiz/CartPole-PolicyGradient

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Policy Gradient Methods and Behaviour Cloning on CartPole-v1

HW1 — Research in LLMs, HSE × Central University

📄 Read the report

Reproduce

pip install -r requirements.txt

# Core experiments (5 seeds, ~7 min on M2)
cd src && python run_experiments.py --full

# Extended experiments (15 seeds + RLOO sweep + BC noise, ~20 min)
python additional_experiments.py --all

# Statistical analysis + extra figures
python statistical_analysis.py
python plotting.py
python plotting_extra.py

About

Statistical replication study of policy gradient methods on CartPole — 3 of 5 initial claims falsified with extended experiments

Resources

License

Stars

Watchers

Forks

Contributors