A bilingual, open-source research repository for a partially observable cat-and-mouse grid world built on the PettingZoo Parallel API, with recurrent MARL baselines, diagnostics, and reproducible training utilities.
- Partially observable
PettingZoo Parallel APIenvironment - Three agents:
cat_0,mouse_0,mouse_1 - Asymmetric actions, local observations, shared mouse policies, and recurrent actor-critic baselines
- Mainline training scripts for
MAPPO,IPPO, andQMIX - Diagnostics for task events, recording, regression checks, and visualization
- Bilingual documentation for public release
这是一个面向多智能体强化学习研究的猫鼠网格世界项目。
- 环境基于
POMDP与PettingZoo Parallel API - 参与智能体为
cat_0、mouse_0、mouse_1 - 支持局部观测、RNN 记忆、非对称动作与对抗训练
- 提供
MAPPO、IPPO、QMIX主线实现 - 历史博弈算法探索已归档,保持主仓库结构清晰
中文入口:docs/README.md
This project studies multi-agent pursuit, evasion, navigation, and resource collection under partial observability.
English entry: README_EN.md
conda env create -f environment.yml
conda activate cat_mouse_marlRun a regression check:
python -m unittest tests.test_grid_core_regressionsTrain a baseline:
python scripts/train_mappo.py --run-dir runs/quick_mappo --device cudaEvaluate a checkpoint:
python scripts/evaluate_task_events.py --mouse-policy model --mouse-checkpoint runs/quick_mappo/checkpoints/mappo_final.pt --cat-policy random --device cudaRecord a match:
python scripts/play_match.py --cat-checkpoint runs/quick_mappo/checkpoints/mappo_final.pt --mouse-checkpoint runs/quick_mappo/checkpoints/mappo_final.pt- 中文文档入口:docs/README.md
- English docs entry: docs/README_EN.md
- 本地实验产物说明:docs/LOCAL_ARTIFACTS_ZH.md
- Local artifact guide: docs/LOCAL_ARTIFACTS_EN.md
- 贡献指南:CONTRIBUTING.md
- 行为准则:CODE_OF_CONDUCT.md
envs/: environment core and PettingZoo wrapperalgorithms/: recurrent MARL implementationsscripts/: training, evaluation, recording, and diagnosticsutils/: policies, logging, and visualization helpersdocs/: tracked public documentationtests/: regression checks and API testsarchive/: archived exploratory work outside the current mainlineruns/: local experiment outputs, ignored by git by defaultdemos/: local recorded videos, ignored by git by default
- The repository is organized so source code, docs, and tests are tracked in git.
- Large or fast-changing experiment artifacts stay local under
runs/anddemos/. - Artifact indexes that should appear on GitHub are mirrored under
docs/.
MIT. See LICENSE.