Skip to content

HCP-AI-Research-Lab/MAT-Agent

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

MAT-Agent: Adaptive Multi-Agent Training Optimization

Official implementation for the NeurIPS 2025 paper "MAT-Agent: Adaptive Multi-Agent Training Optimization".

NeurIPS 2025 | Arxiv | 中文


📖 Introduction

MAT-Agent is a novel multi-agent framework that reimagines the training optimization process for Multi-Label Image Classification (MLIC). Instead of relying on static configurations, MAT-Agent treats training as a collaborative, real-time optimization process.

Key Features:

  • 🤖 Multi-Agent Collaboration: Deploys four autonomous agents to dynamically tune Data Augmentation, Optimizers, Learning Rates, and Loss Functions in real-time.
  • ⚖️ Dynamic Trade-off: leverages non-stationary multi-armed bandit algorithms to balance exploration and exploitation.
  • 🎯 Composite Reward: Guided by a reward system that harmonizes accuracy, rare-class performance, and training stability.
  • 🚀 SOTA Performance: Achieves 97.4 mAP on Pascal VOC 2007 and 92.8 mAP on MS-COCO, significantly outperforming conventional static training methods.

For more details, please refer to our paper.


🛠️ Installation

1. Clone Repository

git clone https://github.com/HCP-AI-Research-Lab/MAT-Agent

2. Environment Setup

We recommend using Conda to manage the environment.

# Create environment
conda create -n mat-agent python==3.10
conda activate mat-agent

# Install PyTorch 1.31.1
pip install torch==1.13.1+cu117 torchvision==0.14.1+cu117 torchaudio==0.13.1 --extra-index-url https://download.pytorch.org/whl/cu117

# Install dependencies
pip3 install Pillow randaugment numpy opencv-python tensorboard pycocotools scikit-learn pandas

This project requires inplace_abn. Please install it from source ensuring you checkout version v1.1.0.

cd MAT-Agent # change directory to root
git clone https://github.com/mapillary/inplace_abn.git
cd inplace_abn
git checkout v1.1.0
python setup.py install
cd scripts
pip3 install -r requirements.txt
cd ../..

📂 Data Preparation

1. Download Datasets

Please download the Pascal VOC 2007 and MS-COCO 2014 datasets.

For Pascal VOC 2007:

# Download and extract
wget https://huggingface.co/datasets/HuggingFaceM4/pascal_voc/resolve/main/voc2007.tar.gz -O datasets/voc2007.tar.gz
cd datasets
tar xvzf voc2007.tar.gz

# Convert to COCO format
python3 convert_to_coco.py

# Cleanup (Optional)
rm voc2007.tar.gz && rm -rf VOC2007

For MS-COCO 2014: Download the dataset (Train/Val/Test 2014) and annotations from this link, then organize them into datasets/MSCOCO-2014.

2. Directory Structure

Ensure your datasets/ directory looks exactly like this:

datasets/
├── convert_to_coco.py
├── PascalVOC2COCO.py
├── VOC2007_COCO/              # Processed VOC data
├── MSCOCO-2014/
│   ├── annotations/
│   ├── train2014/
│   ├── val2014/
└── └── test2014/

🚀 Usage

Training & Evaluating on MS-COCO

To train and evaluate the model on the MS-COCO dataset using MAT-Agent:

python3 run_coco.py

Training on Pascal VOC

To train and evaluate on Pascal VOC 2007:

python3 run_voc.py

📊 Results

MAT-Agent demonstrates superior performance compared to state-of-the-art methods across multiple benchmarks.

Method Backbone Pascal VOC (mAP) MS-COCO (mAP) VG-256 (mAP)
ML-GCN ResNet-101 94.0 83.0 52.3
ASL ResNet-101 95.8 86.6 56.3
PAT-T ResNet-101 96.2 91.8 59.5
MAT-Agent (Ours) ResNet-101 97.4 92.8 60.9

🔗 Citation

If you find our work or code helpful, please cite our paper:

@misc{zhang2025matagentadaptivemultiagenttraining,
      title={MAT-Agent: Adaptive Multi-Agent Training Optimization}, 
      author={Jusheng Zhang and Kaitong Cai and Yijia Fan and Ningyuan Liu and Keze Wang},
      year={2025},
      eprint={2510.17845},
      archivePrefix={arXiv},
      primaryClass={cs.CV},
      url={https://arxiv.org/abs/2510.17845}, 
}

📄 License

This project is released under the MIT License.

About

Official implementation for NIPS2025 paper "MAT-Agent: Adaptive Multi-Agent Training Optimization"

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages