Skip to content

Mixed Multi-Agent Reinforcement Learning framework for efficient coordination

Notifications You must be signed in to change notification settings

johnMinelli/CoordinatedQMIX

Repository files navigation

MARL Coordination with QMIX

ArXiv

Official repository for "CoMIX: A Multi-agent Reinforcement Learning Training Architecture for Efficient Decentralized Coordination and Independent Decision Making"

Project structure

The master branch repository provide the code used for the paper. The dev branch contains all material for testing and ablation study.

Setup

  • Setup an environment (optional) and install the requirements:

    conda create -n env python=3.9.2
    conda activate env
    pip install -r requirements.txt
    

Usage

Default parameters are loaded for each environment in automatic through the params_{env_name}.yaml config file if exists. Otherwise a custom configuration can be loaded specifying the file path in --yaml_params parameter or specifying individual parameters in the command line.

To train:

python train.py --env CoMix_switch
  • Environment names available are: CoMix_switch, CoMix_predator_prey_4, CoMix_predator_prey_8, CoMix_predator_prey_16, CoMix_transport_1, CoMix_transport_2, CoMix_transport_4

  • Fine tuning procedure explained in the paper, can be executed using --fine_tune 1 and modifying the Q optimizer parameters)

To evaluate:

python eval.py --models_path save --model_epoch -1  -ve 1000
  • -1 for last checkpoint in folder
  • 1000 validation episodes to get a strong average score

📚 Citation

Consider giving it a ⭐ and cite our paper:

@article{Minelli2023Comix,
  author = {Giovanni Minelli and Mirco Musolesi},
  journal = {Transactions on Machine Learning Research},
  title={CoMIX: A Multi-agent Reinforcement Learning Training Architecture for Efficient Decentralized Coordination and Independent Decision-Making},
  year = {2024},
  url={https://arxiv.org/abs/2308.10721}
}

About

Mixed Multi-Agent Reinforcement Learning framework for efficient coordination

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages