Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Added n-player/generalised inTheMatrix/STORM implementation. #181

Open
wants to merge 1 commit into
base: main
Choose a base branch
from

Conversation

ali-shihab
Copy link

Generalised STORM implementation developed at FLAIR. Key changes:

  1. Allows for any number of agents to be specified.
  2. Allows for any grid size to be specified (though this needs to be greater than the number of objects contained in the grid, obviously).
  3. Allows for any number of coins (per type) to be specified.

Some plots of: 1. mean proportion of red/defect coins held by agents in the environment; 2. mean returns; during IPPO test runs are provided (population sizes of 256, 512, 1024). Note that in all cases, the (defection) ratio converges to approx 0.8, which is expected behaviour for IPPO.
1024_agent_returns
1024_agent_coin_ratio
512_agent_returns
512_agent_coin_ratio
256_agent_returns
256_agent_coin_ratio

@github-actions github-actions bot added the envs label Sep 2, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant