Support Pettingzoo Multi-agent Atari envs with PPO #188

vwxyzjn · 2022-05-29T20:41:37Z

Description

Follow up to #144.

Types of changes

New feature

Checklist:

I've read the CONTRIBUTION guide (required).
I have ensured pre-commit run --all-files passes (required).
I have updated the documentation and previewed the changes via mkdocs serve.
I have updated the tests accordingly (if applicable).

If you are adding new algorithms or your change could result in performance difference, you may need to (re-)run tracked experiments. See #137 as an example PR.

vercel · 2022-05-29T20:41:40Z

The latest updates on your projects. Learn more about Vercel for Git ↗︎

Name	Status	Preview	Updated
cleanrl	✅ Ready (Inspect)	Visit Preview	Jun 1, 2022 at 10:10PM (UTC)

gitpod-io · 2022-05-29T20:41:47Z

vwxyzjn · 2022-05-31T13:55:20Z

@benblack769 @araffin @Miffyli @jkterry1 @kcorder would you mind helping review this PR? In particular, could you help review the following:

This file diff (https://www.diffchecker.com/WQ3yzb1Y) highlights the lines of code changed.
The documentation (https://cleanrl-git-pettingzoo-docs-vwxyzjn.vercel.app/rl-algorithms/ppo/#ppo_pettingzoo_ma_ataripy) specifies implementation details and results.

Thanks!

kcorder · 2022-05-31T14:38:30Z

This all looks good to me!

Just some things I think we should try out:

we have a NoopReset wrapper for PZ envs
Jordan/Ben previously found using the InvertColor agent indicator was better than normal agent indicator
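
For readers unfamiliar with the two indicator styles mentioned above, here is a minimal NumPy sketch of the general idea. It is not the supersuit or MA-ALE2 implementation: the function names, the one-channel-per-agent layout, and the "invert for every agent except agent 0" rule are all assumptions made for illustration.

```python
import numpy as np


def naive_agent_indicator(frame: np.ndarray, agent_idx: int, num_agents: int) -> np.ndarray:
    """Append one constant channel per agent (hypothetical layout).

    The channel belonging to `agent_idx` is filled with 255, the others with 0,
    so a shared policy can tell which agent it is acting for.
    """
    height, width, _ = frame.shape
    indicator = np.zeros((height, width, num_agents), dtype=frame.dtype)
    indicator[:, :, agent_idx] = 255
    return np.concatenate([frame, indicator], axis=-1)


def invert_color_indicator(frame: np.ndarray, agent_idx: int) -> np.ndarray:
    """Mark the agent by inverting pixel intensities (hypothetical rule).

    Agent 0 sees the frame unchanged; any other agent sees 255 - pixel.
    Assumes a uint8 frame with values in [0, 255].
    """
    if agent_idx == 0:
        return frame.copy()
    return 255 - frame
```

Calling `naive_agent_indicator(frame, agent_idx=1, num_agents=2)` on an 84x84x1 frame would return an 84x84x3 array, while `invert_color_indicator` keeps the shape unchanged and only alters pixel values.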

vwxyzjn · 2022-05-31T14:49:04Z

Thank you @kcorder, I’d be happy to try out the no-op reset wrapper. Is the InvertColor agent indicator in supersuit? Also see https://wandb.ai/costa-huang/cleanRL/reports/MA-ALE--VmlldzoxNzAzMDQx#invert-color-indicator which shows the performance of the invertcolor indicator - at least in pong it does not perform as well as the naive indicator.

kcorder · 2022-05-31T15:01:52Z

Oh interesting, good to know about agent indicator - I hadn't tried myself.

The NoopReset is here: https://github.com/jkterry1/MA-ALE2/blob/74f562d088c795e7fa4fdeba494f2573ac9c6c7e/env_utils.py#L324-L345

We've been using this InvertColorAgentIndicator - there was a bug fix there since the original code actually

vwxyzjn · 2022-06-01T23:50:22Z

@kcorder thanks for the helpful pointers. While it would be interesting to try this preprocessing, I would like to defer this as future work since we are aiming for a 1.0.0 release soon.

vwxyzjn added 2 commits May 29, 2022 16:39

Pettingzoo integration

e82bfbf

Add test cases

ac838d5

vwxyzjn mentioned this pull request May 29, 2022

Support pettingzoo MA ALE envs with PPO #144

Closed

19 tasks

pre-commit

f4b3d24

vercel bot deployed to Preview May 29, 2022 20:43

Fix CI

b1f1afa

vercel bot deployed to Preview May 29, 2022 20:50

remove windows CI and add docs

1afcf5f

vercel bot deployed to Preview May 30, 2022 00:05

Update docs

9e1aed2

vercel bot deployed to Preview May 31, 2022 02:19

Add benchmark

9b13d05

vercel bot deployed to Preview May 31, 2022 13:42

Update docs

b10540e

vercel bot deployed to Preview May 31, 2022 13:43

vwxyzjn marked this pull request as ready for review May 31, 2022 13:52

vwxyzjn requested review from dipamc, dosssman and yooceii May 31, 2022 13:52

change color

40fe210

vercel bot deployed to Preview June 1, 2022 22:10

dosssman approved these changes Jun 1, 2022

vwxyzjn merged commit e547cc7 into master Jun 1, 2022

vwxyzjn mentioned this pull request Jun 1, 2022

Refactor documentation #121

Closed

10 tasks
