The discrete action counterpart of #48.

Associated PR: https://github.com/DLR-RM/stable-baselines3/pull/110

- [x] A2C
- [x] PPO
- [x] DQN (I'm currently working on that in #28 and it looks good)

Test envs: Atari games (Pong - easy, Breakout - medium, ...)