PPO reward normalization works only for default gamma #203

@Howuhh


Problem Description

The current implementation of continuous-action PPO uses gym.wrappers.NormalizeReward with its default gamma value. For any gamma other than the default of 0.99, this normalization is incorrect, because the wrapper's running discounted-return statistics are computed with a discount factor that does not match the one used for advantage estimation.

env = gym.wrappers.NormalizeReward(env)

Possible Solution

Very easy: just pass gamma=args.gamma to the normalization wrapper, i.e. gym.wrappers.NormalizeReward(env, gamma=args.gamma).
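To see why the wrapper's gamma matters, here is a minimal self-contained sketch of the core logic NormalizeReward uses (class names RewardNormalizer and RunningMeanStd are illustrative, not gym's actual internals): each reward is divided by the standard deviation of a gamma-discounted running return, so the scaling depends directly on gamma.

```python
import numpy as np

class RunningMeanStd:
    # Running mean/variance with parallel-variance (Chan et al.) updates.
    def __init__(self):
        self.mean, self.var, self.count = 0.0, 1.0, 1e-4

    def update(self, x):
        batch_mean, batch_var, batch_count = np.mean(x), np.var(x), len(x)
        delta = batch_mean - self.mean
        tot = self.count + batch_count
        self.mean += delta * batch_count / tot
        m_a = self.var * self.count
        m_b = batch_var * batch_count
        self.var = (m_a + m_b + delta ** 2 * self.count * batch_count / tot) / tot
        self.count = tot

class RewardNormalizer:
    # Sketch of the wrapper's idea: maintain a discounted running return
    # ret = ret * gamma + reward, and scale each reward by the running
    # std of that return. A different gamma therefore yields different
    # normalization statistics.
    def __init__(self, gamma=0.99, epsilon=1e-8):
        self.gamma, self.epsilon = gamma, epsilon
        self.ret = 0.0
        self.rms = RunningMeanStd()

    def normalize(self, reward):
        self.ret = self.ret * self.gamma + reward
        self.rms.update(np.array([self.ret]))
        return reward / np.sqrt(self.rms.var + self.epsilon)
```

Feeding the same reward stream through two normalizers with different gammas produces different scaled rewards after the first step, which is exactly the mismatch the issue describes when args.gamma != 0.99.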
