-
Notifications
You must be signed in to change notification settings - Fork 114
Closed
Labels
documentationImprovements or additions to documentationImprovements or additions to documentation
Description
Describe the bug
Unless I'm mistaken, the env resets automatically when an episode is over, which means the terminal observation is not accessible to the agent which prevent from doing proper bootstrapping for infinite horizon problems.
See openai/gym#2484 and openai/gym#1632
Expected behavior
Reason and Possible fixes
Add terminal_observation
key to the info dict, as already done for the timelimit truncation.
Checklist
- I have checked that there is no similar issue in the repo (required)
- I have read the documentation (required)
- I have provided a minimal working example to reproduce the bug (required)
Metadata
Metadata
Assignees
Labels
documentationImprovements or additions to documentationImprovements or additions to documentation