Replies: 1 comment 1 reply
-
We don't look at NaN afair. |
Beta Was this translation helpful? Give feedback.
1 reply
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
I've created an auto-resetting transformed environment where observation at done steps are marked as NaN. Then I tried to use the bundled GAE with the rollout data of my env, I found NaNs in the output entries of the GAE. Here is my transformed env.
And GAE is used like this: (btw am I using the right selection of data?)
Then by printing "advantage", "state_value", and "value_target", I found NaNs in the tensor. So my question is: does the GAE implementation consider autoresetting envs? If yes, then there shouldn't be NaNs or I am using it wrongly. Could you please help me diagnose the problem?
Beta Was this translation helpful? Give feedback.
All reactions