Number of time steps in one iteration of RLlib training

Question

I am new to reinforcement learning and I am training an agent on a custom OpenAI Gym environment with RLlib. When I create a custom environment, do I need to specify the number of episodes in the __init__() method? Also, when I train the agent with

for _ in range(10):
    trainer.train()

how many time steps are taken in one iteration? Is it equal to the number of episodes defined in the custom environment? Thank you.

score 1 · Answer 1 · answered Jul 01 '21 at 21:25


I think what you need to set for the max number of steps in one episode is the hyperparameter `horizon`.
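A minimal sketch of where that setting would go, assuming an older RLlib release (around Ray 1.x) in which `horizon` is still an accepted config key. The environment name "MyCustomEnv-v0" and the value 200 are placeholders, not taken from the question:

```python
# Hypothetical trainer config for an older RLlib (Ray 1.x) release.
# "MyCustomEnv-v0" and the horizon value are illustrative assumptions.
config = {
    "env": "MyCustomEnv-v0",  # placeholder name for the registered custom env
    "horizon": 200,           # cut every episode off after at most 200 steps
}

# With Ray 1.x installed, this dict would be passed to a trainer, e.g.:
#   from ray.rllib.agents.ppo import PPOTrainer
#   trainer = PPOTrainer(config=config)
print(config["horizon"])
```

Note that this caps the length of a single episode; it does not by itself fix how many time steps one call to trainer.train() collects.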


vwaq


score 0 · Answer 2 · answered Jul 26 '20 at 08:10

I found that with Ray, an episode only terminates when your environment sets 'done/_terminated'. In other frameworks, the algorithms often have a hyperparameter such as num_steps for this. I discovered this because whenever my agent got stuck, it would just sit there forever, so I had to add a max-time-steps check in the environment itself.

The number of episodes is set outside of the environment, though.
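A max-time-steps check of this kind could look roughly like the sketch below. It is a toy environment that only mimics the classic gym reset()/step() interface without importing gym; the class name CappedStepEnv, the max_steps and goal parameters, and the run_episode helper are all made up for illustration:

```python
class CappedStepEnv:
    """Toy 1-D environment with a built-in cap on episode length."""

    def __init__(self, max_steps=50, goal=5):
        # Only per-episode limits live here; the number of episodes does not.
        self.max_steps = max_steps
        self.goal = goal
        self.position = 0
        self.steps_taken = 0

    def reset(self):
        self.position = 0
        self.steps_taken = 0
        return self.position

    def step(self, action):
        # action: 0 = stay where you are, 1 = move one cell to the right
        self.position += action
        self.steps_taken += 1
        reached_goal = self.position >= self.goal
        timed_out = self.steps_taken >= self.max_steps
        reward = 1.0 if reached_goal else 0.0
        # Signal done on success OR when the step budget is used up,
        # so a stuck agent cannot keep the episode alive forever.
        done = reached_goal or timed_out
        info = {"timed_out": timed_out and not reached_goal}
        return self.position, reward, done, info


def run_episode(env, policy):
    """Roll out one episode and return (number of steps, last info dict)."""
    obs = env.reset()
    done = False
    steps = 0
    info = {}
    while not done:
        obs, reward, done, info = env.step(policy(obs))
        steps += 1
    return steps, info
```

For example, run_episode(CappedStepEnv(max_steps=10), lambda obs: 0) uses a policy that never moves, and the episode still ends once the step cap is reached.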
