LSTM based policy in stable baselines3 model

Question

I am trying to build a PPO model with the stable-baselines3 library, and I want to use a policy network that contains an LSTM layer. However, I can't find such an option in the library's documentation, although it exists in the previous version, stable-baselines: https://stable-baselines.readthedocs.io/en/master/modules/policies.html#stable_baselines.common.policies.MlpLstmPolicy

Does this option exist in stable-baselines3 (not stable-baselines)? If not, is there another way to do this? Thanks.

bnye · Answer 1 · 2022-01-24T01:34:41.683

3

From the migration guide (https://stable-baselines3.readthedocs.io/en/master/guide/migration.html), under "Breaking Changes":

LSTM policies (MlpLstmPolicy, CnnLstmPolicy) are not supported for the time being (see PR #53 for a recurrent PPO implementation)

edited Jan 24 '22 at 01:34

answered Jan 24 '22 at 01:26


score 3 · Answer 2 · answered May 10 '22 at 11:31

Currently this functionality does not exist in stable-baselines3 itself.

However, the contributions repo (stable-baselines3-contrib) has an experimental version of PPO with an LSTM policy. I have not tried it myself, but according to this pull request it works.

You can find it on the feat/ppo-lstm branch, which may get merged into master soon.
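For reference, here is a minimal sketch of what using it might look like. It assumes an sb3-contrib version that already ships the recurrent PPO work as `sb3_contrib.RecurrentPPO` with an `"MlpLstmPolicy"` policy string (on older installs you would need the branch mentioned above instead). The environment id `CartPole-v1`, the timestep count, and the helper names `train_recurrent_ppo` and `run_episode` are my own placeholders, not anything from the docs.

```python
def train_recurrent_ppo(env_id="CartPole-v1", total_timesteps=5_000):
    """Train PPO with an LSTM policy. Assumes `pip install sb3-contrib`."""
    # Imported lazily so this file can be loaded without sb3-contrib installed.
    from sb3_contrib import RecurrentPPO

    # "MlpLstmPolicy" mirrors the policy name from the old stable-baselines.
    model = RecurrentPPO("MlpLstmPolicy", env_id, verbose=1)
    model.learn(total_timesteps=total_timesteps)
    return model


def run_episode(model, env):
    """Roll out one episode on a vectorized env, carrying the LSTM state."""
    import numpy as np

    obs = env.reset()
    lstm_states = None  # None lets the policy start from a zero hidden state
    # Flags which envs are at the start of an episode, so the state is reset.
    episode_starts = np.ones((env.num_envs,), dtype=bool)
    total_reward = 0.0
    done = False
    while not done:
        action, lstm_states = model.predict(
            obs,
            state=lstm_states,
            episode_start=episode_starts,
            deterministic=True,
        )
        obs, rewards, dones, _ = env.step(action)
        episode_starts = dones
        total_reward += float(rewards[0])  # only track the first env
        done = bool(dones[0])
    return total_reward
```

You would call `model = train_recurrent_ppo()` and then `run_episode(model, model.get_env())`. The main difference from a non-recurrent policy is that you have to pass the LSTM state and the episode-start flags back into `predict` on every step; otherwise the policy acts as if each step were the start of a new episode.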
