Questions tagged [mdptoolbox]

Markov Decision Process (MDP) Toolbox for Python - The MDP toolbox provides classes and functions for solving discrete-time Markov Decision Processes.

5 questions

2 answers

Python returning two identical matrices

I am trying to write a small program for Markov Decision Process (inventory problem) using Python. I cannot figure out why the program outputs two identical matrices (for profit and decision matrices). The programming itself has some problems too…

asked Feb 01 '22 at 19:12

Chris

1 answer

Understanding the argument values for mdptoolbox forest example

I am trying to understand how to use mdptoolbox and had a few questions. What does 20 mean in the following statement? P, R = mdptoolbox.example.forest(10, 20, is_sparse=False) I understand that 10 here denotes the number of possible states. What…

python numpy reinforcement-learning mdptoolbox

asked Jun 08 '19 at 13:38

Amanda
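For readers landing on the forest question above: as far as I can tell from the pymdptoolbox documentation, the signature is forest(S=3, r1=4, r2=2, p=0.1, is_sparse=False), so in forest(10, 20) the 20 is r1, the reward for choosing 'Wait' when the forest is in its oldest state. The sketch below is a pure-Python re-creation of that documented behaviour, not the library's own code; forest_sketch is a hypothetical helper name, and it returns nested lists rather than NumPy arrays.

```python
def forest_sketch(S=3, r1=4, r2=2, p=0.1):
    """Hypothetical re-creation of the documented forest example (not library code)."""
    # Action 0 = 'Wait': the forest burns down (back to state 0) with
    # probability p, otherwise it ages by one state, capped at the oldest.
    wait = [[0.0] * S for _ in range(S)]
    for s in range(S):
        wait[s][0] += p
        wait[s][min(s + 1, S - 1)] += 1.0 - p

    # Action 1 = 'Cut': the forest always returns to the youngest state.
    cut = [[1.0 if j == 0 else 0.0 for j in range(S)] for _ in range(S)]

    # Rewards R[state][action]: waiting pays nothing until the oldest state
    # (r1); cutting pays 1 in intermediate states, 0 in the youngest state,
    # and r2 in the oldest state.
    R = [[0.0, 1.0] for _ in range(S)]
    R[0][1] = 0.0
    R[S - 1] = [float(r1), float(r2)]
    return [wait, cut], R


P, R = forest_sketch(10, 20)
print(len(P[0]))  # 10 states
print(R[-1])      # [20.0, 2.0] -> r1 = 20 for 'Wait', default r2 = 2 for 'Cut'
```

Under that reading, the first argument sets the size of the state space and the second only changes one entry of the reward matrix.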


2 answers

ModuleNotFoundError: No module named 'mdptoolbox'

I receive an error when importing the mdptoolbox module. import mdptoolbox, mdptoolbox.example Error logs: ModuleNotFoundError Traceback (most recent call last) in () ----> 1 import mdptoolbox,…

python mdptoolbox

asked Jan 28 '19 at 10:56

Kavikayal
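A frequent cause of the ModuleNotFoundError above is that the package is published on PyPI as pymdptoolbox while the import name is mdptoolbox, or that it was installed for a different interpreter than the one running the notebook. The snippet below is a minimal diagnostic sketch using only the standard library; module_available is a hypothetical helper name, and whether this matches the asker's situation is an assumption.

```python
import importlib.util
import sys


def module_available(name):
    """Return True if a top-level module can be found by this interpreter."""
    return importlib.util.find_spec(name) is not None


if module_available("mdptoolbox"):
    print("mdptoolbox is importable from", sys.executable)
else:
    # Install into the same interpreter that reported the error.
    print("mdptoolbox not found for", sys.executable)
    print("try:", sys.executable, "-m pip install pymdptoolbox")
```

Running the install through sys.executable, rather than a bare pip command, keeps the package and the interpreter in step.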

1 answer

OverflowError as I try to use the value-iteration algorithm with mdptoolbox

I set up a simple MDP for a board that has 4 possible states and 4 possible actions. The board and reward setup looks as follows: Here S4 is the goal state and S2 is the absorbing state. I have defined the transition probability matrices and reward…

python dynamic-programming markov-chains stochastic mdptoolbox

asked Nov 21 '18 at 11:56

Suhail Gupta


2 answers

Must a transition matrix from a Markov Decision Process be stochastic?

I'm trying to find the optimal policy for a Markov Decision Process problem specified in this diagram, using Value Iteration (via pymdptoolbox) and NumPy. But pymdptoolbox says my transition matrix "is not stochastic". Is it because of the arrays…

python dynamic-programming markov-chains stochastic mdptoolbox

asked Apr 27 '17 at 18:50

James C
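On the "is not stochastic" question above: a transition matrix is row-stochastic when every entry is non-negative and every row sums to 1, one matrix per action. The sketch below checks that in plain Python; find_non_stochastic_rows is a hypothetical helper, the example matrices are made up for illustration, and the all-zero row for a terminal state is only one common cause, not necessarily the one in the question.

```python
def find_non_stochastic_rows(P, tol=1e-10):
    """Return (action, state, row_sum) for every row that is not a valid
    probability distribution. P is indexed as P[action][state][next_state]."""
    bad = []
    for action, matrix in enumerate(P):
        for state, row in enumerate(matrix):
            total = sum(row)
            if min(row) < 0 or abs(total - 1.0) > tol:
                bad.append((action, state, total))
    return bad


P = [
    [[0.5, 0.5], [0.0, 0.0]],  # action 0: state 1 has no outgoing probability
    [[1.0, 0.0], [0.0, 1.0]],  # action 1: every row sums to 1
]
print(find_non_stochastic_rows(P))  # [(0, 1, 0.0)]

# Giving the terminal state a self-loop makes the row a valid distribution.
P[0][1] = [0.0, 1.0]
print(find_non_stochastic_rows(P))  # []
```

Modelling a terminal state as absorbing (probability 1 of staying put) keeps the process well defined without changing where the agent can end up.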