Questions tagged [mdptoolbox]

Markov Decision Process (MDP) Toolbox for Python - The MDP toolbox provides classes and functions for the resolution of discrete-time Markov Decision Processes.

5 questions
1
vote
2 answers

Python returning two identical matrices

I am trying to write a small program for Markov Decision Process (inventory problem) using Python. I cannot figure out why the program outputs two identical matrices (for profit and decision matrices). The programming itself has some problems too…
Chris
  • 95
  • 5
1
vote
1 answer

Understanding the argument values for mdptoolbox forest example

I am trying to understand how to use mdptoolbox and had a few questions. What does 20 mean in the following statement? P, R = mdptoolbox.example.forest(10, 20, is_sparse=False) I understand that 10 here denotes the number of possible states. What…
Amanda
  • 2,013
  • 3
  • 24
  • 57
1
vote
2 answers

ModuleNotFoundError: No module named 'mdptoolbox'

I receive an error when importing the mdptoolbox module. import mdptoolbox, mdptoolbox.example Error logs: ModuleNotFoundError Traceback (most recent call last) in () ----> 1 import mdptoolbox,…
Kavikayal
  • 143
  • 4
  • 14
1
vote
1 answer

OverflowError as I try to use the value-iteration algorithm with mdptoolbox

I set up a simple MDP for a board that has 4 possible states and 4 possible actions. The board and reward setup looks as follows: Here S4 is the goal state and S2 is the absorbing state. I have defined the transition probability matrices and reward…
Suhail Gupta
  • 22,386
  • 64
  • 200
  • 328
1
vote
2 answers

Must a transition matrix from a Markov Decision Process be stochastic?

I'm trying to find the optimal policy for a Markov Decision Process problem specified in this diagram, using Value Iteration (via pymdptoolbox) and NumPy. But pymdptoolbox says my transition matrix "is not stochastic". Is it because of the arrays…