I am learning markov decision process. Am I don't know where to mark terminal states.
In 4x3 grid world, I marked the terminal state that I think correct(I might be wrong) with T. Pic
I saw an instruction mark terminal states as follow.
terminals=[(3, 2), (3, 1)]
Can someone explain how does it work?