When policy iteration or value iteration is performed on a "make_int_mdp", the values include an additional terminal state, is this needed?