True/False: For a given Markov decision process, in order to extract the optimal policy π∗, it is sufficient to know the transition function T(s,a,s′) and optimal value function V ∗. If false, explain...

True/False: For a given Markov decision process, in order to extract the optimal policy π∗,
it is sufficient to know the transition function T(s,a,s′) and optimal value function V ∗.
If false, explain why this is false. If true, explain how to extract the policy.

Jun 07, 2022