b) Assume that k’ satisfies 4.50 (i.e., that it satisfies the termination condition of the policy improvement algorithm) and that k satisfies the conditions of part a). Show that is satisfied for all states.
c) Show that w ≤ w’.