Prove Corollary 1.3 (p. 9) from the script Theory of Reinforcement Learning 3: Every policy for which satisfies the Bellman optimality equations