The sequence of policy iterations obtained when solving this problem.
The corresponding sequence of state value function obtained when solving this problem.
John Weatherwax
Last modified: Sun May 15 08:46:34 EDT 2005