Value iteration is a fixed point technique utilized to obtain the optimal value function and policy in discounted reward Markov decision process (MDP). Here, contraction operator constructed applied repeatedly arrive at solution. first-order method and, therefore, it may take large number of iterations converge Successive relaxation popular that can be solve equation. It has been shown literatu...