This paper investigates the adaptive robust control problem based on reinforcement learning for an affine nonlinear system with unknown time-varying uncertainty. Inspired by ability to estimate uncertainty of neural network, a novel policy iteration algorithm is proposed which alternates between value evaluation, estimation, and update steps until law obtained. Especially during step approximat...