We present H-TD²: Hybrid Temporal Difference Learning for Taxi Dispatch, a model-free, adaptive decision-making algorithm to coordinate a large fleet of automated taxis in a dynamic urban environment to minimize expected customer waiting times. Our scalable algorithm exploits the natural transportation network...