Abstract Traffic forecasting has attracted widespread attention recently. In reality, traffic data usually contains missing values due to sensor or communication errors. The Spatio-temporal feature in brings more challenges for processing such values, which the classic techniques (e.g., imputations) are limited: (1) temporal axis, can be randomly consecutively missing; (2) spatial happen on one...