Federated learning has attracted increasing attention with the emergence of distributed data. While extensive federated algorithms have been proposed for non-convex problem, in practice still faces numerous challenges, such as large training iterations to converge since sizes models and datasets keep increasing, lack adaptivity by SGD-based model updates. Meanwhile, study adaptive methods is sc...