Federated learning (FL), as an emerging edge artificial intelligence paradigm, enables many devices to collaboratively train a global model without sharing their private data. To enhance the training efficiency of FL, various algorithms have been proposed, ranging from first-order second-order methods. However, these cannot be applied in scenarios where gradient information is not available, e....