In order to solve the problem of large model computing power consumption, this paper proposes a novel compression algorithm. Firstly, an interpretable weight allocation method for loss between student network (a with poor performance), teacher better performance) and real label. Then, different from previous simple pruning fine-tuning, performs knowledge distillation on pruned model, quantifies...