A well-designed loss function can effectively improve the characterization ability of network features without increasing amount calculation in model inference stage, and has become focus attention recent research. Given that existing lightweight adds a to last layer, which severely attenuates gradient during backpropagation process, we propose hierarchical polynomial kernel prototype this stud...