We analyzed the network structure of real-time object detection models and found that the features at the feature concatenation stage are very rich. Applying an attention module at this stage can effectively improve the model's accuracy. However, commonly used attention modules or self-attention perform poorly in terms of inference efficiency. Therefore, we propose a novel module, called 2D local superimposed self-attention, for neck netw...