Due to the large number of parameters and heavy computation, real-time operation deep learning in low-performance embedded board is still difficult. Network Pruning one effective methods reduce without additional network structure modification. However, conventional method prunes redundant up same rate for all layers. It may cause a bottleneck problem, which leads performance degradation, becau...