Depth-adaptive neural networks can dynamically adjust depths according to the hardness of input words, and thus improve efficiency. The main challenge is how measure such decide required (i.e., layers) conduct. Previous works generally build a halting unit whether computation should continue or stop at each layer. As there no specific supervision depth selection, may be under-optimized inaccura...