Video object detection is a fundamental research task for scene understanding. Compared with in images, videos has been less researched due to shortage of labelled video datasets. As frames clip are highly correlated, larger quantity labels needed have good data variation, which not always available as the much more expensive attain. Regarding above-mentioned problem, it easy train an image det...