Object detection can be regarded as a pixel clustering task, and its boundary is determined by four extreme points (leftmost, top, rightmost, bottom). However, most studies focus on the center or corner of object, which are conditional results points. In this paper, we present an Extreme-Point-Prediction-Based object detector (EPP-Net), directly regresses relative displacement vector between ea...