Small inter-class and large intra-class variations are the key challenges in fine-grained visual classification. Objects from different classes share visually similar structures, objects same class can have poses viewpoints. Therefore, proper extraction of discriminative local features (e.g., bird’s beak or car’s headlight) is crucial. Most recent successes on this problem based upon attention ...