Extreme multi-label classification (XML) is becoming increasingly relevant in the era of big data. Yet, there is no method for effectively generating stratified partitions of XML datasets. Instead, researchers typically rely on provided train-test splits that 1) aren’t always representative of the entire dataset and 2) are missing many labels. This can lead to poor generalization ability and unreliable perfor...