Evaluating data imputation and augmentation performance is a critical issue in science. In statistics, methods like Kolmogorov-Smirnov K-S test, Cramér-von Mises $$W^2$$ , Anderson-Darling $$A^2$$ Pearson’s $$\chi ^2$$ Watson’s $$U^2$$ exists for decades to compare the distribution of two datasets. context generation, typical evaluation metrics have same flaw: They calculate feature’s error glo...