Data scientists often develop data sets for analysis by drawing upon the sources of data available to them. A major challenge is to ensure that the data set used has an appropriate representation of relevant (demographic) groups: that is, it meets desired distribution requirements. Whether collected through some experiment or obtained from a data provider, the data from any single source may not meet these requirements. Therefore, a union of data from multiple sources is required. In th...