Generative adversarial networks (GANs) learn the distribution of observed samples through a zero-sum game between two machine players, generator and discriminator. While GANs achieve great success in learning complex image, sound, text data, they perform suboptimally multimodal distribution-learning benchmarks such as Gaussian mixture models (GMMs). In this paper, we propose Adversarial Trainin...