MEGAN: Mixture of Experts of Generative Adversarial Networks for Multimodal Image Generation

David Keetae Park , Seungjoo Yoo , Hyojin Bahng , Jaegul Choo , Noseong Park

Authors on Pith no claims yet

classification 💻 cs.CV

keywords generatornetworksimagesmeganmodalitiesmultipleadversarialapproach

read the original abstract

Recently, generative adversarial networks (GANs) have shown promising performance in generating realistic images. However, they often struggle in learning complex underlying modalities in a given dataset, resulting in poor-quality generated images. To mitigate this problem, we present a novel approach called mixture of experts GAN (MEGAN), an ensemble approach of multiple generator networks. Each generator network in MEGAN specializes in generating images with a particular subset of modalities, e.g., an image class. Instead of incorporating a separate step of handcrafted clustering of multiple modalities, our proposed model is trained through an end-to-end learning of multiple generators via gating networks, which is responsible for choosing the appropriate generator network for a given condition. We adopt the categorical reparameterization trick for a categorical decision to be made in selecting a generator while maintaining the flow of the gradients. We demonstrate that individual generators learn different and salient subparts of the data and achieve a multiscale structural similarity (MS-SSIM) score of 0.2470 for CelebA and a competitive unsupervised inception score of 8.33 in CIFAR-10.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

A Mixture-of-Experts Framework for Practical Hybrid-Quantum Models in Credit Card Fraud Detection
quant-ph 2026-03 unverdicted novelty 5.0

A mixture-of-experts hybrid quantum model achieves 0.793 average precision on credit card fraud detection compared to 0.770 for XGBoost, with modest extra inference time.