
arxiv: 1705.02894 · v2 · pith:ZFKR5QADnew · submitted 2017-05-08 · 📊 stat.ML · cond-mat.dis-nn · cs.AI · cs.CV · cs.LG

Geometric GAN

classification 📊 stat.ML · cond-mat.dis-nn · cs.AI · cs.CV · cs.LG
keywords geometric, hyperplane, separating, generative, adversarial, discriminator, generator, update

Generative Adversarial Nets (GANs) represent an important milestone for effective generative models and have inspired numerous variants that seem different from each other. One of the main contributions of this paper is to reveal a unified geometric structure in GAN and its variants. Specifically, we show that adversarial generative model training can be decomposed into three geometric steps: a separating hyperplane search, a discriminator parameter update away from the separating hyperplane, and a generator update along the normal vector direction of the separating hyperplane. This geometric intuition reveals the limitations of the existing approaches and leads us to propose a new formulation, called geometric GAN, that uses the SVM separating hyperplane, which maximizes the margin. Our theoretical analysis shows that the geometric GAN converges to a Nash equilibrium between the discriminator and generator. In addition, extensive numerical results show the superior performance of geometric GAN.
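
The geometric steps named in the abstract can be illustrated with a small sketch. It assumes the soft-margin (hinge) form in which SVM-style GAN objectives are usually written; the abstract itself does not state the loss, and all function names, the fixed hyperplane `(w, b)`, and the toy feature vectors below are hypothetical choices made for illustration, not the authors' implementation.

```python
# Minimal pure-Python sketch of an SVM-style (hinge) adversarial objective.
# Assumption: features live in a 2-D space and the hyperplane is given by (w, b).

def linear_score(w, b, feature):
    """Signed score w.x + b of a feature vector relative to the hyperplane."""
    return sum(wi * fi for wi, fi in zip(w, feature)) + b

def hinge_discriminator_loss(real_scores, fake_scores):
    """Soft-margin loss: real samples should score >= +1, fake samples <= -1."""
    real_term = sum(max(0.0, 1.0 - s) for s in real_scores) / len(real_scores)
    fake_term = sum(max(0.0, 1.0 + s) for s in fake_scores) / len(fake_scores)
    return real_term + fake_term

def generator_loss(fake_scores):
    """The generator tries to raise the score of its samples."""
    return -sum(fake_scores) / len(fake_scores)

def move_along_normal(w, feature, step):
    """Shift a fake feature along the hyperplane normal w (toward the real side)."""
    return [fi + step * wi for wi, fi in zip(w, feature)]

# Toy data (hypothetical): hyperplane x0 = 0 with normal pointing to the real side.
w, b = [1.0, 0.0], 0.0
real_features = [[2.0, 0.0], [0.5, 1.0]]
fake_features = [[-2.0, 0.0], [-0.5, 1.0]]

real_scores = [linear_score(w, b, f) for f in real_features]
fake_scores = [linear_score(w, b, f) for f in fake_features]

d_loss = hinge_discriminator_loss(real_scores, fake_scores)
g_loss_before = generator_loss(fake_scores)

# Generator step: move each fake feature along the normal direction.
moved = [move_along_normal(w, f, 1.0) for f in fake_features]
g_loss_after = generator_loss([linear_score(w, b, f) for f in moved])

print(d_loss, g_loss_before, g_loss_after)
```

In this toy setup only the samples inside the margin contribute to the discriminator loss, and shifting the fake features along `w` lowers the generator loss, which mirrors the "update along the normal vector direction" described above. In a real model the features would come from a neural network and the shifts would be realized through gradient updates of the generator parameters.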

This paper has not been read by Pith yet.


Forward citations

Cited by 12 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. AGAN: Towards Automated Design of Generative Adversarial Networks

    cs.LG 2019-06 unverdicted novelty 8.0

    AGAN is the first neural architecture search method for GANs that discovers architectures outperforming state-of-the-art on CIFAR-10 unsupervised image generation and competitive on supervised tasks.

  2. Learning Stratigraphically Consistent Relative Geologic Time from 3D Seismic Data via Sinusoidal Mapping

    physics.geo-ph 2026-05 unverdicted novelty 7.0

    RGT-Est reformulates relative geologic time learning via sinusoidal mapping and joint pointwise-perceptual-adversarial losses to enforce stratigraphic consistency, outperforming prior AI methods on field data especial...

  3. Learning Stratigraphically Consistent Relative Geologic Time from 3D Seismic Data via Sinusoidal Mapping

    physics.geo-ph 2026-05 unverdicted novelty 7.0

    RGT-Est transforms RGT estimation into a sinusoidal space with joint losses to capture fine horizons and global stratigraphic order from seismic data, outperforming prior AI methods especially with sparse horizon priors.

  4. Large Scale GAN Training for High Fidelity Natural Image Synthesis

    cs.LG 2018-09 accept novelty 7.0

    BigGANs achieve state-of-the-art class-conditional synthesis on ImageNet 128x128 with Inception Score 166.5 and FID 7.4 by scaling GANs and applying orthogonal regularization plus truncation.

  5. Tadpole: Autoencoders as Foundation Models for 3D PDEs with Online Learning

    cs.LG 2026-05 unverdicted novelty 6.0

    Tadpole is a pre-trained autoencoder foundation model for 3D PDEs that learns transferable representations from online-generated data and supports efficient fine-tuning for dynamics prediction and other tasks.

  6. Lightweight Unpaired Smartphone ISP Transfer with Semantic Pseudo-Pairing

    cs.CV 2026-05 conditional novelty 6.0

    Semantic pseudo-pairing via DINOv2 embeddings and fused Gromov-Wasserstein optimal transport enables training a 7K-parameter CNN for unpaired smartphone ISP, achieving 22.569 PSNR on the NTIRE 2026 challenge test set.

  7. Learning Stratigraphically Consistent Relative Geologic Time from 3D Seismic Data via Sinusoidal Mapping

    physics.geo-ph 2026-05 unverdicted novelty 6.0

    RGT-Est transforms relative geologic time estimation into a sinusoidal space and applies pointwise, perceptual, and adversarial losses to achieve better stratigraphic consistency and horizon correlation on seismic data.

  8. Continuous Adversarial Flow Models

    cs.LG 2026-04 unverdicted novelty 6.0

    Continuous adversarial flow models replace MSE in flow matching with adversarial training via a discriminator, improving guidance-free FID on ImageNet from 8.26 to 3.63 for SiT and similar gains for JiT and text-to-im...

  9. Exploring Clustering Capability of Inpainting Model Embeddings for Pattern-based Individual Identification

    cs.CV 2026-05 unverdicted novelty 5.0

    An inpainting auxiliary task improves the clustering of embeddings for individual zebrafish identification based on skin patterns.

  10. Woosh: A Sound Effects Foundation Model

    cs.SD 2026-04 accept novelty 5.0

    Woosh is a new publicly released foundation model optimized for high-quality sound effect generation from text or video, showing competitive or better results than open alternatives like Stable Audio Open.

  11. Neural Embedding for Physical Manipulations

    cs.LG 2019-07 unverdicted novelty 4.0

    A generative model with a normalized pairwise distance constraint discovers output space topologies from sparse data and outperforms GANs and VAEs by avoiding mode collapse.

  12. Venom: A PyTorch Generative Modeling Toolkit

    cs.LG 2026-05 unverdicted novelty 3.0

    Venom is an educational PyTorch toolkit that packages multiple generative modeling families under a single MNIST-first interface with reproducible scripts and tutorials.