Redirecting to original paper in 30 seconds...

Click below to go immediately or wait for automatic redirect

arxiv_ai 96% Match Research Paper ML Researchers,Computer Vision Engineers,Data Scientists 2 weeks ago

IB-GAN: Disentangled Representation Learning with Information Bottleneck Generative Adversarial Networks

generative-ai › gans
📄 Abstract

Abstract: We propose a new GAN-based unsupervised model for disentangled representation learning. The new model is discovered in an attempt to utilize the Information Bottleneck (IB) framework to the optimization of GAN, thereby named IB-GAN. The architecture of IB-GAN is partially similar to that of InfoGAN but has a critical difference; an intermediate layer of the generator is leveraged to constrain the mutual information between the input and the generated output. The intermediate stochastic layer can serve as a learnable latent distribution that is trained with the generator jointly in an end-to-end fashion. As a result, the generator of IB-GAN can harness the latent space in a disentangled and interpretable manner. With the experiments on dSprites and Color-dSprites dataset, we demonstrate that IB-GAN achieves competitive disentanglement scores to those of state-of-the-art \b{eta}-VAEs and outperforms InfoGAN. Moreover, the visual quality and the diversity of samples generated by IB-GAN are often better than those by \b{eta}-VAEs and Info-GAN in terms of FID score on CelebA and 3D Chairs dataset.
Authors (4)
Insu Jeon
Wonkwang Lee
Myeongjang Pyeon
Gunhee Kim
Submitted
October 23, 2025
arXiv Category
cs.CV
arXiv PDF

Key Contributions

Proposes IB-GAN, a novel GAN-based unsupervised model that integrates the Information Bottleneck framework to achieve disentangled representation learning. It constrains mutual information in an intermediate layer, leading to a more interpretable and disentangled latent space compared to InfoGAN.

Business Value

Enables the creation of more interpretable and controllable generative models, which can be valuable in applications like data augmentation, style transfer, and generating synthetic data with specific attributes.