
AlignFlow: Improving Flow-based Generative Models with Semi-Discrete Optimal Transport

Abstract

Flow-based Generative Models (FGMs) effectively transform noise into complex data distributions. Incorporating Optimal Transport (OT) to couple noise and data during FGM training has been shown to improve the straightness of flow trajectories, enabling more effective inference. However, existing OT-based methods estimate the OT plan using (mini-)batches of sampled noise and data points, which limits their scalability to large and high-dimensional datasets in FGMs. This paper introduces AlignFlow, a novel approach that leverages Semi-Discrete Optimal Transport (SDOT) to enhance the training of FGMs by establishing an explicit, optimal alignment between the noise distribution and data points with guaranteed convergence. SDOT computes a transport map by partitioning the noise space into Laguerre cells, each mapped to a corresponding data point. During FGM training, i.i.d. noise samples are paired with data points via the SDOT map. AlignFlow scales well to large datasets and model architectures with negligible computational overhead. Experimental results show that AlignFlow improves the performance of a wide range of state-of-the-art FGM algorithms and can be integrated as a plug-and-play component. Code is available at: https://github.com/konglk1203/AlignFlow.
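To make the pairing step concrete, the sketch below is a minimal, hypothetical illustration (not the authors' released implementation): given data points, precomputed SDOT dual weights, and a batch of i.i.d. noise, it assigns each noise sample to the data point whose Laguerre cell (under squared-Euclidean cost) contains it, which is the noise-data coupling used during training. The function name `sdot_assign` and the tensor shapes are assumptions.

```python
import torch

def sdot_assign(noise: torch.Tensor, data: torch.Tensor, weights: torch.Tensor) -> torch.Tensor:
    """Map each noise sample to the data point whose Laguerre cell contains it.

    With squared-Euclidean cost, the Laguerre cell of data point y_i is
        L_i = { z : ||z - y_i||^2 - w_i <= ||z - y_j||^2 - w_j for all j },
    where w are the SDOT dual weights (assumed precomputed).
    noise: (B, d), data: (N, d), weights: (N,) -> returns indices of shape (B,).
    """
    # Shifted squared distances define the Laguerre partition of noise space.
    cost = torch.cdist(noise, data).pow(2) - weights.unsqueeze(0)  # (B, N)
    return cost.argmin(dim=1)

# Illustrative use inside an FGM training step (flow-matching style, hypothetical):
#   z  = torch.randn(batch_size, d)                # i.i.d. noise
#   x1 = data[sdot_assign(z, data, weights)]       # SDOT-paired data points
#   the pair (z, x1) then replaces the random noise-data coupling in the usual loss
```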
Authors (7)
Lingkai Kong
Molei Tao
Yang Liu
Bryan Wang
Jinmiao Fu
Chien-Chih Wang
+1 more
Submitted: October 16, 2025
arXiv Category: cs.LG

Key Contributions

AlignFlow introduces Semi-Discrete Optimal Transport (SDOT) to enhance Flow-based Generative Models (FGMs). By partitioning the noise space into Laguerre cells mapped to data points, SDOT establishes an explicit, optimal alignment between noise and data distributions, offering guaranteed convergence and improved scalability for large, high-dimensional datasets.
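As a complement to the description above, here is a hedged sketch of how the dual weights that define the Laguerre cells could be fitted by stochastic ascent on the semi-discrete OT dual. This is a generic textbook-style scheme assuming uniform data weights and squared-Euclidean cost, not necessarily the convergence-guaranteed procedure used in the paper; all names are hypothetical.

```python
import torch

def fit_sdot_weights(data, sample_noise, n_iters=5000, batch=4096, lr=0.5):
    """Stochastic ascent on the semi-discrete OT dual (sketch, not the paper's algorithm).

    Each data point should receive mass 1/N under the optimal map; the dual
    (sub)gradient in w_i is 1/N - Pr[noise lands in Laguerre cell i], so w_i is
    raised when its cell is under-filled and lowered otherwise.
    data: (N, d); sample_noise(b) returns a (b, d) batch of i.i.d. noise.
    """
    n = data.shape[0]
    w = torch.zeros(n)
    target = 1.0 / n
    for _ in range(n_iters):
        z = sample_noise(batch)
        cost = torch.cdist(z, data).pow(2) - w.unsqueeze(0)   # (batch, N)
        idx = cost.argmin(dim=1)                              # Laguerre-cell assignment
        filled = torch.bincount(idx, minlength=n).float() / batch
        w = w + lr * (target - filled)                        # stochastic dual ascent step
    return w

# Example (hypothetical): weights = fit_sdot_weights(data, lambda b: torch.randn(b, data.shape[1]))
```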

Business Value

Enables the generation of higher-quality and more diverse synthetic data, which can be used for training other models, data augmentation, and privacy-preserving data sharing.