Abstract
Rectified Flows learn ODE vector fields whose trajectories are straight
between source and target distributions, enabling near one-step inference. We
show that this straight-path objective conceals fundamental failure modes:
under deterministic training, low gradient variance drives memorization of
arbitrary training pairings, even when interpolant lines between pairs
intersect. To analyze this mechanism, we study Gaussian-to-Gaussian transport
and use the loss gradient variance across stochastic and deterministic regimes
to characterize which vector fields optimization favors in each setting. We
then show that, in a setting where all interpolating lines intersect, applying
Rectified Flow yields the same specific pairings at inference as during
training. More generally, we prove that a memorizing vector field exists even
when training interpolants intersect, and that optimizing the straight-path
objective converges to this ill-defined field. At inference, deterministic
integration reproduces the exact training pairings. We validate our findings
empirically on the CelebA dataset, confirming that deterministic interpolants
induce memorization, while the injection of small noise restores
generalization.
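
The abstract describes training on straight-line interpolants between paired samples, with deterministic pairings inducing memorization and small noise on the interpolant restoring generalization. The sketch below is a minimal, hypothetical illustration of that straight-path objective, not the authors' code; names such as `rectified_flow_loss`, `VelocityNet`, and the `noise_std` parameter are assumptions made for illustration.

```python
# Minimal sketch of the Rectified Flow straight-path objective in a toy setup
# (our illustration, not the paper's implementation).
import torch

def rectified_flow_loss(v_net, x0, x1, noise_std=0.0):
    """Straight-path loss E||v(x_t, t) - (x1 - x0)||^2 with x_t = (1-t) x0 + t x1."""
    t = torch.rand(x0.shape[0], 1)                       # uniform time samples in [0, 1)
    x_t = (1.0 - t) * x0 + t * x1                        # straight-line interpolant
    if noise_std > 0.0:                                   # optional small-noise perturbation
        x_t = x_t + noise_std * torch.randn_like(x_t)
    target = x1 - x0                                      # constant velocity along the line
    pred = v_net(x_t, t)
    return ((pred - target) ** 2).mean()

class VelocityNet(torch.nn.Module):
    """Tiny velocity network (hypothetical architecture, for illustration only)."""
    def __init__(self, dim):
        super().__init__()
        self.net = torch.nn.Sequential(
            torch.nn.Linear(dim + 1, 128), torch.nn.SiLU(),
            torch.nn.Linear(128, dim),
        )
    def forward(self, x, t):
        return self.net(torch.cat([x, t], dim=-1))
```

Training with fixed (x0, x1) pairings and `noise_std=0.0` corresponds to the deterministic regime studied in the paper, while a small positive `noise_std` is one simple way to realize the noisy interpolant that the abstract credits with restoring generalization.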
Authors (4)
Teodora Reu
Sixtine Dromigny
Michael Bronstein
Francisco Vargas
Submitted
October 20, 2025
Key Contributions
This paper shows that the straight-path objective in Rectified Flows can conceal failure modes driven by low gradient variance, leading to memorization of arbitrary training pairings. The authors analyze this mechanism via the loss gradient variance in stochastic versus deterministic training regimes and prove that the straight-path objective converges to a memorizing vector field, explaining why deterministic inference can reproduce the training pairings rather than generalize.
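
As a rough illustration of the gradient-variance argument above, the following hypothetical probe (our sketch, not from the paper) estimates the variance of the loss gradient across batches when pairings are held fixed versus independently resampled; under the paper's analysis, the deterministic regime exhibits much lower gradient variance and drives memorization.

```python
# Hypothetical gradient-variance probe (our sketch, not the authors' code).
# Compares the spread of loss gradients across batches for fixed pairings
# (deterministic regime) versus freshly resampled pairings (stochastic regime).
import torch

def grad_variance(v_net, make_batch, n_batches=64):
    """Mean per-parameter variance of the straight-path loss gradient over batches."""
    grads = []
    for _ in range(n_batches):
        x0, x1 = make_batch()                              # returns a (source, target) batch
        loss = rectified_flow_loss(v_net, x0, x1)          # loss from the sketch above
        g = torch.autograd.grad(loss, list(v_net.parameters()))
        grads.append(torch.cat([p.reshape(-1) for p in g]))
    return torch.stack(grads).var(dim=0).mean().item()

# Deterministic regime: make_batch always returns the same stored (x0, x1) pairs.
# Stochastic regime: make_batch re-pairs each x1 with an independently drawn x0.
```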
Business Value
Understanding and mitigating failure modes in generative models can lead to more reliable and robust AI systems for tasks like data synthesis, anomaly detection, and creative content generation.