
The Spacetime of Diffusion Models: An Information Geometry Perspective


Abstract

We present a novel geometric perspective on the latent space of diffusion models. We first show that the standard pullback approach, utilizing the deterministic probability flow ODE decoder, is fundamentally flawed. It provably forces geodesics to decode as straight segments in data space, effectively ignoring any intrinsic data geometry beyond the ambient Euclidean space. Complementing this view, diffusion also admits a stochastic decoder via the reverse SDE, which enables an information geometric treatment with the Fisher-Rao metric. However, a choice of $x_T$ as the latent representation collapses this metric due to memorylessness. We address this by introducing a latent spacetime $z=(x_t,t)$ that indexes the family of denoising distributions $p(x_0 | x_t)$ across all noise scales, yielding a nontrivial geometric structure. We prove these distributions form an exponential family and derive simulation-free estimators for curve lengths, enabling efficient geodesic computation. The resulting structure induces a principled Diffusion Edit Distance, where geodesics trace minimal sequences of noise and denoise edits between data. We also demonstrate benefits for transition path sampling in molecular systems, including constrained variants such as low-variance transitions and region avoidance. Code is available at: https://github.com/rafalkarczewski/spacetime-geometry
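To make the spacetime idea concrete, here is a minimal toy sketch. It is not the authors' estimator. It assumes a one-dimensional data prior $x_0 \sim N(0, 1)$ and a forward process $x_t = x_0 + \sigma \epsilon$, so the denoising distribution $p(x_0 | x_t)$ is Gaussian in closed form. Under these assumptions the denoising distributions form an exponential family with sufficient statistics $(x_0, x_0^2)$ and natural parameters $(x_t/\sigma^2, -1/(2\sigma^2))$, and the Fisher-Rao energy of a short segment is approximately the inner product of the change in natural parameters with the change in expectation parameters. All function names are illustrative.

```python
import math

def natural_params(x_t, sigma):
    """Natural parameters of p(x0 | x_t), assuming x_t = x0 + sigma * eps."""
    return (x_t / sigma**2, -0.5 / sigma**2)

def expectation_params(x_t, sigma):
    """(E[x0 | x_t], E[x0^2 | x_t]) for the assumed toy prior x0 ~ N(0, 1)."""
    mean = x_t / (1.0 + sigma**2)
    var = sigma**2 / (1.0 + sigma**2)
    return (mean, var + mean**2)

def curve_length(points):
    """Approximate Fisher-Rao length of a polyline of (x_t, sigma) points.

    Each segment contributes sqrt(d_eta . d_mu), the discrete analogue of
    sqrt(d_eta^T I(eta) d_eta) in an exponential family.
    """
    total = 0.0
    for a, b in zip(points, points[1:]):
        eta_a, eta_b = natural_params(*a), natural_params(*b)
        mu_a, mu_b = expectation_params(*a), expectation_params(*b)
        energy = sum((eb - ea) * (mb - ma)
                     for ea, eb, ma, mb in zip(eta_a, eta_b, mu_a, mu_b))
        total += math.sqrt(max(energy, 0.0))  # clamp tiny negative round-off
    return total

def straight_path(z0, z1, n=200):
    """Evenly spaced points on the straight line from z0 to z1 in (x_t, sigma)."""
    return [tuple(a + (b - a) * k / n for a, b in zip(z0, z1))
            for k in range(n + 1)]

# Moving x_t from 0 to 2 at fixed sigma = 1: the closed-form length is
# |dx| / (sigma * sqrt(1 + sigma^2)) = sqrt(2).
horizontal = curve_length(straight_path((0.0, 1.0), (2.0, 1.0)))
# A path that also changes the noise level.
diagonal = curve_length(straight_path((0.0, 0.5), (1.0, 2.0)))
```

No sampling from the reverse process is needed here, because the length depends only on posterior moments. In a trained diffusion model these moments would presumably come from a denoiser rather than a closed form, which is an assumption of this sketch rather than a statement of the paper's procedure.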

Authors (5)

Rafał Karczewski

Markus Heinonen

Alison Pouplin

Søren Hauberg

Vikas Garg

Submitted

May 23, 2025

arXiv Category

cs.LG


Key Contributions

This paper introduces a novel information geometric perspective on the latent space of diffusion models. It identifies flaws in the standard pullback approach and proposes a latent spacetime representation that enables a non-trivial geometric structure, allowing for efficient geodesic computation and a deeper understanding of the denoising process.

Business Value

Provides a deeper theoretical understanding of diffusion models, which could lead to more efficient and controllable generative models for various applications like image synthesis, drug discovery, and material design.

Paper Metadata

Innovation Type

Theoretical Framework

Deployment Feasibility

High (theoretical contribution)

Limitations Addressed

Addresses two issues: the standard pullback approach, built on the deterministic probability flow ODE decoder, ignores intrinsic data geometry, and the Fisher-Rao metric collapses when $x_T$ is used as the latent representation.
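The pullback issue can be illustrated with a small sketch. It uses a hypothetical invertible two-dimensional map as a stand-in for the probability flow ODE decoder, not an actual diffusion model. For any invertible decoder, the pullback length of a latent curve equals the Euclidean length of its decoded image, so the shortest latent curve between two points is the preimage of the straight segment joining their decodings.

```python
import math

def decode(z):
    """Toy invertible decoder (hypothetical stand-in for a flow map)."""
    z1, z2 = z
    return (z1, z2 + z1**2)

def encode(x):
    """Exact inverse of the toy decoder."""
    x1, x2 = x
    return (x1, x2 - x1**2)

def lerp(a, b, s):
    """Linear interpolation between points a and b at fraction s."""
    return tuple(ai + s * (bi - ai) for ai, bi in zip(a, b))

def pullback_length(latent_curve):
    """Polyline approximation of length under the pullback metric,
    i.e. the Euclidean length of the decoded curve."""
    decoded = [decode(z) for z in latent_curve]
    return sum(math.dist(p, q) for p, q in zip(decoded, decoded[1:]))

z_a, z_b = (-1.0, 0.0), (1.0, 0.0)
steps = [k / 100 for k in range(101)]

# Straight line in latent space: decodes to a parabola in data space.
latent_line = [lerp(z_a, z_b, s) for s in steps]
# Pullback geodesic: preimage of the straight data-space segment.
geodesic = [encode(lerp(decode(z_a), decode(z_b), s)) for s in steps]

geodesic_length = pullback_length(geodesic)        # Euclidean distance in data space
latent_line_length = pullback_length(latent_line)  # strictly longer
```

In this toy the geodesic length is just the Euclidean distance between the two decoded points, regardless of how the decoder bends the latent space, which mirrors the abstract's claim that the pullback geometry sees nothing beyond the ambient Euclidean space.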

Technical Tags

diffusion models, information geometry, Fisher-Rao metric, ODE decoder, SDE decoder, latent space, geodesics, exponential family, denoising distributions

Research Topics

Geometric Deep Learning, Generative Models, Information Theory, Stochastic Processes

Methods & Architectures

Pullback approach, Stochastic decoder, Information geometric treatment, Fisher-Rao metric, Latent spacetime construction, Simulation-free estimators, Diffusion Models

Applications & Tasks

Generative Modeling, Image Synthesis, Understanding latent space geometry, Improving diffusion model expressivity, Latent space analysis, Geodesic computation

Related Fields

Differential Geometry, Probability Theory, Machine Learning Theory

Keywords

diffusion models, information geometry, latent space, Fisher-Rao metric, geodesics, stochastic differential equations, ordinary differential equations, generative models, probability flow, denoising, exponential family, spacetime representation

Academic Context

#Geometric Deep Learning #Generative Models #Information Theory #Stochastic Processes

Commercial Potential

Competitive Edge

Offers a new theoretical lens compared to existing empirical studies on diffusion models.

Production Readiness

Maturity Level

Theoretical
