Redirecting to original paper in 30 seconds...
Click below to go immediately or wait for automatic redirect
📄 Abstract
Abstract: In recent years, diffusion models trained on equilibrium molecular
distributions have proven effective for sampling biomolecules. Beyond direct
sampling, the score of such a model can also be used to derive the forces that
act on molecular systems. However, while classical diffusion sampling usually
recovers the training distribution, the corresponding energy-based
interpretation of the learned score is often inconsistent with this
distribution, even for low-dimensional toy systems. We trace this inconsistency
to inaccuracies of the learned score at very small diffusion timesteps, where
the model must capture the correct evolution of the data distribution. In this
regime, diffusion models fail to satisfy the Fokker--Planck equation, which
governs the evolution of the score. We interpret this deviation as one source
of the observed inconsistencies and propose an energy-based diffusion model
with a Fokker--Planck-derived regularization term to enforce consistency. We
demonstrate our approach by sampling and simulating multiple biomolecular
systems, including fast-folding proteins, and by introducing a state-of-the-art
transferable Boltzmann emulator for dipeptides that supports simulation and
achieves improved consistency and efficient sampling. Our code, model weights,
and self-contained JAX and PyTorch notebooks are available at
https://github.com/noegroup/ScoreMD.
Key Contributions
This paper addresses the inconsistency between the learned score and the training distribution in energy-based diffusion models for molecular systems. They propose a novel regularization term derived from the Fokker-Planck equation to enforce consistency, which is crucial for accurate force field derivation and reliable molecular dynamics simulations. This work improves the physical interpretability and accuracy of diffusion models in scientific applications.
Business Value
Improved accuracy in molecular simulations can accelerate drug discovery and materials design by enabling more reliable predictions of molecular behavior and properties, reducing the need for expensive physical experiments.