Redirecting to original paper in 30 seconds...
Click below to go immediately or wait for automatic redirect
📄 Abstract
Abstract: mRNA design and optimization are important in synthetic biology and
therapeutic development, but remain understudied in machine learning.
Systematic optimization of mRNAs is hindered by the scarce and imbalanced data
as well as complex sequence-function relationships. We present RNAGenScape, a
property-guided manifold Langevin dynamics framework that iteratively updates
mRNA sequences within a learned latent manifold. RNAGenScape combines an
organized autoencoder, which structures the latent space by target properties
for efficient and biologically plausible exploration, with a manifold projector
that contracts each step of update back to the manifold. RNAGenScape supports
property-guided optimization and smooth interpolation between sequences, while
remaining robust under scarce and undersampled data, and ensuring that
intermediate products are close to the viable mRNA manifold. Across three real
mRNA datasets, RNAGenScape improves the target properties with high success
rates and efficiency, outperforming various generative or optimization methods
developed for proteins or non-biological data. By providing continuous,
data-aligned trajectories that reveal how edits influence function, RNAGenScape
establishes a scalable paradigm for controllable mRNA design and latent space
exploration in mRNA sequence modeling.
Authors (11)
Danqi Liao
Chen Liu
Xingzhi Sun
Dié Tang
Haochen Wang
Scott Youlten
+5 more
Submitted
October 14, 2025
Key Contributions
Introduces RNAGenScape, a property-guided manifold Langevin dynamics framework for mRNA sequence optimization and interpolation. It combines an organized autoencoder to structure the latent space by target properties with a manifold projector, ensuring generated sequences are viable and robust under scarce data conditions.
Business Value
Accelerates the design and development of novel mRNA-based therapeutics and synthetic biological systems by providing an efficient and robust method for sequence optimization and generation.