Redirecting to original paper in 30 seconds...

Click below to go immediately or wait for automatic redirect

arxiv_ml 90% Match Research Paper Computational chemists,Drug discovery scientists,Machine learning researchers in chemistry 19 hours ago

Prior-Guided Flow Matching for Target-Aware Molecule Design with Learnable Atom Number

generative-ai › flow-models
📄 Abstract

Abstract: Structure-based drug design (SBDD), aiming to generate 3D molecules with high binding affinity toward target proteins, is a vital approach in novel drug discovery. Although recent generative models have shown great potential, they suffer from unstable probability dynamics and mismatch between generated molecule size and the protein pockets geometry, resulting in inconsistent quality and off-target effects. We propose PAFlow, a novel target-aware molecular generation model featuring prior interaction guidance and a learnable atom number predictor. PAFlow adopts the efficient flow matching framework to model the generation process and constructs a new form of conditional flow matching for discrete atom types. A protein-ligand interaction predictor is incorporated to guide the vector field toward higher-affinity regions during generation, while an atom number predictor based on protein pocket information is designed to better align generated molecule size with target geometry. Extensive experiments on the CrossDocked2020 benchmark show that PAFlow achieves a new state-of-the-art in binding affinity (up to -8.31 Avg. Vina Score), simultaneously maintains favorable molecular properties.

Key Contributions

PAFlow is a novel target-aware molecular generation model that uses prior interaction guidance and a learnable atom number predictor. It employs flow matching for efficient generation and incorporates a protein-ligand interaction predictor to guide towards higher affinity, while an atom number predictor aligns molecule size with protein pocket geometry, addressing instability and size mismatch issues in prior models.

Business Value

Accelerates the drug discovery process by enabling the rapid and targeted generation of novel drug candidates with desired properties, potentially reducing R&D costs and time-to-market for new pharmaceuticals.