Redirecting to original paper in 30 seconds...

Click below to go immediately or wait for automatic redirect

arxiv_ml 85% Match Research Paper Computational Biologists,Protein Engineers,Synthetic Biologists,Drug Discovery Scientists,AI Researchers in Biology 1 week ago

EnzyControl: Adding Functional and Substrate-Specific Control for Enzyme Backbone Generation

generative-ai › autoregressive
📄 Abstract

Abstract: Designing enzyme backbones with substrate-specific functionality is a critical challenge in computational protein engineering. Current generative models excel in protein design but face limitations in binding data, substrate-specific control, and flexibility for de novo enzyme backbone generation. To address this, we introduce EnzyBind, a dataset with 11,100 experimentally validated enzyme-substrate pairs specifically curated from PDBbind. Building on this, we propose EnzyControl, a method that enables functional and substrate-specific control in enzyme backbone generation. Our approach generates enzyme backbones conditioned on MSA-annotated catalytic sites and their corresponding substrates, which are automatically extracted from curated enzyme-substrate data. At the core of EnzyControl is EnzyAdapter, a lightweight, modular component integrated into a pretrained motif-scaffolding model, allowing it to become substrate-aware. A two-stage training paradigm further refines the model's ability to generate accurate and functional enzyme structures. Experiments show that our EnzyControl achieves the best performance across structural and functional metrics on EnzyBind and EnzyBench benchmarks, with particularly notable improvements of 13\% in designability and 13\% in catalytic efficiency compared to the baseline models. The code is released at https://github.com/Vecteur-libre/EnzyControl.
Authors (9)
Chao Song
Zhiyuan Liu
Han Huang
Liang Wang
Qiong Wang
Jianyu Shi
+3 more
Submitted
October 29, 2025
arXiv Category
q-bio.BM
arXiv PDF

Key Contributions

This work introduces EnzyControl, a method for functional and substrate-specific control in enzyme backbone generation, addressing limitations of current generative models. It leverages a new dataset (EnzyBind) and a modular component (EnzyAdapter) integrated into a pretrained model, enabling generation conditioned on catalytic sites and substrates.

Business Value

Enables the rational design of novel enzymes with tailored functionalities, accelerating the development of biocatalysts for industrial processes, new therapeutics, and sustainable chemical synthesis.