Abstract
In the molecular domain, numerous studies have explored the use of multimodal
large language models (LLMs) to construct a general-purpose, multi-task
molecular model. However, these efforts are still far from achieving a truly
universal molecular model. We identify three key challenges in this endeavor:
(1) Existing molecular task datasets are typically small in scale and lack
comprehensive domain coverage. (2) Tasks from different molecular subfields are
difficult for LLMs to learn jointly, because significant distributional shifts
and competition among tasks destabilize training. (3) Both inter-task and
intra-task molecular
representations demand different intrinsic dimensions in the language space,
making it challenging to balance redundancy against insufficiency in
language-model representations. To address these challenges, we
categorize existing small-molecule tasks into four types: Mol2Mol, Mol2Text,
Mol2Num, and Text2Mol. We then collect a dataset encompassing over 16 tasks
with more than 1.4 million samples, making it the largest molecular
instruction-tuning dataset to date. Leveraging the extensive pretraining of
LLMs on existing chemical literature, we propose a novel multimodal LLM
framework, named Omni-Mol, which unifies all small-molecule tasks and supports
both molecular generation and understanding. The core of Omni-Mol is our
proposed MoGE, which dynamically adapts to the intrinsic rank of different
tasks. This mixture-of-experts architecture enhances the model's ability to
handle diverse tasks and modalities effectively. Our model achieves unified
instruction tuning across 16 tasks and attains state-of-the-art performance on
13 of them. Extensive experiments further demonstrate the scalability and
versatility of Omni-Mol.
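Since the abstract describes MoGE only at a high level, the following is a minimal, hypothetical PyTorch sketch of a rank-adaptive mixture-of-experts layer in that spirit: each expert is a low-rank projection with its own rank, and a gating network softly routes each token across them. The class names, expert ranks, and soft-routing rule are illustrative assumptions, not the paper's exact design.

```python
# Illustrative sketch of a rank-adaptive mixture-of-experts layer.
# Assumptions (not from the paper): LoRA-style low-rank experts,
# token-wise soft gating, and a residual connection.
import torch
import torch.nn as nn
import torch.nn.functional as F

class LowRankExpert(nn.Module):
    """One expert: a low-rank (LoRA-style) projection with its own rank."""
    def __init__(self, d_model: int, rank: int):
        super().__init__()
        self.down = nn.Linear(d_model, rank, bias=False)  # d_model -> rank
        self.up = nn.Linear(rank, d_model, bias=False)    # rank -> d_model

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return self.up(self.down(x))

class RankAdaptiveMoE(nn.Module):
    """Mixes experts of different intrinsic ranks, so tasks that need
    richer representations can lean on higher-rank experts."""
    def __init__(self, d_model: int, ranks=(4, 8, 16, 32)):
        super().__init__()
        self.experts = nn.ModuleList(LowRankExpert(d_model, r) for r in ranks)
        self.gate = nn.Linear(d_model, len(ranks))  # token-wise gating scores

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x: (batch, seq_len, d_model)
        weights = F.softmax(self.gate(x), dim=-1)                        # (B, S, E)
        expert_outs = torch.stack([e(x) for e in self.experts], dim=-2)  # (B, S, E, D)
        mixed = (weights.unsqueeze(-1) * expert_outs).sum(dim=-2)        # (B, S, D)
        return x + mixed  # residual connection

# Usage example:
# layer = RankAdaptiveMoE(d_model=768)
# y = layer(torch.randn(2, 10, 768))  # same shape as input
```

The key design idea this sketch tries to capture is that the effective rank of the update is chosen per token by the gate, rather than fixed globally, which matches the abstract's claim that different tasks demand different intrinsic dimensions.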
Authors (6)
Chengxin Hu
Hao Li
Yihe Yuan
Zezheng Song
Chenyang Zhao
Haixin Wang
Submitted
February 3, 2025
Key Contributions
This paper introduces Omni-Mol, a multi-task molecular model designed for any-to-any modalities. It addresses key challenges in molecular LLMs: small dataset scale, limited domain coverage, inter-task competition arising from distributional shifts, and the need to balance inter-task and intra-task representation dimensionality in the language space. The proposed model aims toward a truly universal molecular model.
Business Value
Enables more efficient and comprehensive molecular modeling, potentially accelerating drug discovery and materials science research by providing a unified platform for diverse molecular tasks.