GIT-Mol: A multi-modal large language model for molecular science with graph, image, and text

Cited by: 3
Authors
Liu, Pengfei [1, 2]
Ren, Yiming [1]
Tao, Jun [2]
Ren, Zhixiang [1]
Affiliations
[1] Peng Cheng Lab, Shenzhen 518055, Guangdong, Peoples R China
[2] Sun Yat sen Univ, Sch Comp Sci & Engn, Guangzhou 510006, Guangdong, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Molecular representation; Molecule generation; Large language model; Multi-modality;
DOI
10.1016/j.compbiomed.2024.108073
Chinese Library Classification number
Q [Biological Sciences]
Discipline classification code
07 ; 0710 ; 09 ;
Abstract
Large language models have made significant strides in natural language processing, enabling innovative applications in molecular science by processing textual representations of molecules. However, most existing language models cannot capture the rich information carried by complex molecular structures or images. In this paper, we introduce GIT-Mol, a multi-modal large language model that integrates graph, image, and text information. To facilitate the integration of multi-modal molecular data, we propose GIT-Former, a novel architecture capable of aligning all modalities into a unified latent space. We achieve a 5%-10% accuracy increase in property prediction and a 20.2% boost in molecule generation validity compared to the baselines. With the any-to-language molecular translation strategy, our model has the potential to perform more downstream tasks, such as compound name recognition and chemical reaction prediction.
Pages: 11