Hybrid-LLM-GNN: integrating large language models and graph neural networks for enhanced materials property prediction

Authors
Li, Youjia [1]
Gupta, Vishu [1, 2, 3]
Kilic, Muhammed Nur Talha [4]
Choudhary, Kamal [5, 6]
Wines, Daniel [5]
Liao, Wei-keng [1]
Choudhary, Alok [1]
Agrawal, Ankit [1]
Affiliations
[1] Northwestern Univ, Dept Elect & Comp Engn, Evanston, IL 60208 USA
[2] Princeton Univ, Lewis Sigler Inst Integrat Genom, Princeton, NJ USA
[3] Princeton Univ, Ludwig Inst Canc Res, Princeton, NJ USA
[4] Northwestern Univ, Dept Comp Sci, Evanston, IL USA
[5] Natl Inst Stand & Technol, Mat Measurement Lab, 100 Bur Dr, Gaithersburg, MD USA
[6] DeepMat LLC, Silver Spring, MD 20906 USA
Source
DIGITAL DISCOVERY | 2025 / Vol. 4 / Issue 02
Funding
National Science Foundation (USA);
DOI
10.1039/d4dd00199k
Chinese Library Classification
O6 [Chemistry];
Discipline classification code
0703;
Abstract
Graph-centric learning has attracted significant interest in materials informatics. Accordingly, a family of graph-based machine learning models, primarily utilizing Graph Neural Networks (GNNs), has been developed to provide accurate prediction of material properties. In recent years, Large Language Models (LLMs) have revolutionized existing scientific workflows that process text representations, thanks to their exceptional ability to utilize extensive common knowledge for understanding semantics. With the help of automated text representation tools, fine-tuned LLMs have demonstrated competitive prediction accuracy as standalone predictors. In this paper, we propose integrating the insights from GNNs and LLMs to enhance both prediction accuracy and model interpretability. Inspired by the feature-extraction-based transfer learning study for the GNN model, we introduce a novel framework that extracts and combines GNN and LLM embeddings to predict material properties. We employ ALIGNN as the GNN model and BERT and MatBERT as the LLM models. We evaluate the proposed framework in cross-property scenarios using 7 properties. The combined feature extraction approach using GNN and LLM embeddings outperforms the GNN-only approach in the majority of cases, with up to 25% improvement in accuracy. We also conduct a model explanation analysis through text erasure, interpreting the model predictions by examining the contribution of different parts of the text representation.
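The core idea of combining GNN and LLM embeddings can be illustrated with a minimal sketch. This is not the authors' implementation: the embeddings below are random stand-ins (no ALIGNN, BERT, or MatBERT model is loaded), the embedding sizes are arbitrary, and a closed-form ridge regressor is swapped in as a hypothetical downstream predictor. The synthetic target is constructed to depend on both embedding sets, so the lower error of the combined features is built in by design and only demonstrates the mechanics of feature concatenation, not the paper's reported results.

```python
import numpy as np

def fit_ridge(X, y, alpha=1.0):
    """Closed-form ridge regression with an unpenalized bias term."""
    Xb = np.hstack([X, np.ones((X.shape[0], 1))])
    reg = alpha * np.eye(Xb.shape[1])
    reg[-1, -1] = 0.0  # leave the bias weight unregularized
    return np.linalg.solve(Xb.T @ Xb + reg, Xb.T @ y)

def predict(w, X):
    """Apply the fitted weights (last entry is the bias)."""
    Xb = np.hstack([X, np.ones((X.shape[0], 1))])
    return Xb @ w

def mae(y_true, y_pred):
    """Mean absolute error."""
    return float(np.mean(np.abs(y_true - y_pred)))

rng = np.random.default_rng(0)
n, d_gnn, d_llm = 400, 16, 8  # assumed sizes, for illustration only

# Random stand-ins for embeddings extracted from pretrained models.
gnn_emb = rng.normal(size=(n, d_gnn))  # structure-based (GNN) features
llm_emb = rng.normal(size=(n, d_llm))  # text-based (LLM) features

# Synthetic property that depends on BOTH feature sets (an assumption).
y = (gnn_emb @ rng.normal(size=d_gnn)
     + llm_emb @ rng.normal(size=d_llm)
     + 0.1 * rng.normal(size=n))

# Combine the two embeddings by simple concatenation per sample.
combined = np.concatenate([gnn_emb, llm_emb], axis=1)

train, test = slice(0, 300), slice(300, n)
w_gnn = fit_ridge(gnn_emb[train], y[train])
w_comb = fit_ridge(combined[train], y[train])

mae_gnn = mae(y[test], predict(w_gnn, gnn_emb[test]))
mae_comb = mae(y[test], predict(w_comb, combined[test]))
print(f"GNN-only MAE: {mae_gnn:.3f}  combined MAE: {mae_comb:.3f}")
```

In a real pipeline, `gnn_emb` and `llm_emb` would be replaced by features extracted from the pretrained models for the same set of materials, with rows aligned by sample.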
Pages: 376-383
Page count: 8