Predicting transcription factor binding sites by a multi-modal representation learning method based on cross-attention network

被引:0
|
作者
Wei, Yuxiao [1 ]
Zhang, Qi [2 ]
Liu, Liwei [2 ]
机构
[1] Dalian Jiaotong Univ, Coll Software, Dalian 116028, Peoples R China
[2] Dalian Jiaotong Univ, Coll Sci, Dalian 116028, Peoples R China
关键词
Transcription factor binding sites; Deep learning; Cross-attention mechanism; Model interpretability; CHIP-SEQ; DNA; SPECIFICITIES;
D O I
10.1016/j.asoc.2024.112134
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The prediction of transcription factor binding sites (TFBS) plays a crucial role in studying cellular functions and understanding transcriptional regulatory processes. With the development of chromatin immunoprecipitation sequencing (ChIP-seq) technology, an increasing number of computer-aided TFBS prediction models have emerged. However, how to integrate multi-modal information of DNA and obtain efficient features to improve prediction accuracy remains a major challenge. Here, we propose MultiTF, a multi-modal representation learning method based on a cross-attention network for predicting transcription factor binding sites. Among TFBS prediction methods, we are the first to use graph neural networks and cross-attention networks for representation learning. MultiTF uses dna2vec to extract global contextual features of DNA sequences, DNAshapeR to extract shape features, and the CDPfold model and graph attention network for learning and representation of DNA structural features. Finally, with the help of our cross-attention module, we successfully combine sequence, structural, and shape features to achieve interactive fusion. When comparing MultiTF to other state-of-the-art methods using 165 ENCODE ChIP-seq datasets, we find that MultiTF exhibits average ACC, ROC-AUC, and PR-AUC values of 0.911, 0.978, and 0.982, respectively. The results show that MultiTF achieves unprecedented prediction accuracy compared to previous TFBS prediction models. In addition, our visual analysis of structural features provides interpretability for the prediction results.
引用
收藏
页数:10
相关论文
共 50 条
  • [31] MLSNet: a deep learning model for predicting transcription factor binding sites
    Zhang, Yuchuan
    Wang, Zhikang
    Ge, Fang
    Wang, Xiaoyu
    Zhang, Yiwen
    Li, Shanshan
    Guo, Yuming
    Song, Jiangning
    Yu, Dong-Jun
    BRIEFINGS IN BIOINFORMATICS, 2024, 25 (06)
  • [32] Machine Learning Based Multi-Modal Transportation Network Planner
    Manghat, Neeraj Menon
    Gopalakrishna, Vaishak
    Bonthu, Sai
    Hunt, Victor
    Helmicki, Arthur
    McClintock, Doug
    INTERNATIONAL CONFERENCE ON TRANSPORTATION AND DEVELOPMENT 2024: TRANSPORTATION SAFETY AND EMERGING TECHNOLOGIES, ICTD 2024, 2024, : 380 - 389
  • [33] Multi-modal knowledge graphs representation learning via multi-headed self-attention
    Wang, Enqiang
    Yu, Qing
    Chen, Yelin
    Slamu, Wushouer
    Luo, Xukang
    INFORMATION FUSION, 2022, 88 : 78 - 85
  • [34] Multi-scale network with shared cross-attention for audio–visual correlation learning
    Jiwei Zhang
    Yi Yu
    Suhua Tang
    Wei Li
    Jianming Wu
    Neural Computing and Applications, 2023, 35 : 20173 - 20187
  • [35] Single-shot hyperspectral imaging based on dual attention neural network with multi-modal learning
    He, Tianyue
    Zhang, Qican
    Zhou, Mingwei
    Kou, Tingdong
    Shen, Junfei
    OPTICS EXPRESS, 2022, 30 (06) : 9790 - 9813
  • [36] Holistic-Based Cross-Attention Modal Fusion Network for Video Sign Language Recognition
    Gao, Qing
    Hu, Jing
    Mai, Haixing
    Ju, Zhaojie
    IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2024,
  • [37] Multi-modal Multi-relational Feature Aggregation Network for Medical Knowledge Representation Learning
    Zhang, Yingying
    Fang, Quan
    Qian, Shengsheng
    Xu, Changsheng
    MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, : 3956 - 3965
  • [38] Prediction of Transcription Factor Binding Sites With an Attention Augmented Convolutional Neural Network
    Jing, Fang
    Zhang, Shao-Wu
    Zhang, Shihua
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2022, 19 (06) : 3614 - 3623
  • [39] Multi-Modal Representation via Contrastive Learning with Attention Bottleneck Fusion and Attentive Statistics Features
    Guo, Qinglang
    Liao, Yong
    Li, Zhe
    Liang, Shenglin
    ENTROPY, 2023, 25 (10)
  • [40] Attention-Based Multi-Modal Fusion Network for Semantic Scene Completion
    Li, Siqi
    Zou, Changqing
    Li, Yipeng
    Zhao, Xibin
    Gao, Yue
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 11402 - 11409