Nmix: a hybrid deep learning model for precise prediction of 2'-O-methylation sites based on multi-feature fusion and ensemble learning

被引:0
|
作者
Geng, Yu-Qing [1 ]
Lai, Fei-Liao [1 ]
Luo, Hao [1 ]
Gao, Feng [1 ,2 ,3 ,4 ]
机构
[1] Tianjin Univ, Sch Sci, Dept Phys, 92 Weijin Rd, Tianjin 300072, Peoples R China
[2] Tianjin Univ, Frontiers Sci Ctr Synthet Biol, 92 Weijin Rd, Tianjin 300072, Peoples R China
[3] Tianjin Univ, Key Lab Syst Bioengn, Minist Educ, 92 Weijin Rd, Tianjin 300072, Peoples R China
[4] Collaborat Innovat Ctr Chem Sci & Engn Tianjin, SynBio Res Platform, 92 Weijin Rd, Tianjin 300072, Peoples R China
基金
中国国家自然科学基金;
关键词
2'-O-methylation; multi-feature fusion; deep learning; asymmetric loss; ensemble learning; HIGH-THROUGHPUT; MESSENGER-RNA; IDENTIFICATION; LANDSCAPE;
D O I
10.1093/bib/bbae601
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
RNA 2'-O-methylation (Nm) is a crucial post-transcriptional modification with significant biological implications. However, experimental identification of Nm sites is challenging and resource-intensive. While multiple computational tools have been developed to identify Nm sites, their predictive performance, particularly in terms of precision and generalization capability, remains deficient. We introduced Nmix, an advanced computational tool for precise prediction of Nm sites in human RNA. We constructed the largest, low-redundancy dataset of experimentally verified Nm sites and employed an innovative multi-feature fusion approach, combining one-hot, Z-curve and RNA secondary structure encoding. Nmix utilizes a meticulously designed hybrid deep learning architecture, integrating 1D/2D convolutional neural networks, self-attention mechanism and residual connection. We implemented asymmetric loss function and Bayesian optimization-based ensemble learning, substantially improving predictive performance on imbalanced datasets. Rigorous testing on two benchmark datasets revealed that Nmix significantly outperforms existing state-of-the-art methods across various metrics, particularly in precision, with average improvements of 33.1% and 60.0%, and Matthews correlation coefficient, with average improvements of 24.7% and 51.1%. Notably, Nmix demonstrated exceptional cross-species generalization capability, accurately predicting 93.8% of experimentally verified Nm sites in rat RNA. We also developed a user-friendly web server (https://tubic.org/Nm) and provided standalone prediction scripts to facilitate widespread adoption. We hope that by providing a more accurate and robust tool for Nm site prediction, we can contribute to advancing our understanding of Nm mechanisms and potentially benefit the prediction of other RNA modification sites.
引用
收藏
页数:14
相关论文
共 50 条
  • [21] Deep Learning-Based Multi-Feature Fusion for Communication and Radar Signal Sensing
    Li, Ting
    Liu, Tian
    Song, Zhangli
    Zhang, Lin
    Ma, Yiming
    ELECTRONICS, 2024, 13 (10)
  • [22] Non-intrusive Load Disaggregation Based on Deep Learning and Multi-feature Fusion
    Liu, Hang
    Liu, Chunyang
    Tian, Lijun
    Zhao, Haoran
    Liu, Junwei
    2021 3RD INTERNATIONAL CONFERENCE ON SMART POWER & INTERNET ENERGY SYSTEMS (SPIES 2021), 2021, : 210 - 215
  • [23] Research on Radar Target Classification Algorithm Based on Multi-feature Fusion and Deep Learning
    Zhang, Chengxin
    Wang, Ao
    Zhang, Yijin
    Zhang, Weibin
    2022 34TH CHINESE CONTROL AND DECISION CONFERENCE, CCDC, 2022, : 1186 - 1191
  • [24] Ensemble Learning Based on Hybrid Deep Learning Model for Heart Disease Early Prediction
    Almulihi, Ahmed
    Saleh, Hager
    Hussien, Ali Mohamed
    Mostafa, Sherif
    El-Sappagh, Shaker
    Alnowaiser, Khaled
    Ali, Abdelmgeid A.
    Refaat Hassan, Moatamad
    DIAGNOSTICS, 2022, 12 (12)
  • [25] Deep learning-based sentiment classification of evaluative text based on Multi-feature fusion
    Abdi, Asad
    Shamsuddin, Siti Mariyam
    Hasan, Shafaatunnur
    Piran, Jalil
    INFORMATION PROCESSING & MANAGEMENT, 2019, 56 (04) : 1245 - 1259
  • [26] Multi-feature output deep network ensemble learning for face recognition and verification
    Chaorong Li
    Signal, Image and Video Processing, 2024, 18 : 793 - 802
  • [27] Multi-feature output deep network ensemble learning for face recognition and verification
    Li, Chaorong
    SIGNAL IMAGE AND VIDEO PROCESSING, 2024, 18 (01) : 793 - 802
  • [28] A deep learning multi-feature based fusion model for predicting the state of health of lithium-ion batteries
    Sonthalia, Ankit
    Josephin, J. S. Femilda
    Varuvel, Edwin Geo
    Chinnathambi, Arunachalam
    Subramanian, Thiyagarajan
    Kiani, Farzad
    ENERGY, 2025, 317
  • [29] SMASH: A Malware Detection Method Based on Multi-Feature Ensemble Learning
    Dai, Yusheng
    Li, Hui
    Qian, Yekui
    Yang, Ruipeng
    Zheng, Min
    IEEE ACCESS, 2019, 7 : 112588 - 112597
  • [30] Epileptic Seizure Prediction Using Deep Neural Networks Via Transfer Learning and Multi-Feature Fusion
    Yu, Zuyi
    Albera, Laurent
    Jeannes, Regine Le Bouquin
    Kachenoura, Amar
    Karfoul, Ahmad
    Yang, Chunfeng
    Shu, Huazhong
    INTERNATIONAL JOURNAL OF NEURAL SYSTEMS, 2022, 32 (07)