BoT-Net: a lightweight bag of tricks-based neural network for efficient LncRNA-miRNA interaction prediction

被引:3
|
作者
Asim, Muhammad Nabeel [1 ,2 ]
Ibrahim, Muhammad Ali [1 ,2 ]
Zehe, Christoph [3 ]
Trygg, Johan [3 ,4 ]
Dengel, Andreas [1 ,2 ]
Ahmed, Sheraz [2 ,4 ]
机构
[1] Tech Univ Kaiserslautern, Dept Comp Sci, D-67663 Kaiserslautern, Rhineland Palat, Germany
[2] German Res Ctr Artificial Intelligence GmbH, D-67663 Kaiserslautern, Rhineland Palat, Germany
[3] Sartorius Stedim Cellca GmbH, D-88471 Laupheim, Baden Wurttembe, Germany
[4] Umea Univ, Computat Life Sci Cluster CLiC, S-90187 Umea, Sweden
关键词
Deep learning; Long non-coding RNA; Micro-RNA; Bag of tricks; Deep learning strategies; Robust interaction predictor; lncRNA-miRNA interaction prediction; Lightweight neural network; PROTEIN INTERACTIONS; DROPOUT; MODEL;
D O I
10.1007/s12539-022-00535-x
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Background and objective: Interactions of long non-coding ribonucleic acids (lncRNAs) with micro-ribonucleic acids (miRNAs) play an essential role in gene regulation, cellular metabolic, and pathological processes. Existing purely sequence based computational approaches lack robustness and efficiency mainly due to the high length variability of lncRNA sequences. Hence, the prime focus of the current study is to find optimal length trade-offs between highly flexible length lncRNA sequences. Method The paper at hand performs in-depth exploration of diverse copy padding, sequence truncation approaches, and presents a novel idea of utilizing only subregions of lncRNA sequences to generate fixed-length lncRNA sequences. Furthermore, it presents a novel bag of tricks-based deep learning approach "Bot-Net" which leverages a single layer long-short-term memory network regularized through DropConnect to capture higher order residue dependencies, pooling to retain most salient features, normalization to prevent exploding and vanishing gradient issues, learning rate decay, and dropout to regularize precise neural network for lncRNA-miRNA interaction prediction. Results BoT-Net outperforms the state-of-the-art lncRNA-miRNA interaction prediction approach by 2%, 8%, and 4% in terms of accuracy, specificity, and matthews correlation coefficient. Furthermore, a case study analysis indicates that BoTNet also outperforms state-of-the-art lncRNA-protein interaction predictor on a benchmark dataset by accuracy of 10%, sensitivity of 19%, specificity of 6%, precision of 14%, and matthews correlation coefficient of 26%. Conclusion In the benchmark lncRNA-miRNA interaction prediction dataset, the length of the lncRNA sequence varies from 213 residues to 22,743 residues and in the benchmark lncRNA-protein interaction prediction dataset, lncRNA sequences vary from 15 residues to 1504 residues. For such highly flexible length sequences, fixed length generation using copy padding introduces a significant level of bias which makes a large number of lncRNA sequences very much identical to each other and eventually derail classifier generalizeability. Empirical evaluation reveals that within 50 residues of only the starting region of long lncRNA sequences, a highly informative distribution for lncRNA-miRNA interaction prediction is contained, a crucial finding exploited by the proposed BoT-Net approach to optimize the lncRNA fixed length generation process. [GRAPHICS] .
引用
收藏
页码:841 / 862
页数:22
相关论文
共 19 条
  • [1] BoT-Net: a lightweight bag of tricks-based neural network for efficient LncRNA–miRNA interaction prediction
    Muhammad Nabeel Asim
    Muhammad Ali Ibrahim
    Christoph Zehe
    Johan Trygg
    Andreas Dengel
    Sheraz Ahmed
    Interdisciplinary Sciences: Computational Life Sciences, 2022, 14 : 841 - 862
  • [2] Multi-view graph neural network with cascaded attention for lncRNA-miRNA interaction prediction
    Li, Hui
    Wu, Bin
    Sun, Miaomiao
    Ye, Yangdong
    Zhu, Zhenfeng
    Chen, Kuisheng
    KNOWLEDGE-BASED SYSTEMS, 2023, 268
  • [3] Graph embedding ensemble methods based on the heterogeneous network for lncRNA-miRNA interaction prediction
    Zhao, Chengshuai
    Qiu, Yang
    Zhou, Shuang
    Liu, Shichao
    Zhang, Wen
    Niu, Yanqing
    BMC GENOMICS, 2020, 21 (Suppl 13)
  • [4] Graph embedding ensemble methods based on the heterogeneous network for lncRNA-miRNA interaction prediction
    Chengshuai Zhao
    Yang Qiu
    Shuang Zhou
    Shichao Liu
    Wen Zhang
    Yanqing Niu
    BMC Genomics, 21
  • [5] Plant lncRNA-miRNA Interaction Prediction Based on Counterfactual Heterogeneous Graph Attention Network
    He, Yu
    Ning, ZiLan
    Zhu, XingHui
    Zhang, YinQiong
    Liu, ChunHai
    Jiang, SiWei
    Yuan, ZheMing
    Zhang, HongYan
    INTERDISCIPLINARY SCIENCES-COMPUTATIONAL LIFE SCIENCES, 2024,
  • [6] Predicting lncRNA-miRNA interactions based on interactome network and graphlet interaction
    Zhang, Li
    Liu, Ting
    Chen, Haoyu
    Zhao, Qi
    Liu, Hongsheng
    GENOMICS, 2021, 113 (03) : 874 - 880
  • [7] Heterogeneous graph inference based on similarity network fusion for predicting lncRNA-miRNA interaction
    Fan, Yongxian
    Cui, Juan
    Zhu, QingQi
    RSC ADVANCES, 2020, 10 (20) : 11634 - 11642
  • [8] LncRNA-miRNA interaction prediction from the heterogeneous network through graph embedding ensemble learning
    Zhou, Shuang
    Yue, Xiang
    Xu, Xinran
    Liu, Shichao
    Zhang, Wen
    Niu, Yanqing
    2019 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2019, : 622 - 627
  • [9] Prediction of lncRNA-miRNA interaction based on sequence and structural information of potential binding site
    Qi, Danyang
    Wu, Chengyan
    Hao, Zhihong
    Zhang, Zheng
    Liu, Li
    INTERNATIONAL JOURNAL OF BIOLOGICAL MACROMOLECULES, 2025, 307
  • [10] Sequence pre-training-based graph neural network for predicting lncRNA-miRNA associations
    Wang, Zixiao
    Liang, Shiyang
    Liu, Siwei
    Meng, Zhaohan
    Wang, Jingjie
    Liang, Shangsong
    BRIEFINGS IN BIOINFORMATICS, 2023, 24 (05)