Computational prediction of disease related lncRNAs using machine learning

被引:3
|
作者
Khalid, Razia [1 ]
Naveed, Hammad [1 ]
Khalid, Zoya [2 ]
机构
[1] Natl Univ Comp & Emerging Sci, Dept Comp Sci, Computat Biol Res Lab, NUCES FAST, Islamabad, Pakistan
[2] Quaid I Azam Univ, Natl Ctr Bioinformat NCB, Islamabad, Pakistan
关键词
LONG NONCODING RNAS; DATABASE; PROTEIN; HOTAIR; GENOME;
D O I
10.1038/s41598-023-27680-7
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Long non-coding RNAs (lncRNAs), which were once considered as transcriptional noise, are now in the limelight of current research. LncRNAs play a major role in regulating various biological processes such as imprinting, cell differentiation, and splicing. The mutations of lncRNAs are involved in various complex diseases. Identifying lncRNA-disease associations has gained a lot of attention as predicting it efficiently will lead towards better disease treatment. In this study, we have developed a machine learning model that predicts disease-related lncRNAs by combining sequence and structure-based features. The features were trained on SVM and Random Forest classifiers. We have compared our method with the state-of-the-art and obtained the highest F1 score of 76% on SVM classifier. Moreover, this study has overcome two serious limitations of the reported method which are lack of redundancy checking and implementation of oversampling for balancing the positive and negative class. Our method has achieved improved performance among machine learning models reported for lncRNA-disease associations. Combining multiple features together specifically lncRNAs sequence mutation has a significant contribution to the disease related lncRNA prediction.
引用
收藏
页数:7
相关论文
共 50 条
  • [1] Computational prediction of disease related lncRNAs using machine learning
    Razia Khalid
    Hammad Naveed
    Zoya Khalid
    [J]. Scientific Reports, 13
  • [2] Computational Learning Model for Prediction of Heart Disease Using Machine Learning Based on a New Regularizer
    Albahr, Abdulaziz
    Albahar, Marwan
    Thanoon, Mohammed
    Binsawad, Muhammad
    [J]. COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2021, 2021
  • [3] Disease Prediction using Machine Learning
    Dubey, Subham
    Banik, Sreerupa
    Ghosh, Deba
    Dey, Akash
    Das, Rishabh
    Dey, Ipsita
    Chowdhury, Sagarika
    Dey, Prianka
    [J]. 2024 2nd World Conference on Communication and Computing, WCONF 2024, 2024,
  • [4] OBESITY RELATED DISEASE PREDICTION FROM HEALTHCARE COMMUNITIES USING MACHINE LEARNING
    Pereira, Naomi Christianne
    D'souza, Jessica
    Rana, Parth
    Solaskar, Supriya
    [J]. 2019 10TH INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATION AND NETWORKING TECHNOLOGIES (ICCCNT), 2019,
  • [5] Prediction of Heart Disease Using Machine Learning
    Begum, M. Asma
    Abirami, S.
    Anandhi, R.
    Dhivyadharshini, K.
    Devi, R. Ganga
    [J]. BIOSCIENCE BIOTECHNOLOGY RESEARCH COMMUNICATIONS, 2020, 13 (04): : 39 - 42
  • [6] Prediction of atherosclerotic disease progression combining computational modelling with machine learning
    Sakellarios, Antonis I.
    Pezoulas, Vasileios C.
    Bourantas, Christos
    Naka, Katerina K.
    Michalis, Lampros K.
    Serruys, Patrick W.
    Stone, Gregg
    Garcia-Garcia, Hector M.
    Fotiadis, Dimitrios I.
    [J]. 42ND ANNUAL INTERNATIONAL CONFERENCES OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY: ENABLING INNOVATIVE TECHNOLOGIES FOR GLOBAL HEALTHCARE EMBC'20, 2020, : 2760 - 2763
  • [7] Computational prediction of RNA tertiary structures using machine learning methods*
    Huang, Bin
    Du, Yuanyang
    Zhang, Shuai
    Li, Wenfei
    Wang, Jun
    Zhang, Jian
    [J]. CHINESE PHYSICS B, 2020, 29 (10)
  • [8] Computational Prediction of lncRNA-Protein Interactions using Machine learning
    Mushtaq, Muhammad
    Naveed, Hammad
    Khalid, Zoya
    [J]. 2021 43RD ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE & BIOLOGY SOCIETY (EMBC), 2021, : 2100 - 2103
  • [9] Computational prediction of RNA tertiary structures using machine learning methods
    黄斌
    杜渊洋
    张帅
    李文飞
    王骏
    张建
    [J]. Chinese Physics B, 2020, 29 (10) : 31 - 37
  • [10] Diabetes Disease Prediction Using Machine Learning Algorithms
    Lyngdoh, Arwatki Chen
    Choudhury, Nurul Amin
    Moulik, Soumen
    [J]. 2020 IEEE-EMBS CONFERENCE ON BIOMEDICAL ENGINEERING AND SCIENCES (IECBES 2020): LEADING MODERN HEALTHCARE TECHNOLOGY ENHANCING WELLNESS, 2021, : 517 - 521