Classification of DNA Sequence Using Machine Learning

被引:0
|
作者
Kanumalli, Satya Sandeep [1 ]
Swathi, S. [1 ]
Sukanya, K. [1 ]
Yamini, V. [1 ]
Nagalakshmi, N. [1 ]
机构
[1] Vignans Nirula Inst Technol & Sci Women, CSE Dept, Guntur, Andhra Pradesh, India
关键词
Machine learning; DNA sequencing; AdaBoost algorithm; Bioinformatics;
D O I
10.1007/978-981-19-3590-9_57
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In the field of medical information research, the genetic series is widely used as a component of a category. One of the applications of ML is biochemistry. Bioinformatics is an interdisciplinary science that uses computers and communication science to understand biological data. One of its most difficult tasks is to distinguish between regular genes and disease-causing genes. The classification of gene sequences into existing categories is utilized in genomic research to discover the functions of novel proteins. As a result, it is critical to identify and categorize such genes. We employ ML approaches to distinguish between infected and normal genes using classification methods. AdaBoost has a high degree of precision; relative to the bagging algorithm and Random Forest Algorithm, AdaBoost fully considers the weight of each classifier. To generate a sequence of weak classifiers, an AdaBoost-based learning approach is used to find the most 'informative' or 'discriminating' features. The identification cascade structure can also help to limit false-positive results. This study provides an overview of the mechanics of gene sequence classification using ML Techniques, including a brief introduction to bioinformatics and important challenges in DNA Sequencing with ML.
引用
收藏
页码:723 / 732
页数:10
相关论文
共 50 条
  • [21] DNA Genome Classification with Machine Learning and Image Descriptors
    Prado Cussi, Daniel
    Machaca Arceda, V. E.
    ADVANCES IN INFORMATION AND COMMUNICATION, FICC, VOL 2, 2023, 652 : 39 - 58
  • [22] Protein Sequence Classification with Improved Extreme Learning Machine Algorithms
    Cao, Jiuwen
    Xiong, Lianglin
    BIOMED RESEARCH INTERNATIONAL, 2014, 2014
  • [23] Improved Malicious Code Classification Considering Sequence by Machine Learning
    Paik, Incheon
    18TH IEEE INTERNATIONAL SYMPOSIUM ON CONSUMER ELECTRONICS (ISCE 2014), 2014,
  • [24] Large-scale machine learning for metagenomics sequence classification
    Vervier, Kevin
    Mahe, Pierre
    Tournoud, Maud
    Veyrieras, Jean-Baptiste
    Vert, Jean-Philippe
    BIOINFORMATICS, 2016, 32 (07) : 1023 - 1032
  • [25] Melanoma Classification using Machine Learning and Deep Learning
    Tran Anh Vu
    Pham Quang Son
    Dinh Nghia Hiep
    Hoang Quang Huy
    Nguyen Phan Kien
    Pham Thi Viet Huong
    2023 1ST INTERNATIONAL CONFERENCE ON HEALTH SCIENCE AND TECHNOLOGY, ICHST 2023, 2023,
  • [26] RETRACTED: Species Identification using Partial DNA Sequence: A Machine Learning Approach (Retracted Article)
    Kabir, Tasnim
    Shemonti, Abida Sanjana
    Rahman, Atif Hasan
    PROCEEDINGS 2018 IEEE 18TH INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOENGINEERING (BIBE), 2018, : 235 - 242
  • [27] Classification of Hemilabile Ligands Using Machine Learning
    Kevlishvili, Ilia
    Duan, Chenru
    Kulik, Heather J.
    JOURNAL OF PHYSICAL CHEMISTRY LETTERS, 2023, 14 (49): : 11100 - 11109
  • [28] Combat vehicle classification using Machine Learning
    Zeng, H
    Huang, J
    Liang, Y
    ADVANCES IN COMPUTER-ASSISTED RECOGNITION, 1999, 3584 : 2 - 7
  • [29] Petrofacies classification using machine learning algorithms
    Silva, Adrielle A.
    Tavares, Monica W.
    Carrasquilla, Abel
    Missagia, Roseane
    Ceia, Marco
    GEOPHYSICS, 2020, 85 (04) : WA101 - WA113
  • [30] Classification of Malicious URLs Using Machine Learning
    Abad, Shayan
    Gholamy, Hassan
    Aslani, Mohammad
    SENSORS, 2023, 23 (18)