A new COVID-19 detection method from human genome sequences using CpG island features and KNN classifier

被引:45
|
作者
Arslan, Hilal [1 ]
Arslan, Hasan [2 ]
机构
[1] Izmir Bakircay Univ, Dept Comp Engn, Izmir, Turkey
[2] Erciyes Univ, Dept Math, Kayseri, Turkey
关键词
COVID-19; SARS-CoV-2; K-Nearest Neighbors; CpG islands; Human coronaviruses;
D O I
10.1016/j.jestch.2020.12.026
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Various viral epidemics have been detected such as the severe acute respiratory syndrome coronavirus and the Middle East respiratory syndrome coronavirus in the last two decades. The coronavirus disease 2019 (COVID-19) is a pandemic caused by a novel betacoronavirus called severe acute respiratory syndrome coronavirus-2 (SARS-CoV-2). After the rapid spread of COVID-19, many researchers have investigated diagnosis and treatment for this terrifying disease quickly. Identifying COVID-19 from the other types of coronaviruses is a difficult problem due to their genetic similarity. In this study, we propose a new efficient COVID-19 detection method based on the K-nearest neighbors (KNN) classifier using the complete genome sequences of human coronaviruses in the dataset recorded in 2019 Novel Coronavirus Resource. We also describe two features based on CpG island that efficiently detect COVID-19 cases. Thus, genome sequences including approximately 30,000 nucleotides can be represented by only two real numbers. The KNN method is a simple and effective non-parametric technique for solving classification problems. However, performance of the KNN depends on the distance measure used. We perform 19 distance metrics investigated in five categories to improve the performance of the KNN algorithm. Some efficient performance parameters are computed to evaluate the proposed method. The proposed method achieves 98.4% precision, 99.2% recall, 98.8% F-measure, and 98.4% accuracy in a few seconds when any L1 type metric is used as a distance measure in the KNN. (c) 2020 Karabuk University. Publishing services by Elsevier B.V. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
引用
收藏
页码:839 / 847
页数:9
相关论文
共 50 条
  • [1] A new Covid-19 diagnosis strategy using a modified KNN classifier
    Rabie, Asmaa H.
    Mohamed, Alaa M.
    Abo-Elsoud, M. A.
    Saleh, Ahmed I.
    NEURAL COMPUTING & APPLICATIONS, 2023, 35 (23): : 17349 - 17373
  • [2] A new Covid-19 diagnosis strategy using a modified KNN classifier
    Asmaa H. Rabie
    Alaa M. Mohamed
    M. A. Abo-Elsoud
    Ahmed I. Saleh
    Neural Computing and Applications, 2023, 35 : 17349 - 17373
  • [3] GaussianCpG: a Gaussian model for detection of CpG island in human genome sequences
    Yu, Ning
    Guo, Xuan
    Zelikovsky, Alexander
    Pan, Yi
    BMC GENOMICS, 2017, 18
  • [4] GaussianCpG: a Gaussian model for detection of CpG island in human genome sequences
    Ning Yu
    Xuan Guo
    Alexander Zelikovsky
    Yi Pan
    BMC Genomics, 18
  • [5] A new COVID-19 Patients Detection Strategy (CPDS) based on hybrid feature selection and enhanced KNN classifier
    Shaban, Warda M.
    Rabie, Asmaa H.
    Saleh, Ahmed, I
    Abo-Elsoud, M. A.
    KNOWLEDGE-BASED SYSTEMS, 2020, 205 (205)
  • [6] DETECTION AND CLASSIFICATION OF COVID-19 USING GRAY-LEVEL FEATURES AND ENSEMBLE CLASSIFIER
    Patnaik, Vijaya
    Mohanty, Monalisa
    Subudhi, Asit Kumar
    BIOMEDICAL ENGINEERING-APPLICATIONS BASIS COMMUNICATIONS, 2024, 36 (05):
  • [7] The Most Redundant Sequences in Human CpG Island Library Are Derived from Mitochondrial Genome
    Ximiao He1
    2Graduate University of Chinese Academy of Sciences
    3Department of Biology
    Genomics, Proteomics & Bioinformatics, 2010, (02) : 81 - 91
  • [8] Classifier Fusion for Detection of COVID-19 from CT Scans
    Taranjit Kaur
    Tapan Kumar Gandhi
    Circuits, Systems, and Signal Processing, 2022, 41 : 3397 - 3414
  • [9] Classifier Fusion for Detection of COVID-19 from CT Scans
    Kaur, Taranjit
    Gandhi, Tapan Kumar
    CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2022, 41 (06) : 3397 - 3414
  • [10] CpG Island microarray probe sequences derived from a physical library are representative of CpG Islands annotated on the human genome
    Heisler, LE
    Torti, D
    Boutros, PC
    Watson, J
    Chan, C
    Winegarden, N
    Takahashi, M
    Yau, P
    Huang, THM
    Farnham, PJ
    Jurisica, I
    Woodgett, JR
    Bremner, R
    Penn, LZ
    Der, SD
    NUCLEIC ACIDS RESEARCH, 2005, 33 (09) : 2952 - 2961