Using alignment-free and pattern mining methods for SARS-CoV-2 genome analysis

被引:0
|
作者
M. Saqib Nawaz
Philippe Fournier-Viger
Memoona Aslam
Wenjin Li
Yulin He
Xinzheng Niu
机构
[1] Shenzhen University,College of Computer Science and Software Engineering
[2] Shenzhen University,Institute for Advanced Study
[3] Guangdong Laboratory of Artificial Intelligence and Digital Economy (SZ),School of Computer Science and Engineering
[4] University of Electronic Science and Technology of China,undefined
来源
Applied Intelligence | 2023年 / 53卷
关键词
COVID-19; SARS-CoV-2; Genome sequence; Amino acids; Alignment-free; Sequential pattern mining; Mutation;
D O I
暂无
中图分类号
学科分类号
摘要
Examining the genome sequences of the SARS-CoV-2 virus, that causes the respiratory disease known as coronavirus disease 2019 (COVID-19), play important role in the proper understanding of this virus, its main characteristics and functionalities. This paper investigates the use of alignment-free (AF) sequence analysis and sequential pattern mining (SPM) to analyze SARS-CoV-2 genome sequences and learn interesting information about them respectively. AF methods are used to find (dis)similarity in the genome sequences of SARS-CoV-2 by using various distance measures, to compare the performance of these measures and to construct the phylogenetic trees. SPM algorithms are used to discover frequent amino acid patterns and their relationship with each other and to predict the amino acid(s) by using various sequence-based prediction models. In last, an algorithm is proposed to analyze mutation in genome sequences. The algorithm finds the locations for changed amino acid(s) in the genome sequences and computes the mutation rate. From obtained results, it is found that that both AF and SPM methods can be used to discover interesting information/patterns in SARS-CoV-2 genome sequences for examining the variations and evolution among strains.
引用
收藏
页码:21920 / 21943
页数:23
相关论文
共 50 条
  • [1] Using alignment-free and pattern mining methods for SARS-CoV-2 genome analysis
    Nawaz, M. Saqib
    Fournier-Viger, Philippe
    Aslam, Memoona
    Li, Wenjin
    He, Yulin
    Niu, Xinzheng
    APPLIED INTELLIGENCE, 2023, 53 (19) : 21920 - 21943
  • [2] Alignment-free genome analysis of SARS-CoV-2 using Machine learning.
    Randhawa, G. S.
    Soltysiak, M. P. M.
    Roz, H. E. L.
    de Souza, C. P. E.
    Hill, K. A.
    Kari, L.
    ENVIRONMENTAL AND MOLECULAR MUTAGENESIS, 2020, 61 : 55 - 55
  • [3] An updated analysis of variations in SARS-CoV-2 genome
    Ugurel, Osman Mutluhan
    Ata, Oguz
    Turgut-Balik, Dilek
    TURKISH JOURNAL OF BIOLOGY, 2020, 44 (03) : 157 - 167
  • [4] Analysis of SARS-CoV-2 genome evolutionary patterns
    Gupta, Shubhangi
    Gupta, Deepanshu
    Bhatnagar, Sonika
    MICROBIOLOGY SPECTRUM, 2024, 12 (02):
  • [5] Pattern recognition and probabilistic measures in alignment-free sequence analysis
    Schwende, Isabel
    Pham, Tuan D.
    BRIEFINGS IN BIOINFORMATICS, 2014, 15 (03) : 354 - 368
  • [6] A fractal scaling analysis of the SARS-CoV-2 genome sequence
    Meraz, M.
    Vernon-Carter, E. J.
    Rodriguez, E.
    Alvarez-Ramirez, J.
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2022, 73
  • [7] Cloning of a Passage-Free SARS-CoV-2 Genome and Mutagenesis Using Red Recombination
    Herrmann, Alexandra
    Jungnickl, Doris
    Cordsmeier, Arne
    Peter, Antonia Sophia
    Ueberla, Klaus
    Ensser, Armin
    INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2021, 22 (19)
  • [8] Assembly-Free and Alignment-Free Sample Identification Using Genome Skims
    Sarmashghi, Shahab
    Bohmann, Kristine
    Gilbert, M. Thomas P.
    Bafna, Vineet
    Mirarab, Siavash
    RESEARCH IN COMPUTATIONAL MOLECULAR BIOLOGY, RECOMB 2018, 2018, 10812 : 276 - 277
  • [9] SARS-CoV-2 Detection Methods
    Lino, Alexandra
    Cardoso, Marita A.
    Goncalves, Helena M. R.
    Martins-Lopes, Paula
    CHEMOSENSORS, 2022, 10 (06)
  • [10] Dissimilarities in alignment-free methods for phylogenetic analysis based on genomes
    Yu, Zu-Guo (yuzuguo@aliyun.com), 1600, Bentham Science Publishers (09):