POSMM: an efficient alignment-free metagenomic profiler that complements alignment-based profiling

被引:0
|
作者
David J. Burks
Vaidehi Pusadkar
Rajeev K. Azad
机构
[1] University of North Texas,Department of Biological Sciences and BioDiscovery Institute
[2] University of North Texas,Department of Mathematics
来源
Environmental Microbiome | / 18卷
关键词
Metagenomes; Microbiome; Taxonomic classification; Markov model; Sequence alignment;
D O I
暂无
中图分类号
学科分类号
摘要
We present here POSMM (pronounced ‘Possum’), Python-Optimized Standard Markov Model classifier, which is a new incarnation of the Markov model approach to metagenomic sequence analysis. Built on the top of a rapid Markov model based classification algorithm SMM, POSMM reintroduces high sensitivity associated with alignment-free taxonomic classifiers to probe whole genome or metagenome datasets of increasingly prohibitive sizes. Logistic regression models generated and optimized using the Python sklearn library, transform Markov model probabilities to scores suitable for thresholding. Featuring a dynamic database-free approach, models are generated directly from genome fasta files per run, making POSMM a valuable accompaniment to many other programs. By combining POSMM with ultrafast classifiers such as Kraken2, their complementary strengths can be leveraged to produce higher overall accuracy in metagenomic sequence classification than by either as a standalone classifier. POSMM is a user-friendly and highly adaptable tool designed for broad use by the metagenome scientific community.
引用
收藏
相关论文
共 50 条
  • [1] POSMM: an efficient alignment-free metagenomic profiler that complements alignment-based profiling
    Burks, David J.
    Pusadkar, Vaidehi
    Azad, Rajeev K.
    ENVIRONMENTAL MICROBIOME, 2023, 18 (01)
  • [2] Alignment-free methods for metagenomic profiling
    Gao, Shanshan
    Diem-Trang Pham
    Vinhthuy Phan
    BMC BIOINFORMATICS, 2015, 16
  • [3] Alignment-free methods for metagenomic profiling
    Shanshan Gao
    Diem-Trang Pham
    Vinhthuy Phan
    BMC Bioinformatics, 16
  • [4] Metalign: efficient alignment-based metagenomic profiling via containment min hash
    Nathan LaPierre
    Mohammed Alser
    Eleazar Eskin
    David Koslicki
    Serghei Mangul
    Genome Biology, 21
  • [5] Metalign: efficient alignment-based metagenomic profiling via containment min hash
    LaPierre, Nathan
    Alser, Mohammed
    Eskin, Eleazar
    Koslicki, David
    Mangul, Serghei
    GENOME BIOLOGY, 2020, 21 (01)
  • [6] Prediction of Plant Resistance Proteins Using Alignment-Based and Alignment-Free Approaches
    Gahlot, Pushpendra Singh
    Choudhury, Shubham
    Bajiya, Nisha
    Kumar, Nishant
    Raghava, Gajendra P. S.
    PROTEOMICS, 2025, 25 (5-6)
  • [7] Handling the Alignment for Wake Word Detection: A Comparison Between Alignment-Based, Alignment-Free and Hybrid Approaches
    Ribeiro, Vinicius
    Huang, Yiteng
    Yuan Shangguan
    Yang, Zhaojun
    Wan, Li
    Sun, Ming
    INTERSPEECH 2023, 2023, : 5366 - 5370
  • [8] Integrating alignment-based and alignment-free sequence similarity measures for biological sequence classification
    Borozan, Ivan
    Watt, Stuart
    Ferretti, Vincent
    BIOINFORMATICS, 2015, 31 (09) : 1396 - 1404
  • [9] Towards Selective-Alignment: Bridging the Accuracy Gap between Alignment-Based and Alignment-Free Transcript Quantification
    Sarkar, Hirak
    Zakeri, Mohsen
    Malik, Laraib
    Patro, Rob
    ACM-BCB'18: PROCEEDINGS OF THE 2018 ACM INTERNATIONAL CONFERENCE ON BIOINFORMATICS, COMPUTATIONAL BIOLOGY, AND HEALTH INFORMATICS, 2018, : 27 - 36
  • [10] Comparative analysis of alignment-free genome clustering and whole genome alignment-based phylogenomic relationship of coronaviruses
    Kirichenko, Anastasiya D.
    Poroshina, Anastasiya A.
    Sherbakov, Dmitry Yu
    Sadovsky, Michael G.
    Krutovsky, Konstantin, V
    PLOS ONE, 2022, 17 (03):