POSMM: an efficient alignment-free metagenomic profiler that complements alignment-based profiling

被引:0
|
作者
David J. Burks
Vaidehi Pusadkar
Rajeev K. Azad
机构
[1] University of North Texas,Department of Biological Sciences and BioDiscovery Institute
[2] University of North Texas,Department of Mathematics
来源
Environmental Microbiome | / 18卷
关键词
Metagenomes; Microbiome; Taxonomic classification; Markov model; Sequence alignment;
D O I
暂无
中图分类号
学科分类号
摘要
We present here POSMM (pronounced ‘Possum’), Python-Optimized Standard Markov Model classifier, which is a new incarnation of the Markov model approach to metagenomic sequence analysis. Built on the top of a rapid Markov model based classification algorithm SMM, POSMM reintroduces high sensitivity associated with alignment-free taxonomic classifiers to probe whole genome or metagenome datasets of increasingly prohibitive sizes. Logistic regression models generated and optimized using the Python sklearn library, transform Markov model probabilities to scores suitable for thresholding. Featuring a dynamic database-free approach, models are generated directly from genome fasta files per run, making POSMM a valuable accompaniment to many other programs. By combining POSMM with ultrafast classifiers such as Kraken2, their complementary strengths can be leveraged to produce higher overall accuracy in metagenomic sequence classification than by either as a standalone classifier. POSMM is a user-friendly and highly adaptable tool designed for broad use by the metagenome scientific community.
引用
收藏
相关论文
共 50 条
  • [21] DectICO: an alignment-free supervised metagenomic classification method based on feature extraction and dynamic selection
    Xiao Ding
    Fudong Cheng
    Changchang Cao
    Xiao Sun
    BMC Bioinformatics, 16
  • [22] DectICO: an alignment-free supervised metagenomic classification method based on feature extraction and dynamic selection
    Ding, Xiao
    Cheng, Fudong
    Cao, Changchang
    Sun, Xiao
    BMC BIOINFORMATICS, 2015, 16
  • [23] Alignment-free homology detection
    Tang, Lin
    NATURE METHODS, 2024, 21 (10) : 1785 - 1785
  • [24] An alignment-free test for recombination
    Haubold, Bernhard
    Krause, Linda
    Horn, Thomas
    Pfaffelhuber, Peter
    BIOINFORMATICS, 2013, 29 (24) : 3121 - 3127
  • [25] Local Binary Patterns as a Feature Descriptor in Alignment-Free Visualisation of Metagenomic Data
    Kouchaki, Samaneh
    Tirunagari, Santosh
    Tapinos, Avraam
    Robertson, David L.
    PROCEEDINGS OF 2016 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (SSCI), 2016,
  • [26] REACH: Researching Efficient Alignment-based Conformance Checking
    Casas-Ramos, Jacobo
    Mucientes, Manuel
    Lama, Manuel
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 241
  • [27] Alignment-Free Phylogenetic Reconstruction
    Daskalakis, Constantinos
    Roch, Sebastien
    RESEARCH IN COMPUTATIONAL MOLECULAR BIOLOGY, PROCEEDINGS, 2010, 6044 : 123 - +
  • [28] Alignment-Free Population Genomics: An Efficient Estimator of Sequence Diversity
    Haubold, Bernhard
    Pfaffelhuber, Peter
    G3-GENES GENOMES GENETICS, 2012, 2 (08): : 883 - 889
  • [29] Alignment-based and alignment-free methods converge with experimental data on amino acids coded by stop codons at split between nuclear and mitochondrial genetic codes
    Seligmann, Herve
    BIOSYSTEMS, 2018, 167 : 33 - 46
  • [30] Generating Minimal Models of H1N1 NS1 Gene Sequences Using Alignment-Based and Alignment-Free Algorithms
    Fang, Meng
    Xu, Jiawei
    Sun, Nan
    Yau, Stephen S. -T.
    GENES, 2023, 14 (01)