An In-Depth Assessment of Sequence Clustering Software in Bioinformatics

被引:0
|
作者
Ju, Zhen [1 ,2 ]
Wang, Mingyu [3 ]
Li, Xuelei [1 ]
Meng, Jintao [1 ]
Xi, Wenhui [1 ]
Wei, Yanjie [1 ,4 ,5 ]
机构
[1] Chinese Acad Sci, Shenzhen Inst Adv Technol, Shenzhen 518005, Peoples R China
[2] Univ Chinese Acad Sci, Beijing 100049, Peoples R China
[3] Shanxi Med Univ, Hosp 1, Dept Neurosurg, Taiyuan 030600, Peoples R China
[4] Shenzhen Inst Adv Technol, Shenzhen Key Lab Intelligent Bioinformat, Shenzhen 518055, Peoples R China
[5] Shenzhen Univ Adv Technol, Fac Comp Sci & Control Engn, Shenzhen 518055, Peoples R China
基金
美国国家科学基金会;
关键词
Sequence clustering; Precision; Speed; Scalability; Memory consumption;
D O I
10.1007/978-981-97-5128-0_29
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Sequence clustering software is essential in bioinformatics, yet selecting the most suitable one poses a challenge due to its diverse algorithm design and targeted bioinformatics applications. This paper comprehensively reviewed the developments of most representative sequence clustering software and evaluated 8 representative software based on criteria such as precision, speed, scalability, and memory consumption. This paper divides the clustering software into four aspects: NMI scores greater than 0.95, running time less than 1min/h, 64 core acceleration exceeding 30 times, and memory consumption less than 3 times the dataset, and summarizes them into a table for user querying. Finally, taking OTU, tree of life building, and metagenomic analysis, as examples, this paper demonstrates how to analyze the requirements of scenarios for clustering software and provides recommendations for selecting the most suitable one based on evaluation results.
引用
收藏
页码:359 / 370
页数:12
相关论文
共 50 条
  • [1] Software Module Clustering: An In-Depth Literature Analysis
    Sarhan, Qusay, I
    Ahmed, Bestoun S.
    Bures, Miroslav
    Zamli, Kamal Z.
    IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 2022, 48 (06) : 1905 - 1928
  • [2] An In-Depth Security Assessment of Maritime Container Terminal Software Systems
    Eichenhofer, Joseph O.
    Heymann, Elisa
    Miller, Barton P.
    Kang, Arnold
    IEEE ACCESS, 2020, 8 (08): : 128050 - 128067
  • [3] AN IN-DEPTH LOOK AT INVASIVE SOFTWARE
    MODELL, H
    IEEE SOFTWARE, 1991, 8 (06) : 91 - 92
  • [4] Sequence clustering in bioinformatics: an empirical study
    Zou, Quan
    Lin, Gang
    Jiang, Xingpeng
    Liu, Xiangrong
    Zeng, Xiangxiang
    BRIEFINGS IN BIOINFORMATICS, 2020, 21 (01) : 1 - 10
  • [5] In-depth analysis of the adipocyte proteome by mass spectrometry and bioinformatics
    Adachi, Jun
    Kumar, Chanchal
    Zhang, Yanling
    Mann, Matthias
    MOLECULAR & CELLULAR PROTEOMICS, 2007, 6 (07) : 1257 - 1273
  • [6] Ignalina in-depth safety assessment
    Brown, RA
    Budnitz, RJ
    Butcher, P
    Reichenbach, DG
    NUCLEAR SAFETY, 1997, 38 (01): : 24 - 34
  • [7] SeqCalc: A portable bioinformatics software for sequence analysis
    Vignesh, Dhandapani
    Parameswari, Paul
    Jin, Kim Hae
    Pyo, Lim Yong
    BIOINFORMATION, 2010, 5 (03) : 85 - 88
  • [8] Does class size matter? An in-depth assessment of the effect of class size in software defect prediction
    Amjed Tahir
    Kwabena E. Bennin
    Xun Xiao
    Stephen G. MacDonell
    Empirical Software Engineering, 2021, 26
  • [9] Does class size matter? An in-depth assessment of the effect of class size in software defect prediction
    Tahir, Amjed
    Bennin, Kwabena E.
    Xiao, Xun
    MacDonell, Stephen G.
    EMPIRICAL SOFTWARE ENGINEERING, 2021, 26 (05)
  • [10] Scrum in a Software Engineering Course: An In-Depth Praxis Report
    Scharf, Andreas
    Koch, Andreas
    2013 IEEE 26TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING EDUCATION AND TRAINING (CSEE&T), 2013, : 159 - 168