Entropy Based Clustering of Viral Sequences

Times cited: 1
Authors
Juyal, Akshay [1]
Hosseini, Roya [1]
Novikov, Daniel [1]
Grinshpon, Mark [2]
Zelikovsky, Alex [1]
Affiliations
[1] Georgia State Univ, Dept Comp Sci, Atlanta, GA 30303 USA
[2] Georgia State Univ, Dept Math & Stat, Atlanta, GA 30303 USA
Source
BIOINFORMATICS RESEARCH AND APPLICATIONS, ISBRA 2022 | 2022, Vol. 13760
Keywords
Categorical data; Clustering; Entropy; Monte Carlo algorithm; Viral genomic sequences; TRANSMISSIONS; VARIANTS; DESIGN; AIDS
DOI
10.1007/978-3-031-23198-8_33
Chinese Library Classification
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Clustering viral sequences allows us to characterize the composition and structure of intrahost and interhost viral populations, which play a crucial role in disease progression and epidemic spread. In this paper, we propose and validate a new entropy-based method for clustering aligned viral sequences considered as categorical data. The method finds a homogeneous clustering by minimizing information entropy rather than the distance between sequences in the same cluster. We have applied our entropy-based clustering method to SARS-CoV-2 viral sequencing data. We report the information content extracted from the sequences by entropy-based clustering. Our method converges to similar minimum-entropy clusterings across different runs and limited permutations of the data. We also show that a parallelized version of our tool is scalable to very large SARS-CoV-2 datasets.
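
To make the idea of minimizing entropy instead of pairwise distance concrete, here is a minimal Python sketch. It is not the authors' tool: the abstract does not specify the objective or the Monte Carlo moves, so the size-weighted sum of per-column Shannon entropies and the "move one sequence, keep the move if entropy does not increase" rule are assumptions made for illustration, and all function names are hypothetical.

```python
import math
import random
from collections import Counter


def cluster_entropy(seqs):
    """Sum of per-column Shannon entropies (in bits) of aligned sequences."""
    if not seqs:
        return 0.0
    n = len(seqs)
    total = 0.0
    # zip(*seqs) walks the alignment column by column.
    for column in zip(*seqs):
        for count in Counter(column).values():
            p = count / n
            total -= p * math.log2(p)
    return total


def clustering_entropy(seqs, labels, k):
    """Assumed objective: cluster entropies weighted by cluster size."""
    total = 0.0
    for c in range(k):
        members = [s for s, lab in zip(seqs, labels) if lab == c]
        total += len(members) * cluster_entropy(members)
    return total


def monte_carlo_cluster(seqs, k, steps=2000, seed=0):
    """Randomized local search: reassign one sequence at a time and keep
    the move only if the total entropy does not increase."""
    rng = random.Random(seed)
    labels = [rng.randrange(k) for _ in seqs]
    best = clustering_entropy(seqs, labels, k)
    for _ in range(steps):
        i = rng.randrange(len(seqs))
        old = labels[i]
        labels[i] = rng.randrange(k)
        candidate = clustering_entropy(seqs, labels, k)
        if candidate <= best:
            best = candidate
        else:
            labels[i] = old  # reject the move
    return labels, best


if __name__ == "__main__":
    toy = ["AAAA", "AAAA", "AAAT", "CCCC", "CCCC", "CCCG"]
    labels, entropy = monte_carlo_cluster(toy, k=2)
    print(labels, round(entropy, 3))
```

Recomputing the full objective at every step keeps the sketch short but costs time proportional to the dataset size per move; a practical implementation would update per-column counts incrementally, and a search like this can stop in a local minimum, which is one reason to compare results across several random seeds.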
Pages: 369-380
Page count: 12