Similarity-based clustering of sequences using hidden Markov models

Cited by: 0
Authors
Bicego, M
Murino, V
Figueiredo, MAT
Affiliations
[1] Univ Verona, Dipartimento Informat, I-37134 Verona, Italy
[2] Inst Super Tecn, Inst Telecomun, P-1049001 Lisbon, Portugal
Keywords
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Hidden Markov models (HMMs) are a widely used tool for modelling sequential data; nevertheless, their use in clustering has received little attention. This paper proposes a novel scheme for HMM-based clustering of sequential data, inspired by the similarity-based paradigm recently introduced in the supervised learning context. With this approach, a new representation space is built in which each object is described by the vector of its similarities to a predetermined set of other objects. These similarities are determined using hidden Markov models, and clustering is then performed in that space. In this way, the difficult problem of clustering sequences is transposed to a more manageable one: the clustering of points (feature vectors). Experimental evaluation on synthetic and real data shows that the proposed approach largely outperforms standard HMM-based clustering schemes.
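The pipeline described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation and rests on several assumptions: the reference HMMs are hand-specified here instead of being trained on reference sequences, the similarity is taken to be the length-normalised log-likelihood of a sequence under each reference model (one common choice), and the clustering step is a plain k-means with a deterministic farthest-point initialisation. All names (`MODEL_A`, `MODEL_B`, `similarity_vector`, `kmeans`) and the toy sequences are hypothetical.

```python
import math


def log_likelihood(seq, start, trans, emit):
    """Scaled forward algorithm for a discrete HMM: returns log P(seq | model)."""
    n = len(start)
    alpha = [start[i] * emit[i][seq[0]] for i in range(n)]
    scale = sum(alpha)
    loglik = math.log(scale)
    alpha = [a / scale for a in alpha]
    for symbol in seq[1:]:
        alpha = [
            sum(alpha[i] * trans[i][j] for i in range(n)) * emit[j][symbol]
            for j in range(n)
        ]
        scale = sum(alpha)  # rescale at each step to avoid numerical underflow
        loglik += math.log(scale)
        alpha = [a / scale for a in alpha]
    return loglik


def similarity_vector(seq, models):
    """Describe a sequence by its per-symbol log-likelihood under each reference HMM."""
    return [log_likelihood(seq, *model) / len(seq) for model in models]


def sq_dist(p, q):
    return sum((a - b) ** 2 for a, b in zip(p, q))


def kmeans(points, k, iters=20):
    """Plain k-means on feature vectors, with farthest-point initialisation."""
    centres = [points[0]]
    while len(centres) < k:
        centres.append(max(points, key=lambda p: min(sq_dist(p, c) for c in centres)))
    labels = []
    for _ in range(iters):
        labels = [min(range(k), key=lambda c: sq_dist(p, centres[c])) for p in points]
        for c in range(k):
            members = [p for p, lab in zip(points, labels) if lab == c]
            if members:
                centres[c] = [sum(col) / len(members) for col in zip(*members)]
    return labels


# Hypothetical reference HMMs over the alphabet {0, 1}: (start, transition, emission).
# In a real pipeline these would be trained on the chosen reference sequences.
TRANS = [[0.9, 0.1], [0.1, 0.9]]
MODEL_A = ([0.5, 0.5], TRANS, [[0.9, 0.1], [0.8, 0.2]])  # emits mostly 0
MODEL_B = ([0.5, 0.5], TRANS, [[0.1, 0.9], [0.2, 0.8]])  # emits mostly 1

# Toy sequences: the first two are mostly 0, the last two mostly 1.
sequences = [
    [0, 0, 0, 1, 0, 0, 0, 0, 0, 0],
    [0, 0, 0, 0, 0, 1, 0, 0, 0, 0],
    [1, 1, 1, 0, 1, 1, 1, 1, 1, 1],
    [1, 1, 0, 1, 1, 1, 1, 1, 1, 1],
]

# Step 1: map every sequence to a point in the similarity space.
features = [similarity_vector(s, [MODEL_A, MODEL_B]) for s in sequences]
# Step 2: cluster the points instead of the sequences themselves.
labels = kmeans(features, k=2)
print(labels)
```

The point of the design is that, once each sequence is a fixed-length vector, any standard point-clustering method can be substituted for the k-means step used here.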
Pages: 86-95
Number of pages: 10