Consensus Clustering Based on a New Probabilistic Rand Index with Application to Subtopic Retrieval

被引:44
|
作者
Carpineto, Claudio [1 ]
Romano, Giovanni [1 ]
机构
[1] Fdn Ugo Bordoni, I-00161 Rome, Italy
关键词
Consensus clustering; Rand index; probabilistic Rand index; search results clustering; subtopic retrieval;
D O I
10.1109/TPAMI.2012.80
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We introduce a probabilistic version of the well-known Rand Index (RI) for measuring the similarity between two partitions, called Probabilistic Rand Index (PRI), in which agreements and disagreements at the object-pair level are weighted according to the probability of their occurring by chance. We then cast consensus clustering as an optimization problem of the PRI value between a target partition and a set of given partitions, experimenting with a simple and very efficient stochastic optimization algorithm. Remarkable performance gains over input partitions as well as over existing related methods are demonstrated through a range of applications, including a new use of consensus clustering to improve subtopic retrieval.
引用
收藏
页码:2315 / 2326
页数:12
相关论文
共 50 条
  • [1] Speaker clustering based on minimum rand index
    Tsai, Wei-Ho
    Wang, Hsin-Min
    2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PTS 1-3, 2007, : 485 - +
  • [2] A Rand Index-Based Analysis of Consensus Protocols
    Roy, Sangita
    Shyamasundar, Rudrapatna K.
    PROCEEDINGS OF THE 20TH INTERNATIONAL CONFERENCE ON SECURITY AND CRYPTOGRAPHY, SECRYPT 2023, 2023, : 567 - 576
  • [3] Full-Subtopic Retrieval with Keyphrase-based Search Results Clustering
    Bernardini, Andrea
    Carpineto, Claudio
    D'Amico, Massimiliano
    2009 IEEE/WIC/ACM INTERNATIONAL JOINT CONFERENCES ON WEB INTELLIGENCE (WI) AND INTELLIGENT AGENT TECHNOLOGIES (IAT), VOL 1, 2009, : 206 - 213
  • [4] A Generalized Rand-Index Method for Consensus Clustering of Separate Partitions of the Same Data Base
    Abba M. Krieger
    Paul E. Green
    Journal of Classification, 1999, 16 : 63 - 89
  • [5] A generalized rand-index method for consensus clustering of separate partitions of the same data base
    Krieger, AM
    Green, PE
    JOURNAL OF CLASSIFICATION, 1999, 16 (01) : 63 - 89
  • [6] A new video retrieval approach based on clustering
    Lei, Z
    Wu, LD
    Lao, SY
    Wang, G
    Wang, C
    PROCEEDINGS OF THE 2004 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2004, : 1733 - 1738
  • [7] Effective and Optimal Clustering Based on New Clustering Validity Index
    Zhu, Erzhou
    Li, Peng
    Ma, Zhujuan
    Li, Xuejun
    Liu, Feng
    PROCEEDINGS OF THE 2018 IEEE 22ND INTERNATIONAL CONFERENCE ON COMPUTER SUPPORTED COOPERATIVE WORK IN DESIGN ((CSCWD)), 2018, : 529 - 534
  • [8] OBJECT BASED VALIDATION ALGORITHM AND ITS APPLICATION TO CONSENSUS CLUSTERING
    Fa, Rui
    Abu-Jamous, Basel
    Nandi, Asoke K.
    2013 PROCEEDINGS OF THE 21ST EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2013,
  • [9] Probabilistic information retrieval method based on differential latent semantic index space
    Chen, L
    Tokuda, N
    Nagai, A
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2001, E84D (07) : 910 - 914
  • [10] An effective partitional clustering algorithm based on new clustering validity index
    Zhu, Erzhou
    Ma, Ruhui
    APPLIED SOFT COMPUTING, 2018, 71 : 608 - 621