Consensus Clustering Based on a New Probabilistic Rand Index with Application to Subtopic Retrieval

被引:44
|
作者
Carpineto, Claudio [1 ]
Romano, Giovanni [1 ]
机构
[1] Fdn Ugo Bordoni, I-00161 Rome, Italy
关键词
Consensus clustering; Rand index; probabilistic Rand index; search results clustering; subtopic retrieval;
D O I
10.1109/TPAMI.2012.80
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We introduce a probabilistic version of the well-known Rand Index (RI) for measuring the similarity between two partitions, called Probabilistic Rand Index (PRI), in which agreements and disagreements at the object-pair level are weighted according to the probability of their occurring by chance. We then cast consensus clustering as an optimization problem of the PRI value between a target partition and a set of given partitions, experimenting with a simple and very efficient stochastic optimization algorithm. Remarkable performance gains over input partitions as well as over existing related methods are demonstrated through a range of applications, including a new use of consensus clustering to improve subtopic retrieval.
引用
收藏
页码:2315 / 2326
页数:12
相关论文
共 50 条
  • [41] A consensus process based on regret theory with probabilistic linguistic term sets and its application in venture capital
    Tian, Xiaoli
    Xu, Zeshui
    Gu, Jing
    Herrera, Francisco
    INFORMATION SCIENCES, 2021, 562 (562) : 347 - 369
  • [42] A new probabilistic classifier based on decomposable models with application to internet traffic
    Ghofrani, Fatemeh
    Keshavarz-Haddad, Alireza
    Jamshidi, Ali
    PATTERN RECOGNITION, 2018, 77 : 1 - 11
  • [43] A new cluster validity index for fuzzy clustering based on combination of dual triples
    Frelicot, Carl
    Mascarilla, Laurent
    Berthier, Michel
    2006 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS, VOLS 1-5, 2006, : 42 - +
  • [44] Stable Hierarchical Clustering Analysis based on New Designed Cluster Validity Index
    Zhu, Erzhou
    Zhu, Binbin
    FengLiu
    2018 3RD INTERNATIONAL CONFERENCE ON MECHANICAL, CONTROL AND COMPUTER ENGINEERING (ICMCCE), 2018, : 659 - 664
  • [45] A New Recommendation Approach Based on Probabilistic Soft Clustering Methods: A Scientific is Documentation Case Study
    Hurtado, Remigio
    Bobadilla, Jesus
    Bojorque, Rodolfo
    Ortega, Fernando
    Li, Xin
    IEEE ACCESS, 2019, 7 : 7522 - 7534
  • [46] A new method for fuzzy information retrieval based on fuzzy hierarchical clustering and fuzzy inference techniques
    Horng, YJ
    Chen, SM
    Chang, YC
    Lee, CH
    IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2005, 13 (02) : 216 - 228
  • [47] An Improved Seed Point Selection-Based Unsupervised Color Clustering for Content-Based Image Retrieval Application
    Pavithra, L. K.
    Sharmila, T. Sree
    COMPUTER JOURNAL, 2020, 63 (03): : 337 - 350
  • [48] A new graph-based clustering approach: Application to PMSI data
    Elghazel, Haytham
    Kheddouci, Hamamache
    Deslandres, Veronique
    Dussauchoy, Alain
    2006 INTERNATIONAL CONFERENCE ON SERVICE SYSTEMS AND SERVICE MANAGEMENT, VOLS 1 AND 2, PROCEEDINGS, 2006, : 110 - 115
  • [49] Relational clustering based on a new robust estimator with application to web mining
    Nasraoui, O
    Krishnapuram, R
    Joshi, A
    18TH INTERNATIONAL CONFERENCE OF THE NORTH AMERICAN FUZZY INFORMATION PROCESSING SOCIETY - NAFIPS, 1999, : 705 - 709
  • [50] New Version of Davies-Bouldin Index for Clustering Validation Based on Cylindrical Distance
    Rojas Thomas, Juan Carlos
    Santos Penas, Matilde
    Mora, Marco
    PROCEEDINGS OF 2013 32ND INTERNATIONAL CONFERENCE OF THE CHILEAN COMPUTER SCIENCE SOCIETY (SCCC), 2016, : 49 - 53