Interactive textual feature selection for consensus clustering

被引:13
|
作者
Correa, Geraldo N. [1 ]
Marcacini, Ricardo M. [2 ]
Hruschka, Eduardo R. [1 ]
Rezende, Solange O. [1 ]
机构
[1] Univ Sao Paulo, Inst Math & Comp Sci, Sao Carlos, SP, Brazil
[2] Univ Fed Mato Grosso do Sul, CPTL, Tres Lagoas, MS, Brazil
基金
巴西圣保罗研究基金会;
关键词
Interactive feature selection; Consensus clustering; Text mining;
D O I
10.1016/j.patrec.2014.09.008
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Consensus clustering and interactive feature selection are very useful methods to extract and manage knowledge from texts. While consensus clustering allows the aggregation of different clustering solutions into a single robust clustering solution, the interactive feature selection facilitates the incorporation of the users' experience in the clustering tasks by selecting a set of textual features, i.e., including user's supervision at the term-level. We propose an approach for incorporating interactive textual feature selection into consensus clustering. Experimental results on several text collections demonstrate that our approach significantly improves consensus clustering accuracy, even when only few textual features are selected by the users. (C) 2014 Elsevier By. All rights reserved,
引用
收藏
页码:25 / 31
页数:7
相关论文
共 50 条
  • [1] Improving Consensus Clustering of Texts Using Interactive Feature Selection
    Marcacini, Ricardo M.
    Domingues, Marcos A.
    Rezende, Solange O.
    [J]. PROCEEDINGS OF THE 22ND INTERNATIONAL CONFERENCE ON WORLD WIDE WEB (WWW'13 COMPANION), 2013, : 237 - 238
  • [2] FEATURE-SELECTION BY INTERACTIVE CLUSTERING
    WISMATH, SK
    SOONG, HP
    AKL, SG
    [J]. PATTERN RECOGNITION, 1981, 14 (1-6) : 75 - 80
  • [3] Feature Subset Selection Using Consensus Clustering
    Rani, D. Sandhya
    Rani, T. Sobha
    Bhavani, S. Durga
    [J]. 2015 EIGHTH INTERNATIONAL CONFERENCE ON ADVANCES IN PATTERN RECOGNITION (ICAPR), 2015, : 57 - +
  • [4] Feature ranking based consensus clustering for feature subset selection
    Rani, D. Sandhya
    Rani, T. Sobha
    Bhavani, S. Durga
    Krishna, G. Bala
    [J]. APPLIED INTELLIGENCE, 2024, 54 (17-18) : 8154 - 8169
  • [5] InterActive Feature Selection
    Raghavan, Hema
    Madani, Omid
    Jones, Rosie
    [J]. 19TH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE (IJCAI-05), 2005, : 841 - 846
  • [6] Feature selection for clustering
    Dash, M
    Liu, H
    [J]. KNOWLEDGE DISCOVERY AND DATA MINING, PROCEEDINGS: CURRENT ISSUES AND NEW APPLICATIONS, 2000, 1805 : 110 - 121
  • [7] Unsupervised Feature Selection with Feature Clustering
    Cheung, Yiu-ming
    Jia, Hong
    [J]. 2012 IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE AND INTELLIGENT AGENT TECHNOLOGY (WI-IAT 2012), VOL 1, 2012, : 9 - 15
  • [8] Feature Selection for Visual Clustering
    Alagambigai, P.
    Thangavel, K.
    [J]. 2009 INTERNATIONAL CONFERENCE ON ADVANCES IN RECENT TECHNOLOGIES IN COMMUNICATION AND COMPUTING (ARTCOM 2009), 2009, : 498 - +
  • [9] On feature selection through clustering
    Butterworth, R
    Piatetsky-Shapiro, G
    Simovici, DA
    [J]. FIFTH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2005, : 581 - 584
  • [10] Feature selection and document clustering
    Dhillon, I
    Kogan, J
    Nicholas, C
    [J]. SURVEY OF TEXT MINING: CLUSTERING, CLASSIFICATION, AND RETRIEVAL, 2004, : 73 - 100