K-means based method for overlapping document clustering

被引:2
|
作者
Beltran, Beatriz [1 ]
Vilarino, Darnes [1 ]
Martinez-Trinidad, Jose Fco. [2 ]
Carrasco-Ochoa, J. A. [2 ]
Pinto, David [1 ]
机构
[1] Benemerita Univ Autonoma Puebla, Language & Knowledge Engn Lab, Puebla, Mexico
[2] Inst Nacl Astrofis Opt & Electr, Comp Sci, Puebla, Mexico
关键词
Clustering; overlapping clustering; document clustering; ALGORITHM; DENSITY;
D O I
10.3233/JIFS-179878
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Overlapping clustering algorithms have shown to be effective for clustering documents. However, the current overlapping document clustering algorithms produce a big number of clusters, which make them little useful for the user. Therefore, in this paper, we propose a k-means based method for overlapping document clustering, which allows to specify by the user the number of groups to be built. Our experiments with different corpora show that our proposal allows obtaining better results in terms of FBcubed than other recent works for overlapping document clustering reported in the literature.
引用
收藏
页码:2127 / 2135
页数:9
相关论文
共 50 条
  • [1] An extended version of the k-means method for overlapping clustering
    Cleuziou, Guillaume
    [J]. 19TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOLS 1-6, 2008, : 563 - 566
  • [2] An improved overlapping k-means clustering method for medical applications
    Khanmohammadi, Sina
    Adibeig, Naiier
    Shanehbandy, Samaneh
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2017, 67 : 12 - 18
  • [3] Text Document Clustering Based on Density K-means
    Wu, Di
    Zeng, Yan
    Qu, Yin-chuan
    [J]. INTERNATIONAL CONFERENCE ON COMPUTER, MECHATRONICS AND ELECTRONIC ENGINEERING (CMEE 2016), 2016,
  • [4] An ellipsoidal K-means for document clustering
    Dzogang, Fabon
    Marsala, Christophe
    Lesot, Marie-Jeanne
    Rifqi, Maria
    [J]. 12TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM 2012), 2012, : 221 - 230
  • [5] A Clustering Method Based on K-Means Algorithm
    Li, Youguo
    Wu, Haiyan
    [J]. INTERNATIONAL CONFERENCE ON SOLID STATE DEVICES AND MATERIALS SCIENCE, 2012, 25 : 1104 - 1109
  • [6] An Improved K-means Algorithm for Document Clustering
    Wu, Guohua
    Lin, Hairong
    Fu, Ershuai
    Wang, Liuyang
    [J]. 2015 INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND MECHANICAL AUTOMATION (CSMA), 2015, : 65 - 69
  • [7] Harmony K-means algorithm for document clustering
    Mahdavi, Mehrdad
    Abolhassani, Hassan
    [J]. DATA MINING AND KNOWLEDGE DISCOVERY, 2009, 18 (03) : 370 - 391
  • [8] Harmony K-means algorithm for document clustering
    Mehrdad Mahdavi
    Hassan Abolhassani
    [J]. Data Mining and Knowledge Discovery, 2009, 18 : 370 - 391
  • [9] One optimized choosing method of K-means document clustering center
    Suo, Hongguang
    Nie, Kunming
    Sun, Xin
    Wang, Yuwei
    [J]. INFORMATION RETRIEVAL TECHNOLOGY, 2008, 4993 : 490 - 495
  • [10] KERNEL OVERLAPPING K-MEANS FOR CLUSTERING IN FEATURE SPACE
    Ben N'Cir, Chiheb-Eddine
    Essoussi, Nadia
    Bertrand, Patrice
    [J]. KDIR 2010: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND INFORMATION RETRIEVAL, 2010, : 250 - 256