An improved semantic smoothing model for model-based document clustering

被引:1
|
作者
Cai, Jiarong [1 ]
Liu, Yubao [1 ]
Yin, Jian [1 ]
机构
[1] Sun Yat sen Univ, Dept Comp Sci, Guangzhou 510275, Peoples R China
基金
中国国家自然科学基金;
关键词
D O I
10.1109/SNPD.2007.155
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recently, semantic smoothing is proposed as an efficient solution for the improvement of document cluster quality. However the existing semantic smoothing model is not effective for partitional clustering to enhance the document clustering quality. In this paper, inspired by the TF*IDF schema and background elimination strategy, we first introduce an improved semantic smoothing model, which is suitable for both agglomerative and partitional clustering. Based on the improved semantic smoothing model, two model-document clustering algorithms, the partitional clustering algorithm and the agglomerative clustering algorithm, are also presented. The experimental results show our algorithms are more effective than the previous methods to improve the cluster quality.
引用
收藏
页码:670 / +
页数:2
相关论文
共 50 条
  • [1] Semantic smoothing for model-based document clustering
    Zhang, Xiaodan
    Zhou, Xiaohua
    Hu, Xiaohua
    [J]. ICDM 2006: SIXTH INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2006, : 1193 - +
  • [2] Document clustering based on semantic smoothing approach
    Liu, Yubao
    Cai, Jiarong
    Yin, Jian
    Huang, Zhilan
    [J]. ADVANCES IN INTELLIGENT WEB MASTERING, 2007, 43 : 217 - +
  • [3] Model-based document categorization employing semantic pattern analysis and local structure clustering
    Fume, Kosei
    Ishitani, Yasuto
    [J]. DOCUMENT RECOGNITION AND RETRIEVAL XV, 2008, 6815
  • [4] Generative model-based document clustering: a comparative study
    Zhong, S
    Ghosh, J
    [J]. KNOWLEDGE AND INFORMATION SYSTEMS, 2005, 8 (03) : 374 - 384
  • [5] Generative model-based document clustering: a comparative study
    Shi Zhong
    Joydeep Ghosh
    [J]. Knowledge and Information Systems, 2005, 8 : 374 - 384
  • [6] Research on Mixture Language Model-based Document Clustering
    Wen, Jian
    Li, Zhoujun
    [J]. 2008 IEEE INTERNATIONAL CONFERENCE ON GRANULAR COMPUTING, VOLS 1 AND 2, 2008, : 649 - +
  • [7] Semantic Smoothing of Document Models for Agglomerative Clustering
    Zhou, Xiaohua
    Zhang, Xiaodan
    Hu, Xiaohua
    [J]. 20TH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2007, : 2922 - 2927
  • [8] MMPClust: A skew prevention algorithm for model-based document clustering
    Li, XG
    Yu, G
    Wang, DL
    [J]. DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, PROCEEDINGS, 2005, 3453 : 536 - 547
  • [9] Improved deep clustering model based on semantic consistency for image clustering
    Zhang, Feng
    Li, Lin
    Hua, Qiang
    Dong, Chun-Ru
    Lim, Boon-Han
    [J]. KNOWLEDGE-BASED SYSTEMS, 2022, 253
  • [10] Improved Model-based Clustering using Evolutionary Optimization
    Kebriaei, Emad
    Bijari, Kayvan
    Zare, Hadi
    [J]. 2017 ARTIFICIAL INTELLIGENCE AND ROBOTICS (IRANOPEN), 2017, : 182 - 187