Unsupervised document clustering based on keyword clusters

被引:0
|
作者
Chang, HC [1 ]
Hsu, CC [1 ]
Deng, YW [1 ]
机构
[1] Hwa Hsia Coll Technol & Commerce, Dept Elect Engn, Taipei 235, Taiwan
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Due to the explosion growth of digital information, automatic document clustering or categorization has been an important research topic. Since document clustering has high dimension, the magnitude of the representation features will influence the efficiency and effect of clustering proceeding and precision of clustering results. This paper presents an unsupervised document clustering method based on partitioning a weighted undirected graph. It initially discovers a set of tightly relevant keyword clusters that are disposed throughout the feature space of the collection of documents, and further cluster the documents into document clusters by using these keyword clusters. The experimental results show that the proposed approach can efficiently produce higher quality document clustering as compared with several well-known document clustering algorithms.
引用
收藏
页码:1198 / 1203
页数:6
相关论文
共 50 条
  • [31] Multi-Document Summarization Based on Keyword Fusion
    Alshahrani, Saud
    Bikdash, Marwan
    2019 IEEE SOUTHEASTCON, 2019,
  • [32] Coherence based Document Clustering
    Thielmann, Anton
    Weisser, Christoph
    Kneib, Thomas
    Safken, Benjamin
    2023 IEEE 17TH INTERNATIONAL CONFERENCE ON SEMANTIC COMPUTING, ICSC, 2023, : 9 - 16
  • [33] Semantic-based keyword extraction method for document
    Jiang, Fang
    Li, Guohe
    Yun, Xue
    Yue, Xiang
    International Journal of Future Generation Communication and Networking, 2015, 8 (05): : 37 - 46
  • [34] Document clustering into an unknown number of clusters using a genetic algorithm
    Casillas, A
    de Lena, MTG
    Martínez, R
    TEXT, SPEECH AND DIALOGUE, PROCEEDINGS, 2003, 2807 : 43 - 49
  • [35] Soft Rough Set based span for unsupervised keyword extraction
    Chatterjee, Niladri
    Roy, Aayush Singha
    Yadav, Nidhika
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2022, 42 (05) : 4379 - 4386
  • [36] Unsupervised Keyword Extraction Methods Based on a Word Graph Network
    Wang, Hongbin
    Ye, Jingzhen
    Yu, Zhengtao
    Wang, Jian
    Mao, Cunli
    INTERNATIONAL JOURNAL OF AMBIENT COMPUTING AND INTELLIGENCE, 2020, 11 (02) : 68 - 79
  • [37] MILP-Based Unsupervised Clustering
    Malhotra, Akshay
    Schizas, Ioannis D.
    IEEE SIGNAL PROCESSING LETTERS, 2018, 25 (12) : 1825 - 1829
  • [38] An Unsupervised Keyword Extraction Method based on Text Semantic Graph
    Zhao, Liujun
    Miao, Zhongquan
    Wang, Chunming
    Kong, Weizheng
    2022 IEEE 6TH ADVANCED INFORMATION TECHNOLOGY, ELECTRONIC AND AUTOMATION CONTROL CONFERENCE (IAEAC), 2022, : 1431 - 1436
  • [39] Watershed-based unsupervised clustering
    Bicego, M
    Cristani, M
    Fusiello, A
    Murino, V
    ENERGY MINIMIZATION METHODS IN COMPUTER VISION AND PATTERN RECOGNITION, PROCEEDINGS, 2003, 2683 : 83 - 94
  • [40] Clustering clusters: unsupervised machine learning on globular cluster structural parameters
    Pasquato, Mario
    Chung, Chul
    MONTHLY NOTICES OF THE ROYAL ASTRONOMICAL SOCIETY, 2019, 490 (03) : 3392 - 3403