Fuzzy clustering for topic analysis and summarization of document collections

被引:0
|
作者
Witte, Rene [1 ]
Bergler, Sabine [2 ]
机构
[1] Univ Karlsruhe, Inst Programmstruckturen & Datenorg, Kaiserstr 12, Karlsruhe, Germany
[2] Concordia Univ, Dept Comp Sci & Software Engn, Montreal, PQ, Canada
来源
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Large document collections, such as those delivered by Internet search engines, are difficult and time-consuming for users to read and analyse. The detection of common and distinctive topics within a document set, together with the generation of multi-document summaries, can greatly ease the burden of information management. We show how this can be achieved with a clustering algorithm based on fuzzy set theory, which (i) is easy to implement and integrate into a personal information system, (ii) generates a highly flexible data structure for topic analysis and summarization, and (iii) also delivers excellent performance.
引用
收藏
页码:476 / +
页数:3
相关论文
共 50 条
  • [1] Probabilistic Topic Modeling for Comparative Analysis of Document Collections
    Hua, Ting
    Lu, Chang-Tien
    Choo, Jaegul
    Reddy, Chandan K.
    ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2020, 14 (02)
  • [2] Possibilistic fuzzy co-clustering of large document collections
    Tjhi, William-Chandra
    Chen, Lihui
    PATTERN RECOGNITION, 2007, 40 (12) : 3452 - 3466
  • [3] Topic Generation for Web Document Summarization
    Hsu, Heng-Yao
    Tsai, Chun-Wei
    Chiang, Ming-Chao
    Yang, Chu-Sing
    2008 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC), VOLS 1-6, 2008, : 3701 - +
  • [4] Unsupervised neural networks for automatic Arabic text summarization using document clustering and topic modeling
    Alami, Nabil
    Meknassi, Mohammed
    En-nahnahi, Noureddine
    El Adlouni, Yassine
    Ammor, Ouafae
    EXPERT SYSTEMS WITH APPLICATIONS, 2021, 172
  • [5] Integrating Document Clustering and Multidocument Summarization
    Wang, Dingding
    Zhu, Shenghuo
    Li, Tao
    Chi, Yun
    Gong, Yihong
    ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2011, 5 (03)
  • [6] Multi document summarization using clustering
    Balabantaray, R.C.
    Sahoo, D.K.
    Swain, M.
    Sahoo, B.
    Journal of Theoretical and Applied Information Technology, 2012, 46 (02) : 565 - 571
  • [7] Image content clustering and summarization for photo collections
    Li, Cheng-Hung
    Chiu, Chih-Yi
    Huang, Chun-Rong
    Chen, Chu-Song
    Chien, Lee-Feng
    2006 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO - ICME 2006, VOLS 1-5, PROCEEDINGS, 2006, : 1033 - +
  • [8] Topic-based Coordination for Visual Analysis of Evolving Document Collections
    Eler, Danilo Medeiros
    Paulovich, Fernando Vieira
    Ferreira de Oliveira, Maria Cristina
    Minghim, Rosane
    INFORMATION VISUALIZATION, IV 2009, PROCEEDINGS, 2009, : 149 - 155
  • [9] Evaluating Topic Representations for Exploring Document Collections
    Aletras, Nikolaos
    Baldwin, Timothy
    Lau, Jey Han
    Stevenson, Mark
    JOURNAL OF THE ASSOCIATION FOR INFORMATION SCIENCE AND TECHNOLOGY, 2017, 68 (01) : 154 - 167
  • [10] Mixture of Topic Model for Multi-document Summarization
    Liu Na
    Li Ming-xia
    Lu Ying
    Tang Xiao-jun
    Wang Hai-wen
    Xiao Peng
    26TH CHINESE CONTROL AND DECISION CONFERENCE (2014 CCDC), 2014, : 5168 - 5172