Fuzzy clustering for topic analysis and summarization of document collections

被引:0
|
作者
Witte, Rene [1 ]
Bergler, Sabine [2 ]
机构
[1] Univ Karlsruhe, Inst Programmstruckturen & Datenorg, Kaiserstr 12, Karlsruhe, Germany
[2] Concordia Univ, Dept Comp Sci & Software Engn, Montreal, PQ, Canada
来源
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Large document collections, such as those delivered by Internet search engines, are difficult and time-consuming for users to read and analyse. The detection of common and distinctive topics within a document set, together with the generation of multi-document summaries, can greatly ease the burden of information management. We show how this can be achieved with a clustering algorithm based on fuzzy set theory, which (i) is easy to implement and integrate into a personal information system, (ii) generates a highly flexible data structure for topic analysis and summarization, and (iii) also delivers excellent performance.
引用
收藏
页码:476 / +
页数:3
相关论文
共 50 条
  • [21] A topic modeling based approach to novel document automatic summarization
    Wu, Zongda
    Lei, Li
    Li, Guiling
    Huang, Hui
    Zheng, Chengren
    Chen, Enhong
    Xu, Guandong
    EXPERT SYSTEMS WITH APPLICATIONS, 2017, 84 : 12 - 23
  • [22] Topic-Sensitive Multi-document Summarization Algorithm
    Liu Na
    Tang Xiao-jun
    Lu Ying
    Li Ming-xia
    Wang Hai-wen
    Xiao Peng
    2014 SIXTH INTERNATIONAL SYMPOSIUM ON PARALLEL ARCHITECTURES, ALGORITHMS AND PROGRAMMING (PAAP), 2014, : 69 - 74
  • [23] Topic Exploration in Spatio-Temporal Document Collections
    Zhao, Kaiqi
    Chen, Lisi
    Cong, Gao
    SIGMOD'16: PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2016, : 985 - 998
  • [24] A novel contextual topic model for multi-document summarization
    Yang, Guangbing
    Wen, Dunwei
    Kinshuk
    Chen, Nian-Shing
    Sutinen, Erkki
    EXPERT SYSTEMS WITH APPLICATIONS, 2015, 42 (03) : 1340 - 1352
  • [25] A topic Approach to Sentence Ordering for Multi-document Summarization
    Na, Liu
    Peng, Xiao
    Ying, Lu
    Tang Xiao-jun
    Wang Hai-wen
    Li Ming-xia
    2016 IEEE TRUSTCOM/BIGDATASE/ISPA, 2016, : 1390 - 1395
  • [26] Topic-Guided Abstractive Multi-Document Summarization
    Cui, Peng
    Hu, Le
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2021, 2021, : 1463 - 1472
  • [27] Topic-Sensitive Multi-document Summarization Algorithm
    Liu Na
    Di Tang
    Lu Ying
    Tang Xiao-jun
    Wang Hai-wen
    COMPUTER SCIENCE AND INFORMATION SYSTEMS, 2015, 12 (04) : 1375 - 1389
  • [28] Efficient clustering of very large document collections
    Dhillon, IS
    Fan, J
    Guan, YQ
    DATA MINING FOR SCIENTIFIC AND ENGINEERING APPLICATIONS, 2001, 2 : 357 - 381
  • [29] Topic-driven Clustering for Document Datasets
    Zhao, Ying
    Karypis, George
    PROCEEDINGS OF THE FIFTH SIAM INTERNATIONAL CONFERENCE ON DATA MINING, 2005, : 358 - 369
  • [30] An efficient clustering approach for large document collections
    Han, B
    Kang, LS
    Song, HZ
    ADVANCED DATA MINING AND APPLICATIONS, PROCEEDINGS, 2005, 3584 : 240 - 247