Hierarchical clustering of text corpora using suffix trees

被引:0
|
作者
Maslowska, I [1 ]
Slowinski, R [1 ]
机构
[1] Poznan Tech Univ, Inst Comp Sci, PL-60965 Poznan, Poland
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present a novel method for hierarchical clustering of text corpora, which proves especially suitable for online clustering. Information overload - the current phenomenon in electronic document repositories and the Internet in particular - constitutes an unceasing challenge for researchers. Clustering has been proposed as a comprehensive information access method. We describe a system, which automatically builds a navigable hierarchy of meaningful document groups. We claim that our system addresses two chief needs of the Web users: the need for efficient access to the up-to-date information on every available topic and the need for an organized and meaningful presentation of the desired information.
引用
收藏
页码:179 / 188
页数:10
相关论文
共 50 条
  • [41] Suffix trees on words
    Andersson, A
    Larsson, NJ
    Swanson, K
    ALGORITHMICA, 1999, 23 (03) : 246 - 260
  • [42] TCBLHT: A new method of hierarchical text clustering
    Xu, JS
    Wang, L
    Proceedings of 2005 International Conference on Machine Learning and Cybernetics, Vols 1-9, 2005, : 2178 - 2181
  • [43] Efficient hierarchical clustering of large data sets using P-trees
    Denton, A
    Ding, Q
    Perrizo, W
    Ding, Q
    COMPUTER APPLICATIONS IN INDUSTRY AND ENGINEERING, 2002, : 138 - 141
  • [44] An improvement of Chinese text hierarchical clustering algorithm
    Liu, X. L.
    Chen, Z. G.
    Zeng, B.
    Zhu, Q. X.
    Chen, T.
    COMPUTING, CONTROL, INFORMATION AND EDUCATION ENGINEERING, 2015, : 647 - 650
  • [45] Relative Suffix Trees
    Farruggia, Andrea
    Gagie, Travis
    Navarro, Gonzalo
    Puglisi, Simon J.
    Siren, Jouni
    COMPUTER JOURNAL, 2018, 61 (05): : 773 - 788
  • [46] Text summarization for pharmaceutical sciences using hierarchical clustering with a weighted evaluation methodology
    Dalal, Avinash
    Ranjan, Sumit
    Bopaiah, Yajna
    Chembachere, Divya
    Steiger, Nick
    Burns, Christopher
    Daswani, Varsha
    SCIENTIFIC REPORTS, 2024, 14 (01):
  • [47] Improving the Decision Value of Hierarchical Text Clustering Using Term Overlap Detection
    Nathawitharana, Nilupulee
    Alahakoon, Damminda
    Matharage, Sumith
    AUSTRALASIAN JOURNAL OF INFORMATION SYSTEMS, 2015, 19 : S55 - S74
  • [48] PSIST: Indexing protein structures using suffix trees
    Gao, F
    Zaki, MJ
    2005 IEEE COMPUTATIONAL SYSTEMS BIOINFORMATICS CONFERENCE, PROCEEDINGS, 2005, : 212 - 222
  • [49] Clone detection using abstract syntax suffix trees
    Koschke, Rainer
    Falke, Raimar
    Frenzel, Pierre
    13TH WORKING CONFERENCE ON REVERSE ENGINEERING PROCEEDINGS, 2006, : 253 - 262
  • [50] Creating improvisations on chord progressions using suffix trees
    Ayad, Lorraine A. K.
    Chemillier, Marc
    Pissis, Solon P.
    JOURNAL OF MATHEMATICS AND MUSIC, 2018, 12 (03) : 233 - 247