Hierarchical clustering of text corpora using suffix trees

被引:0
|
作者
Maslowska, I [1 ]
Slowinski, R [1 ]
机构
[1] Poznan Tech Univ, Inst Comp Sci, PL-60965 Poznan, Poland
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present a novel method for hierarchical clustering of text corpora, which proves especially suitable for online clustering. Information overload - the current phenomenon in electronic document repositories and the Internet in particular - constitutes an unceasing challenge for researchers. Clustering has been proposed as a comprehensive information access method. We describe a system, which automatically builds a navigable hierarchy of meaningful document groups. We claim that our system addresses two chief needs of the Web users: the need for efficient access to the up-to-date information on every available topic and the need for an organized and meaningful presentation of the desired information.
引用
收藏
页码:179 / 188
页数:10
相关论文
共 50 条
  • [21] Hierarchical clustering in minimum spanning trees
    Yu, Meichen
    Hillebrand, Arjan
    Tewarie, Prejaas
    Meier, Jil
    van Dijk, Bob
    Van Mieghem, Piet
    Stam, Cornelis Jan
    CHAOS, 2015, 25 (02)
  • [22] From suffix trees to suffix vectors
    Prieur, Elise
    Lecroq, Thierry
    INTERNATIONAL JOURNAL OF FOUNDATIONS OF COMPUTER SCIENCE, 2006, 17 (06) : 1385 - 1402
  • [23] Hierarchical Clustering Approach to Text Compression
    Oswald, C.
    Vyas, V. Akshay
    Kumar, K. Arun
    Sri, L. Vijay
    Sivaselvan, B.
    PROGRESS IN INTELLIGENT COMPUTING TECHNIQUES: THEORY, PRACTICE, AND APPLICATIONS, VOL 1, 2018, 518 : 347 - 357
  • [24] Hierarchical classification of diatom images using ensembles of predictive clustering trees
    Dimitrovski, Ivica
    Kocev, Dragi
    Loskovska, Suzana
    Dzeroski, Saso
    ECOLOGICAL INFORMATICS, 2012, 7 (01) : 19 - 29
  • [25] Fast approximate matching using suffix trees
    Cobbs, AL
    COMBINATORIAL PATTERN MATCHING, 1995, 937 : 41 - 54
  • [26] Using suffix trees for gapped motif discovery
    Rocke, E
    COMBINATORIAL PATTERN MATCHING, 2000, 1848 : 335 - 349
  • [27] Fully-Dynamic Hierarchical Graph Clustering Using Cut Trees
    Doll, Christof
    Hartmann, Tanja
    Wagner, Dorothea
    ALGORITHMS AND DATA STRUCTURES, 2011, 6844 : 338 - +
  • [28] Seeing the forest for the trees: using the Gene Ontology to restructure hierarchical clustering
    Dotan-Cohen, Dikla
    Kasif, Simon
    Melkman, Avraham A.
    BIOINFORMATICS, 2009, 25 (14) : 1789 - 1795
  • [29] Activity Discovery Using Compressed Suffix Trees
    Guha, Prithwijit
    Mukerjee, Amitabha
    Venkatesh, K. S.
    IMAGE ANALYSIS AND PROCESSING - ICIAP 2011, PT II, 2011, 6979 (II): : 69 - +
  • [30] Hierarchical Text Clustering and Categorisation using A Semi-Supervised Framework
    Mahyoub, Mohamed
    Hind, Jade
    Woods, David
    Wong, Carl
    Hussain, Abir
    Aljumeily, Dhiya
    12TH INTERNATIONAL CONFERENCE ON THE DEVELOPMENTS IN ESYSTEMS ENGINEERING (DESE 2019), 2019, : 153 - 159