Hierarchical clustering of text corpora using suffix trees

Authors
Maslowska, I. [1]
Slowinski, R. [1]
Affiliation
[1] Poznan Tech Univ, Inst Comp Sci, PL-60965 Poznan, Poland
Keywords
DOI
Not available
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
We present a novel method for hierarchical clustering of text corpora, which proves especially suitable for online clustering. Information overload - a current phenomenon in electronic document repositories, and on the Internet in particular - poses a continuing challenge for researchers. Clustering has been proposed as a comprehensive information access method. We describe a system that automatically builds a navigable hierarchy of meaningful document groups. We claim that our system addresses two chief needs of Web users: the need for efficient access to up-to-date information on every available topic, and the need for an organized and meaningful presentation of the desired information.
Pages: 179-188
Page count: 10