An Incremental Algorithm for Clustering Search Results

被引:1
|
作者
Liu, Yongli [1 ]
Ouyang, Yuanxin [1 ]
Sheng, Hao [1 ]
Xiong, Zhang [1 ]
机构
[1] Beihang Univ, Sch Comp Sci & Technol, Beijing 100083, Peoples R China
关键词
D O I
10.1109/SITIS.2008.53
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
When internet users are facing massive search results, document clustering techniques are very helpful. Generally, existing clustering methods start with a known set of data objects, measured against a known set of attributes. However, there are numerous applications where the attribute set can only obtained gradually as processing data objects incrementally. This paper presents an incremental clustering algorithm (ICA) for clustering search results, which relies on pair-wise search result similarity calculated using Jaccard method. We use a measure namely, Cluster Average Similarity Area to score cluster cohesiveness. Experimental results show that our algorithm leads to less computational time than traditional clustering method while achieving a comparable or better clustering quality.
引用
收藏
页码:112 / 117
页数:6
相关论文
共 50 条
  • [1] A new algorithm for clustering search results
    Mecca, Giansalvatore
    Raunich, Salvatore
    Pappalardo, Alessandro
    [J]. DATA & KNOWLEDGE ENGINEERING, 2007, 62 (03) : 504 - 522
  • [2] Refining Web search engine results using incremental clustering
    Zhang, YJ
    Liu, ZQ
    [J]. INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2004, 19 (1-2) : 191 - 199
  • [3] An efficient algorithm for clustering search engine results
    Zhang, Hui
    Pang, Bin
    Xie, Ke
    Wu, Hui
    [J]. COMPUTATIONAL INTELLIGENCE AND SECURITY, 2007, 4456 : 661 - 671
  • [4] An efficient algorithm for clustering search engine results
    Hui Zhang
    Bin Pang
    Ke Xie
    Hui Wu
    [J]. 2006 INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND SECURITY, PTS 1 AND 2, PROCEEDINGS, 2006, : 1429 - 1434
  • [5] Clustering Algorithm Comparison of Search Results Documents
    David
    Kosala, Raymondus Raymond
    [J]. 2018 6TH INTERNATIONAL CONFERENCE ON CYBER AND IT SERVICE MANAGEMENT (CITSM), 2018, : 328 - 333
  • [6] Incremental clustering of search history in personalized search
    [J]. Wang, X. (xcwang@mtlab.hit.edu.cn), 1600, Binary Information Press, P.O. Box 162, Bethel, CT 06801-0162, United States (09):
  • [7] A concept-driven algorithm for clustering search results
    Osinski, S
    Weiss, D
    [J]. IEEE INTELLIGENT SYSTEMS, 2005, 20 (03) : 48 - 54
  • [8] Search Results Clustering Algorithm based on the Suffix Tree
    Wang, Dengwei
    Liu, Libo
    Dong, Jing
    Zheng, Jiao
    [J]. 2015 2ND INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE AND CONTROL ENGINEERING ICISCE 2015, 2015, : 456 - 460
  • [9] A Fast Incremental Clustering Algorithm
    Su, Xiaoke
    Lan, Yang
    Wan, Renxia
    Qin, Yuming
    [J]. ISIP: 2009 INTERNATIONAL SYMPOSIUM ON INFORMATION PROCESSING, PROCEEDINGS, 2009, : 175 - +
  • [10] Online Clustering Algorithm for Restructuring User Web Search Results
    Pavani, M.
    Teja, G. Ravi
    [J]. PROCEEDINGS OF THE 3RD INTERNATIONAL CONFERENCE ON FRONTIERS OF INTELLIGENT COMPUTING: THEORY AND APPLICATIONS (FICTA) 2014, VOL 1, 2015, 327 : 27 - 36