An entropy-based hierarchical search result clustering method by utilizing augmented information

被引:0
|
作者
Hung-Yu, Kao [1 ]
Hsin-Wei, Hsiao [1 ]
Chih-Lu, Lin [1 ]
Chia-Chun, Shih [2 ]
Tse-Ming, Tsai [2 ]
机构
[1] Natl Cheng Kung Univ, Dept Comp Sci & Informat Engn, Tainan 70101, Taiwan
[2] Inst Informat Ind, Innovat Digitech Enabled Applicat & Serv Inst, Taipei, Taiwan
来源
INFORMATION RETRIEVAL TECHNOLOGY | 2008年 / 4993卷
关键词
search engine; clustering; snippet; entropy; augmented information;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Because of the improvement of the technology of search engines, and the massively increase of the number of web pages, the results returned by the search engines are always mixed and disordered. Especially for the queries with multiple topics, the mixed and disorderly situation of the search results would be more obvious. The search engines can return information of several hundred to thousand of the pages' titles, snippets and URLs. Almost all of the technologies about search result clustering must attain further information from the contents of the returned lists. However, long execution time is not permitted for a real-time clustering system. In this paper we propose some methods with better efficiency to improve the previous technologies. We utilize and augment information that search engines returned and use entropy calculation to attain the term distribution in snippets. We also propose several new methods to attain better clustered search results and reduce execution time. Our experiments indicate that these proposed methods obtain the better clustered results.
引用
收藏
页码:670 / +
页数:2
相关论文
共 50 条
  • [1] Information entropy-based ant Clustering algorithm
    Weili, Zhao
    Zhiguo, Zhang
    Zhijun, Zhang
    International Journal of Advancements in Computing Technology, 2012, 4 (16) : 219 - 228
  • [2] Unsupervised non-hierarchical entropy-based clustering
    Jardino, M
    DATA ANALYSIS, CLASSIFICATION, AND RELATED METHODS, 2000, : 29 - 34
  • [3] A New Information Entropy-based Ant Clustering Algorithm
    Zhao Weili
    Zhang Zhiguo
    Zhang Zhijun
    APPLIED MECHANICS AND MANUFACTURING TECHNOLOGY, 2011, 87 : 101 - +
  • [4] Information Entropy-based Density Clustering Algorithm of Database Log
    Xiao, Zongshui
    Kong, Lanju
    Zhang, Baochen
    Qian, Jin
    Jin, Fuqi
    PROCEEDINGS OF THE 2021 IEEE 24TH INTERNATIONAL CONFERENCE ON COMPUTER SUPPORTED COOPERATIVE WORK IN DESIGN (CSCWD), 2021, : 102 - 108
  • [5] An Information Entropy-Based Modeling Method for the Measurement System
    Kong, Li
    Pan, Hao
    Li, Xuewei
    Ma, Shuangbao
    Xu, Qi
    Zhou, Kaibo
    ENTROPY, 2019, 21 (07)
  • [6] INFORMATION ENTROPY-BASED CLUSTERING ALGORITHM FOR RAPID SOFTWARE FAULT DIAGNOSIS
    Li, Yin-Zhao
    Hu, Chang-Zhen
    Wang, Kun-Sheng
    Xu, Li-Na
    He, Hui-Ling
    Ren, Jia-Dong
    PROCEEDINGS OF 2009 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-6, 2009, : 2106 - +
  • [7] An Information Entropy-Based Animal Migration Optimization Algorithm for Data Clustering
    Hou, Lei
    Gao, Jian
    Chen, Rong
    ENTROPY, 2016, 18 (05)
  • [8] Information Entropy-based Two-stage Clustering Recommendation Algorithm
    Wang, Yunxiang
    Wan, Jing
    Li, Hui
    PROCEEDINGS OF 2018 IEEE 9TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING AND SERVICE SCIENCE (ICSESS), 2018, : 78 - 83
  • [9] Hybrid sampling on mutual information entropy-based clustering ensembles for optimizations
    Wang, Feng
    Yang, Cheng
    Lin, Zhiyi
    Li, Yuanxiang
    Yuan, Yuan
    NEUROCOMPUTING, 2010, 73 (7-9) : 1457 - 1464
  • [10] An Entropy-Based Model for Hierarchical Learning
    Asadi, Amir R.
    JOURNAL OF MACHINE LEARNING RESEARCH, 2024, 25 : 1 - 45