Architecture of an hybrid system for experimentation on Web information retrieval incorporating clustering techniques

被引:0
|
作者
Mateos, Montserrat [1 ]
Figuerola, Carlos G. [2 ]
机构
[1] Univ Salamanca, E-37008 Salamanca, Spain
[2] Univ Salamanca, REINA Res Grp, E-37008 Salamanca, Spain
关键词
Web information retrieval; clustering; architecture; hybrid systems;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The results retrieved from common search engines are frequently not acceptable because of the large amounts of Web pages that are returned and the ambiguity of the results. Several authors have suggested grouping the Web pages by topics using clustering techniques. The application of these techniques involves different alternatives in different areas. Carrying out these research works could be easier and their cost lower if an experimentation hybrid system could be available that implemented the different alternatives or new ones could be added with a low cost. This work proposes an architecture of a Web information retrieval experimentation hybrid system that incorporates clustering techniques. This hybrid system is composed of different replaceable components, so that we could test different alternatives with a low cost. Later, using the proposed and the implemented system an empirical study is exposed in order to evaluate what type of clustering techniques is the most appropriate.
引用
收藏
页码:427 / 434
页数:8
相关论文
共 50 条
  • [41] An automatic web wrapper for extracting information from web sources, using clustering techniques
    Papadakis, N
    Skoutas, D
    Raftopoulos, K
    Varvarigou, T
    2005 SYMPOSIUM ON APPLICATIONS AND THE INTERNET, PROCEEDINGS, 2005, : 24 - 30
  • [42] An extensible system architecture for web-based lidar experimentation and data analysis
    Parikh, NC
    Parikh, JA
    Clark, M
    Damon, M
    Mandable, S
    Connors, M
    22ND INTERNATIONAL LASER RADAR CONFERENCE (ILRC 2004), VOLS 1 AND 2, 2004, 561 : 243 - 246
  • [43] Web pages clustering and concepts mining: An approach towards intelligent information retrieval
    Li, Fang
    Mehlitz, Martin
    Fen, Li
    Sheng, Huanye
    2006 IEEE CONFERENCE ON CYBERNETICS AND INTELLIGENT SYSTEMS, VOLS 1 AND 2, 2006, : 522 - +
  • [44] The Continuous Media Web: a distributed multimedia information retrieval architecture extending the World Wide Web
    Silvia Pfeiffer
    Conrad Parker
    André Pang
    Multimedia Systems, 2005, 10 : 544 - 558
  • [45] The Continuous Media Web: a distributed multimedia information retrieval architecture extending the World Wide Web
    Pfeiffer, S
    Parker, C
    Pang, A
    MULTIMEDIA SYSTEMS, 2005, 10 (06) : 544 - 558
  • [46] Information Extraction from the Web: System and Techniques
    Luo Xiao
    Dieter Wissmann
    Michael Brown
    Stephan Jablonski
    Applied Intelligence, 2004, 21 : 195 - 224
  • [47] Information extraction from the Web: System and techniques
    Xiao, L
    Wissmann, D
    Brown, M
    Jablonski, S
    APPLIED INTELLIGENCE, 2004, 21 (02) : 195 - 224
  • [48] Efficient Techniques to Improve Clustering Accuracy on Web by Using Hybrid Approach
    Sharma, Monika
    Rizvi, M. A.
    HELIX, 2018, 8 (05): : 3731 - 3735
  • [49] Hybrid Image Retrieval in Digital Libraries A Large Scale Multicollection Experimentation of Deep Learning Techniques
    Moreux, Jean-Philippe
    Chiron, Guillaume
    DIGITAL LIBRARIES FOR OPEN KNOWLEDGE, TPDL 2018, 2018, 11057 : 354 - 358
  • [50] Research on information retrieval system based on ant clustering algorithm
    Liu, Peiyu
    Zhu, Zhenfang
    Zhao, Lina
    Journal of Software, 2009, 4 (09) : 1032 - 1036