Similarity-based document distribution for efficient distributed information retrieval

被引:0
|
作者
Herschel, Sven [1 ]
机构
[1] Humboldt Univ, D-10099 Berlin, Germany
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Performing information retrieval (IR) efficiently in a distributed environment is currently one of the main challenges in IR. Document representations are distributed among nodes in a manner that allows a query processing algorithm to efficiently direct queries to those nodes that contribute to the result. Existing term-based document distribution algorithms do not scale with large collection sizes or many-term queries because they incur heavy network traffic during the distribution and query phases. We propose a novel algorithm for document distribution, namely distance-based document distribution. The distribution obtained by our algorithm allows answering any IR query effectively by contacting only a few nodes, independent of both document collection size and network size, thereby improving efficiency. We accomplish this by linearizing the information retrieval search space such that it reflects the ranking formula which will be used for later retrieval. Our experimental evaluation indicates that effective information retrieval can be efficiently accomplished in distributed networks.
引用
收藏
页码:99 / 110
页数:12
相关论文
共 50 条
  • [1] A similarity-based view to distributed Information Retrieval with mobile agents
    Loia, V
    Luongo, P
    Senatore, S
    Sessa, MI
    [J]. 10TH IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS, VOLS 1-3: MEETING THE GRAND CHALLENGE: MACHINES THAT SERVE PEOPLE, 2001, : 1283 - 1286
  • [2] Similarity-based queries for information retrieval
    Aguilera, AI
    Subero, AR
    Tineo, LJ
    [J]. DATABASES IN NETWORKED INFORMATION SYSTEMS, PROCEEDINGS, 2001, 1966 : 148 - 156
  • [3] A semantic similarity-based social information retrieval model
    Choumane, Ali
    [J]. SOCIAL NETWORK ANALYSIS AND MINING, 2014, 4 (01) : 1 - 6
  • [4] Job information retrieval based on document similarity
    Wang, Jingfan
    Xia, Yunqing
    Zheng, Thomas Fang
    Wu, Xiaojun
    [J]. INFORMATION RETRIEVAL TECHNOLOGY, 2008, 4993 : 165 - +
  • [5] An efficient approach to similarity-based retrieval on top of relational databases
    Schumacher, J
    Bergmann, R
    [J]. ADVANCES IN CASE-BASED REASONING, PROCEEDINGS, 2001, 1898 : 273 - 284
  • [6] SIMILARITY-BASED RETRIEVAL WITH STRUCTURE-SENSITIVE SPARSE BINARY DISTRIBUTED REPRESENTATIONS
    Rachkovskij, Dmitri A.
    Slipchenko, Serge V.
    [J]. COMPUTATIONAL INTELLIGENCE, 2012, 28 (01) : 106 - 129
  • [7] Similarity-Based Information Retrieval and Its Role within Spatial Data Infrastructures
    Janowicz, Krzysztof
    Wilkes, Marc
    Lutz, Michael
    [J]. GEOGRAPHIC INFORMATION SCIENCE, 2008, 5266 : 151 - +
  • [8] Term similarity-based query expansion for cross-language information retrieval
    Adriani, M
    van Rijsbergen, CJ
    [J]. RESEARCH AND ADVANCED TECHNOLOGY FOR DIGITAL LIBRARIES, PROCEEDINGS, 1999, 1696 : 311 - 322
  • [9] BAYESIAN RETRIEVAL USING A SIMILARITY-BASED LEMMATIZER
    Maragoudakis, Manolis
    Lyras, Dimitrios P.
    Sgarbas, Kyriakos
    [J]. INTERNATIONAL JOURNAL ON ARTIFICIAL INTELLIGENCE TOOLS, 2012, 21 (05)
  • [10] Surface similarity-based retrieval: in default or by default?
    Raynal, Lucas
    Sander, Emmanuel
    Clement, Evelyne
    [J]. ANNEE PSYCHOLOGIQUE, 2024, 124 (01):