Solr Integration in the Anserini Information Retrieval Toolkit

被引:1
|
作者
Clancy, Ryan [1 ]
Eskildsen, Toke [2 ]
Ruest, Nick [3 ]
Lin, Jimmy [1 ]
机构
[1] Univ Waterloo, David R Cheriton Sch Comp Sci, Waterloo, ON, Canada
[2] Royal Danish Lib, Copenhagen, Denmark
[3] York Univ Libraries, Toronto, ON, Canada
基金
加拿大创新基金会; 加拿大自然科学与工程研究理事会;
关键词
D O I
10.1145/3331184.3331401
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Anserini is an open-source information retrieval toolkit built around Lucene to facilitate replicable research. In this demonstration, we examine different architectures for Solr integration in order to address two current limitations of the system: the lack of an interactive search interface and support for distributed retrieval. Two architectures are explored: In the first approach, Anserini is used as a frontend to index directly into a running Solr instance. In the second approach, Lucene indexes built directly with Anserini can be copied into a Solr installation and placed under its management. We discuss the tradeoffs associated with each architecture and report the results of a performance evaluation comparing indexing throughput. To illustrate the additional capabilities enabled by Anserini/Solr integration, we present a search interface built using the open-source Blacklight discovery interface.
引用
收藏
页码:1285 / 1288
页数:4
相关论文
共 50 条
  • [1] Information Retrieval Meets Scalable Text Analytics: Solr Integration with Spark
    Clancy, Ryan
    Lee, Jaejun
    Yilmaz, Zeynep Akkalyoncu
    Lin, Jimmy
    [J]. PROCEEDINGS OF THE 42ND INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '19), 2019, : 1313 - 1316
  • [2] Anserini: Enabling the Use of Lucene for Information Retrieval Research
    Yang, Peilin
    Fang, Hui
    Lin, Jimmy
    [J]. SIGIR'17: PROCEEDINGS OF THE 40TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2017, : 1253 - 1256
  • [3] A Mongolian Information Retrieval System Based on Solr
    Ma, Lujia
    Bao, Wei
    Bao, Wugedele
    Yuan, Wuriga
    Huang, Tao
    Zhao, XiaoBing
    [J]. PROCEEDINGS OF 2017 9TH INTERNATIONAL CONFERENCE ON MEASURING TECHNOLOGY AND MECHATRONICS AUTOMATION (ICMTMA), 2017, : 335 - 338
  • [4] Spatial Information Retrieval Optimization using Solr and GIS
    Xu, Jiajun
    Pei, Zhiyuan
    Guo, Lin
    Zhang, Ruxia
    Zhang, Yin
    Hu, Hualang
    Wang, Fei
    Zhang, Xuegang
    [J]. 2018 7TH INTERNATIONAL CONFERENCE ON AGRO-GEOINFORMATICS (AGRO-GEOINFORMATICS), 2018, : 419 - 424
  • [5] Anserini Gets Dense Retrieval: Integration of Lucene's HNSW Indexes
    Ma, Xueguang
    Teofili, Tommaso
    Lin, Jimmy
    [J]. PROCEEDINGS OF THE 32ND ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2023, 2023, : 5366 - 5370
  • [6] A Data-Parallel Toolkit for Information Retrieval
    Fetterly, Dennis
    McSherry, Frank
    [J]. SIGIR 2010: PROCEEDINGS OF THE 33RD ANNUAL INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH DEVELOPMENT IN INFORMATION RETRIEVAL, 2010, : 701 - 701
  • [7] Information Retrieval and Interaction System (IRIS): A Toolkit for Investigating Information Retrieval and Interaction Activities
    Pulliza, Jonathan
    Shah, Chirag
    [J]. CHIIR'18: PROCEEDINGS OF THE 2018 CONFERENCE ON HUMAN INFORMATION INTERACTION & RETRIEVAL, 2018, : 333 - 335
  • [8] Design and implementation of SOLR-based information retrieval system for value-added service
    WANG, Hong-man
    WANG, He-wei
    LIU, Yu-zhang
    YANG, Fang-chun
    [J]. Journal of China Universities of Posts and Telecommunications, 2008, 15 (SUPPL.): : 51 - 54
  • [9] Information integration and retrieval: the CDS hub
    Genova, F
    Bonnarel, F
    Dubois, P
    Egret, D
    Fernique, P
    Jasniewicz, G
    Lesteven, S
    Ochsenbein, F
    Wenger, M
    [J]. ASTRONOMICAL DATA ANALYSIS, 2001, 4477 : 142 - 150