Comprehensive Characterization of an Open Source Document Search Engine

被引:3
|
作者
Hadjilambrou, Zacharias [1 ]
Kleanthous, Marios [1 ]
Antoniou, Georgia [1 ]
Portero, Antoni [2 ,3 ]
Sazeides, Yiannakis [1 ]
机构
[1] Univ Cyprus, 1 Univ Ave, CY-2109 Aglantzia, Cyprus
[2] IT4Innovations, Ostrava, Czech Republic
[3] VSB Univ Ostrava, IT4Innovations, Ostrava 70833, Czech Republic
关键词
Document search; index partitioning; parallel index search; parallelism; characterization; real hardware; measurement; evaluation; performance; experimentation; WEB SEARCH;
D O I
10.1145/3320346
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
This work performs a thorough characterization and analysis of the open source Lucene search library. The article describes in detail the architecture, functionality, and micro-architectural behavior of the search engine, and investigates prominent online document search research issues. In particular, we study how intra-server index partitioning affects the response time and throughput, explore the potential use of low power servers for document search, and examine the sources of performance degradation ands the causes of tail latencies. Some of our main conclusions are the following: (a) intra-server index partitioning can reduce tail latencies but with diminishing benefits as incoming query traffic increases, (b) low power servers given enough partitioning can provide same average and tail response times as conventional high performance servers, (c) index search is a CPU-intensive cache-friendly application, and (d) C-states are the main culprits for performance degradation in document search.
引用
收藏
页数:21
相关论文
共 50 条
  • [1] Swift Search An open-source search engine
    Kaneria, Fenil
    Khan, Shafaq
    Nizamuddin, Nishara
    [J]. 2022 7TH INTERNATIONAL CONFERENCE ON INFORMATION SYSTEMS ENGINEERING, ICISE 2022, 2022, : 6 - 9
  • [2] OrChem: an open source chemistry search engine for Oracle
    Mark L Rijnbeek
    Christoph Steinbeck
    [J]. Journal of Cheminformatics, 2 (Suppl 1)
  • [3] NBLucene: Flexible and Efficient Open Source Search Engine
    Zhang, Zhaohua
    Ye, Benjun
    Huang, Jiayi
    Stones, Rebecca
    Wang, Gang
    Liu, Xiaoguang
    [J]. WEB-AGE INFORMATION MANAGEMENT, PT I, 2016, 9658 : 504 - 516
  • [4] Towards a Semantic Search Engine for Open Source Software
    Ben Sassi, Sihem
    [J]. SOFTWARE REUSE: BRIDGING WITH SOCIAL-AWARENESS, 2016, 9679 : 300 - 314
  • [5] ScoutSL: An Open-source Simulink Search Engine
    Shrestha, Sohil Lal
    Boll, Alexander
    Kehrer, Timo
    Csallner, Christoph
    [J]. 2023 ACM/IEEE INTERNATIONAL CONFERENCE ON MODEL DRIVEN ENGINEERING LANGUAGES AND SYSTEMS COMPANION, MODELS-C, 2023, : 70 - 74
  • [6] OrChem - An open source chemistry search engine for Oracle®
    Mark Rijnbeek
    Christoph Steinbeck
    [J]. Journal of Cheminformatics, 1
  • [7] OrChem - An open source chemistry search engine for Oracle®
    Rijnbeek, Mark
    Steinbeck, Christoph
    [J]. JOURNAL OF CHEMINFORMATICS, 2009, 1
  • [8] An open source and modular search engine for biomedical literature retrieval
    Almeida, Hayda
    Jean-Louis, Ludovic
    Meurs, Marie-Jean
    [J]. COMPUTATIONAL INTELLIGENCE, 2018, 34 (01) : 200 - 218
  • [9] Stalker, a Multilingual Text Mining Search Engine for Open Source Intelligence
    Neri, F.
    Pettoni, M.
    [J]. PROCEEDINGS OF THE INTERNATIONAL WORKSHOP ON COMPUTATIONAL INTELLIGENCE IN SECURITY FOR INFORMATION SYSTEMS CISIS 2008, 2009, 53 : 35 - +
  • [10] Stalker, a multilingual text mining search engine for Open Source Intelligence
    Neri, F.
    Pettoni, Ten Col. M.
    [J]. PROCEEDINGS OF THE 12TH INTERNATIONAL INFORMATION VISUALISATION, 2008, : 314 - 320