NBLucene: Flexible and Efficient Open Source Search Engine

被引:1
|
作者
Zhang, Zhaohua [1 ]
Ye, Benjun [1 ]
Huang, Jiayi [1 ]
Stones, Rebecca [1 ]
Wang, Gang [1 ]
Liu, Xiaoguang [1 ]
机构
[1] Nankai Univ, Coll Comp & Control Engn, Tianjin, Peoples R China
来源
关键词
D O I
10.1007/978-3-319-39937-9_39
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The most popular open source projects for text searching have been designed to support many features. These projects are well-written in Java for cross-platform using. But when conducting research, the execution efficiency of program should be more essential, which is a problem for applications written in Java. It is also difficult for Java to use parallel mechanisms in the modern computer system like SIMD and GPUs. To this end, we expand an open source text searching project written in C++ for research purpose. Our approach is to define a flexible and efficient search engine architecture which consists of extensible application programming interfaces. We aim to provide a flexible architecture to enable researchers to readily implement and modify search engine algorithms and strategies. Moreover, we integrate one generic mathematical encoding library which can be used to compress inverted index. We also implement an integral framework for result summarization, including snippet generation and cache strategies. Experiment results show that the new architecture makes a significant improvement versus original work.
引用
收藏
页码:504 / 516
页数:13
相关论文
共 50 条
  • [1] Swift Search An open-source search engine
    Kaneria, Fenil
    Khan, Shafaq
    Nizamuddin, Nishara
    [J]. 2022 7TH INTERNATIONAL CONFERENCE ON INFORMATION SYSTEMS ENGINEERING, ICISE 2022, 2022, : 6 - 9
  • [2] SHiFT: An Efficient, Flexible Search Engine for Transfer Learning
    Renggli, Cedric
    Yao, Xiaozhe
    Kolar, Luka
    Rimanic, Luka
    Klimovic, Ana
    Zhang, Ce
    [J]. PROCEEDINGS OF THE VLDB ENDOWMENT, 2022, 16 (02): : 304 - 316
  • [3] OrChem: an open source chemistry search engine for Oracle
    Mark L Rijnbeek
    Christoph Steinbeck
    [J]. Journal of Cheminformatics, 2 (Suppl 1)
  • [4] Comprehensive Characterization of an Open Source Document Search Engine
    Hadjilambrou, Zacharias
    Kleanthous, Marios
    Antoniou, Georgia
    Portero, Antoni
    Sazeides, Yiannakis
    [J]. ACM TRANSACTIONS ON ARCHITECTURE AND CODE OPTIMIZATION, 2019, 16 (02)
  • [5] Towards a Semantic Search Engine for Open Source Software
    Ben Sassi, Sihem
    [J]. SOFTWARE REUSE: BRIDGING WITH SOCIAL-AWARENESS, 2016, 9679 : 300 - 314
  • [6] ScoutSL: An Open-source Simulink Search Engine
    Shrestha, Sohil Lal
    Boll, Alexander
    Kehrer, Timo
    Csallner, Christoph
    [J]. 2023 ACM/IEEE INTERNATIONAL CONFERENCE ON MODEL DRIVEN ENGINEERING LANGUAGES AND SYSTEMS COMPANION, MODELS-C, 2023, : 70 - 74
  • [7] OrChem - An open source chemistry search engine for Oracle®
    Mark Rijnbeek
    Christoph Steinbeck
    [J]. Journal of Cheminformatics, 1
  • [8] OrChem - An open source chemistry search engine for Oracle®
    Rijnbeek, Mark
    Steinbeck, Christoph
    [J]. JOURNAL OF CHEMINFORMATICS, 2009, 1
  • [9] ODECAL, a flexible open source rag doll simulation engine
    Lih-Hern, Pang
    Siang, Tan Yee
    Foo, Wong Chin
    Kuan, Wong Lai
    [J]. TECHNOLOGIES FOR E-LEARNING AND DIGITAL ENTERTAINMENT, PROCEEDINGS, 2006, 3942 : 680 - 687
  • [10] An open source and modular search engine for biomedical literature retrieval
    Almeida, Hayda
    Jean-Louis, Ludovic
    Meurs, Marie-Jean
    [J]. COMPUTATIONAL INTELLIGENCE, 2018, 34 (01) : 200 - 218