MIaS: Math-Aware Retrieval in Digital Mathematical Libraries

被引:11
|
作者
Sojka, Petr [1 ]
Ruzicka, Michal [1 ]
Novotny, Vit [1 ]
机构
[1] Masaryk Univ, Fac Informat, Brno, Czech Republic
关键词
Math Information Retrieval; Digital Mathematical Libraries;
D O I
10.1145/3269206.3269233
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Digital mathematical libraries (DMLs) such as arXiv, Numdam, and EuDML contain mainly documents from STEM fields, where mathematical formulae are often more important than text for understanding. Conventional information retrieval (IR) systems are unable to represent formulae and they are therefore ill-suited for math information retrieval (MIR). To fill the gap, we have developed, and open-sourced the MIaS MIR system. MIaS is based on the fulltext search engine Apache Lucene. On top of text retrieval, MIaS also incorporates a set of tools for preprocessing mathematical formulae. We describe the design of the system and present speed, and quality evaluation results. We show that MIaS is both efficient, and effective, as evidenced by our victory in the NTCIR-11 Math-2 task.
引用
收藏
页码:1923 / 1926
页数:4
相关论文
共 50 条
  • [1] Introducing MathQA: a Math-Aware question answering system
    Schubotz, Moritz
    Scharpf, Philipp
    Dudhat, Kaushal
    Nagar, Yash
    Hamborg, Felix
    Gipp, Bela
    INFORMATION DISCOVERY AND DELIVERY, 2018, 46 (04) : 214 - 224
  • [2] WebMIaS on Docker Deploying Math-Aware Search in a Single Line of Code
    Luptak, David
    Novotny, Vit
    Stefanik, Michal
    Sojka, Petr
    INTELLIGENT COMPUTER MATHEMATICS (CICM 2021), 2021, 12833 : 159 - 164
  • [3] PyA0: A Python']Python Toolkit for Accessible Math-Aware Search
    Zhong, Wei
    Lin, Jimmy
    SIGIR '21 - PROCEEDINGS OF THE 44TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2021, : 2541 - 2545
  • [4] Advancing Math-Aware Search: The ARQMath-3 Lab at CLEF 2022
    Mansouri, Behrooz
    Agarwal, Anurag
    Oard, Douglas W.
    Zanibbi, Richard
    ADVANCES IN INFORMATION RETRIEVAL, PT II, 2022, 13186 : 408 - 415
  • [5] MATH PROBLEM, INTERNET AND DIGITAL MATHEMATICAL PERFORMANCE
    Borba, Marcelo
    PROCEEDINGS OF THE PROBLEM@WEB INTERNATIONAL CONFERENCE: TECHNOLOGY, CREATIVITY AND AFFECT IN MATHEMATICAL PROBLEM SOLVING, 2014, : 4 - 5
  • [6] Semantic hypermedia retrieval in digital libraries
    Wiesener, S
    Kowarschick, W
    Vogel, P
    Bayer, R
    DIGITAL LIBRARIES: RESEARCH AND TECHNOLOGY ADVANCES, 1996, 1082 : 115 - 129
  • [7] Geographic Information Retrieval and Digital Libraries
    Larson, Ray R.
    RESEARCH AND ADVANCED TECHNOLOGY FOR DIGITAL LIBRARIES, PROCEEDINGS, 2009, 5714 : 461 - 464
  • [8] Information retrieval challenges for digital libraries
    Rasmussen, E
    DIGITAL LIBRARIES: INTERNATIONAL COLLABORATION AND CROSS-FERTILIZATION, PROCEEDINGS, 2004, 3334 : 95 - 103
  • [9] Mathematical Symbol Indexing for Digital Libraries
    Marinai, Simone
    Miotti, Beatrice
    Soda, Giovanni
    DIGITAL LIBRARIES, 2010, 91 : 113 - 124
  • [10] Forms of Plagiarism in Digital Mathematical Libraries
    Schubotz, Moritz
    Teschke, Olaf
    Stange, Vincent
    Meuschke, Norman
    Gipp, Bela
    INTELLIGENT COMPUTER MATHEMATICS, CICM 2019, 2019, 11617 : 258 - 274