MIaS: Math-Aware Retrieval in Digital Mathematical Libraries

被引:11
|
作者
Sojka, Petr [1 ]
Ruzicka, Michal [1 ]
Novotny, Vit [1 ]
机构
[1] Masaryk Univ, Fac Informat, Brno, Czech Republic
关键词
Math Information Retrieval; Digital Mathematical Libraries;
D O I
10.1145/3269206.3269233
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Digital mathematical libraries (DMLs) such as arXiv, Numdam, and EuDML contain mainly documents from STEM fields, where mathematical formulae are often more important than text for understanding. Conventional information retrieval (IR) systems are unable to represent formulae and they are therefore ill-suited for math information retrieval (MIR). To fill the gap, we have developed, and open-sourced the MIaS MIR system. MIaS is based on the fulltext search engine Apache Lucene. On top of text retrieval, MIaS also incorporates a set of tools for preprocessing mathematical formulae. We describe the design of the system and present speed, and quality evaluation results. We show that MIaS is both efficient, and effective, as evidenced by our victory in the NTCIR-11 Math-2 task.
引用
收藏
页码:1923 / 1926
页数:4
相关论文
共 50 条
  • [41] The network structure of mathematical knowledge according to the Wikipedia, Math World, and DLMF online libraries
    Gonzaga, Flavio B.
    Barbosa, Valmir C.
    Xexeo, Geraldo B.
    NETWORK SCIENCE, 2014, 2 (03) : 367 - 386
  • [42] Context-sensitive queries for image retrieval in digital libraries
    Boccignone, G.
    Chianese, A.
    Moscato, V.
    Picariello, A.
    JOURNAL OF INTELLIGENT INFORMATION SYSTEMS, 2008, 31 (01) : 53 - 84
  • [43] A general system for the retrieval of document images from digital libraries
    Marinai, S
    Marino, E
    Cesarini, F
    Soda, G
    FIRST INTERNATIONAL WORKSHOP ON DOCUMENT IMAGE ANALYSIS FOR LIBRARIES, PROCEEDINGS, 2004, : 150 - 173
  • [44] An object-based image retrieval system for digital libraries
    Sridhar R. Avula
    Jinshan Tang
    Scott T. Acton
    Multimedia Systems, 2006, 11 : 260 - 270
  • [45] Information Retrieval and Informetrics: The Application of Informetric Methods in Digital Libraries
    Schaer, Philipp
    HISTORICAL SOCIAL RESEARCH-HISTORISCHE SOZIALFORSCHUNG, 2013, 38 (03): : 282 - 354
  • [46] Patenting the processes for content-based retrieval in digital libraries
    Sasaki, H
    Kiyoki, Y
    DIGITAL LIBRARIES: PEOPLE, KNOWLEDGE, AND TECHNOLOGY, PROCEEDINGS, 2002, 2555 : 471 - 482
  • [47] An object-based image retrieval system for digital libraries
    Avula, SR
    Tang, JS
    Acton, ST
    MULTIMEDIA SYSTEMS, 2006, 11 (03) : 260 - 270
  • [48] Geographic Information Retrieval (GIR) ranking methods for digital libraries
    Larson, RR
    Frontiera, P
    JCDL 2004: PROCEEDINGS OF THE FOURTH ACM/IEEE JOINT CONFERENCE ON DIGITAL LIBRARIES: GLOBAL REACH AND DIVERSE IMPACT, 2004, : 415 - 415
  • [49] MinervaDL: An architecture for information retrieval and filtering in distributed digital libraries
    Zimmer, Christian
    Tryfonopoulos, Christos
    Weikum, Gerhard
    RESEARCH AND ADVANCED TECHNOLOGY FOR DIGITAL LIBRARIES, PROCEEDINGS, 2007, 4675 : 148 - +
  • [50] Fuzzy matching as a retrieval-enabling technique for digital libraries
    Girill, TR
    Luk, CH
    DIGITAL REVOLUTION - ASIS MID-YEAR 1996, 1996, : 139 - 145