MIaS: Math-Aware Retrieval in Digital Mathematical Libraries

被引:11
|
作者
Sojka, Petr [1 ]
Ruzicka, Michal [1 ]
Novotny, Vit [1 ]
机构
[1] Masaryk Univ, Fac Informat, Brno, Czech Republic
关键词
Math Information Retrieval; Digital Mathematical Libraries;
D O I
10.1145/3269206.3269233
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Digital mathematical libraries (DMLs) such as arXiv, Numdam, and EuDML contain mainly documents from STEM fields, where mathematical formulae are often more important than text for understanding. Conventional information retrieval (IR) systems are unable to represent formulae and they are therefore ill-suited for math information retrieval (MIR). To fill the gap, we have developed, and open-sourced the MIaS MIR system. MIaS is based on the fulltext search engine Apache Lucene. On top of text retrieval, MIaS also incorporates a set of tools for preprocessing mathematical formulae. We describe the design of the system and present speed, and quality evaluation results. We show that MIaS is both efficient, and effective, as evidenced by our victory in the NTCIR-11 Math-2 task.
引用
收藏
页码:1923 / 1926
页数:4
相关论文
共 50 条
  • [21] Towards Privacy Aware Social Semantic Digital Libraries
    Sacco, Owen
    Breslin, John
    NEW AVENUES FOR ELECTRONIC PUBLISHING IN THE AGE OF INFINITE COLLECTIONS AND CITIZEN SCIENCE: SCALE, OPENNESS AND TRUST, 2015, : 160 - 162
  • [22] Content-Based Image Retrieval in digital libraries
    Breiteneder, C
    Eidenberger, H
    2000 KYOTO INTERNATIONAL CONFERENCE ON DIGITAL LIBRARIES: RESEARCH AND PRACTICE, PROCEEDINGS, 2000, : 288 - 295
  • [23] Navigation, organization, and retrieval in personal digital libraries of email
    Gross, BM
    DIGITAL LIBRARIES: PEOPLE, KNOWLEDGE, AND TECHNOLOGY, PROCEEDINGS, 2002, 2555 : 258 - 259
  • [24] Introduction to the special section on digital libraries: Representation and retrieval
    Picard, RW
    Pentland, AP
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1996, 18 (08) : 769 - 770
  • [25] Building Models of Documentary and Factographic Retrieval in Digital Libraries
    Barakhnin, V. B.
    Fedotov, A. M.
    AUTOMATIC DOCUMENTATION AND MATHEMATICAL LINGUISTICS, 2014, 48 (06) : 296 - 304
  • [26] Modulation domain texture retrieval for CBIR in digital libraries
    Havlicek, JP
    Tang, JS
    Acton, ST
    Antonucci, R
    Ouandji, FN
    CONFERENCE RECORD OF THE THIRTY-SEVENTH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS & COMPUTERS, VOLS 1 AND 2, 2003, : 1580 - 1584
  • [27] Content-based information retrieval and digital libraries
    Wan, Gary
    Liu, Zao
    INFORMATION TECHNOLOGY AND LIBRARIES, 2008, 27 (01) : 41 - 47
  • [28] Conceptual information retrieval of technical papers for digital libraries
    Horii, C
    Imai, M
    Chihara, K
    IEEE FORUM ON RESEARCH AND TECHNOLOGY ADVANCES IN DIGITAL LIBRARIES, PROCEEDINGS, 1999, : 171 - 178
  • [29] SyDoM: A multilingual information retrieval system for digital libraries
    Roussey, C
    Calabretto, S
    Pinon, JM
    ELECTRONIC PUBLISHING '01, CONFERENCE PROCEEDINGS: 2001 IN THE DIGITAL PUBLISHING ODYSSEY, 2001, : 150 - 164
  • [30] An overview of the information retrieval features of twenty digital libraries
    Chowdhury, GG
    Chowdhury, S
    PROGRAM-ELECTRONIC LIBRARY AND INFORMATION SYSTEMS, 2000, 34 (04) : 341 - 373