Mathematical formula information retrieval system

被引:0
|
作者
Hou, Yong [1 ]
机构
[1] Bengbu Univ, Sch Comp & Informat Engn, Bengbu 233030, Anhui, Peoples R China
关键词
Mathematical formula; index; retrieval; mathematical content representation; document sorting; retrieval engine;
D O I
10.3233/JCM-226961
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Design and implementation of the system for retrieving information about mathematical formulas - MFIRS. The structure of the system is mainly divided into the modules: input normalization, mathematical formula unification, mathematical formula encoding, text information feature extraction, mathematical formula feature extraction, mathematical formula indexing, retrieval and ranking. A method for extracting mathematical formulas and keywords based on FastText word embedding technology is proposed. This method can be used not only to get the structural features of the formula, but also to facilitate the calculation of the similarity of the formula by the vector result. At the same time, the model introduces the semantic features of context-rich mathematical formulas to improve the domain correlation of search results. The MathRetEval dataset was created based on about 7.9 x 10(5) arXiv documents and about 1.5 x 10(8) mathematical formulas. The scalability of the system is verified using this data set. The mathematical formulas can be written in the language TEX or MathML. When queried in the TEX language, it can be converted to a tree representation of the MathML representation and then indexed. This MFIRS is an information retrieval system for mathematical formulas with the features of mathematical perception, which can use the search for the similarity of partial formulas.
引用
收藏
页码:2949 / 2973
页数:25
相关论文
共 50 条
  • [41] INFORMATION-RETRIEVAL SYSTEM
    ALTSHULER, CH
    HOLLISTER, WN
    CHEST, 1976, 69 (01) : 136 - 136
  • [42] Intelligent Information Retrieval System
    Cho, Young Im
    PROCEEDINGS OF THE SIXTEENTH INTERNATIONAL SYMPOSIUM ON ARTIFICIAL LIFE AND ROBOTICS (AROB 16TH '11), 2011, : 565 - 568
  • [43] A note on information retrieval system
    Wong, JWT
    Kan, WK
    Young, GH
    PROCEEDINGS OF SECOND INTERNATIONAL WORKSHOP ON CSCW IN DESIGN, 1997, : 556 - 561
  • [44] A system for adaptive information retrieval
    Psarras, Ioannis
    Jose, Joemon
    ADAPTIVE HYPERMEDIA AND ADAPTIVE WEB-BASED SYSTEMS, PROCEEDINGS, 2006, 4018 : 313 - 317
  • [45] The cooperative system for information retrieval
    Mekaouche, A
    XX INTERNATIONAL CONFERENCE OF THE CHILEAN COMPUTER SCIENCE SOCIETY - PROCEEDINGS, 2000, : 199 - 209
  • [46] A system for Music Information Retrieval
    Lahart, O
    O'Riordan, C
    ARTIFICIAL INTELLIGENCE AND COGNITIVE SCIENCE, PROCEEDINGS, 2002, 2464 : 197 - 202
  • [47] The ECOINFORM information retrieval system
    Vasil'ev, AG
    Akoev, MA
    Sal'nikov, AA
    Smirnov, LN
    RUSSIAN JOURNAL OF ECOLOGY, 2002, 33 (05) : 366 - 369
  • [48] CLIENT INFORMATION RETRIEVAL SYSTEM
    GREENE, RJ
    JOURNAL OF ACCOUNTANCY, 1974, 137 (02): : 79 - 82
  • [49] A CENTRAL INFORMATION RETRIEVAL SYSTEM
    ARMSTRONG, DL
    GRENIER, MT
    JOURNAL OF CHEMICAL DOCUMENTATION, 1965, 5 (02): : 99 - +
  • [50] SYSTEM OF SOILS INFORMATION RETRIEVAL
    JOHN, MK
    SPROUT, PN
    VANLAERH.CJ
    CANADIAN JOURNAL OF SOIL SCIENCE, 1972, 52 (03) : 351 - &