MathNet: A Data-Centric Approach for Printed Mathematical Expression Recognition

Authors
Schmitt-Koopmann, Felix M. [1, 2]
Huang, Elaine M. [2]
Hutter, Hans-Peter [1]
Stadelmann, Thilo [3, 4]
Darvishy, Alireza [1]
Affiliations
[1] Zurich Univ Appl Sci ZHAW, Inst Comp Sci, CH-8401 Winterthur, Switzerland
[2] Univ Zurich, People & Comp Lab, CH-8050 Zurich, Switzerland
[3] Zurich Univ Appl Sci ZHAW, Ctr Artificial Intelligence, CH-8400 Winterthur, Switzerland
[4] European Ctr Living Technol ECLT, I-30123 Venice, Italy
Source
IEEE ACCESS | 2024 / Vol. 12
Funding
Swiss National Science Foundation
Keywords
Symbols; White spaces; Decoding; Rendering (computer graphics); Benchmark testing; Visualization; Artificial intelligence; Deep learning; Document handling; Mathematical models; Pattern recognition; Data-centric AI; labeling; document analysis; mathematical expression recognition
DOI
10.1109/ACCESS.2024.3404834
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Subject classification code
0812
Abstract
Printed mathematical expression recognition (MER) models are usually trained and tested using LaTeX-generated mathematical expressions (MEs) as input and the LaTeX source code as ground truth. Because the same ME can be generated by many different LaTeX source codes, the ground truth data contains unwanted variations that bias test performance results and hinder efficient learning. In addition, using only one font to generate the MEs heavily limits how well the reported results generalize to realistic scenarios. We propose a data-centric approach to overcome these problems and present experimental results. First, our main contribution is an enhanced LaTeX normalization that maps any LaTeX ME to a canonical form. Based on this process, we developed im2latexv2, an improved version of the benchmark dataset im2latex-100k, featuring 30 fonts instead of one. Second, we introduce the real-world dataset realFormula, with MEs extracted from papers. Third, we developed a MER model, MathNet, based on a convolutional vision transformer, which achieves superior results on all four test sets (im2latex-100k, im2latexv2, realFormula, and InftyMDB-1), outperforming the previous state of the art by up to 88.3%.
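The problem the abstract describes, several LaTeX sources rendering to the same ME, can be illustrated with a toy example. The sketch below is not the normalization used by the authors; the function name `normalize_latex` and its two rules (uniform token spacing, and bracing of single-character subscript and superscript arguments) are assumptions chosen only to show what mapping to a canonical form means.

```python
import re

def normalize_latex(src: str) -> str:
    """Toy canonicalization (hypothetical, not the paper's method).

    Rule 1: split the source into tokens and join them with single spaces.
    Rule 2: wrap a bare single-character script argument in braces,
            so that x^2 and x^{2} produce the same token sequence.
    Limitation: script arguments that are commands (e.g. x^\\alpha)
    are left unbraced.
    """
    # A token is a command (\frac), an escaped symbol (\{), or any
    # single non-whitespace character.
    tokens = re.findall(r"\\[A-Za-z]+|\\.|\S", src)
    out = []
    i = 0
    while i < len(tokens):
        tok = tokens[i]
        has_next = i + 1 < len(tokens)
        if tok in ("^", "_") and has_next:
            nxt = tokens[i + 1]
            # Brace only plain single characters, never "{" or a command.
            if nxt != "{" and not nxt.startswith("\\"):
                out.extend([tok, "{", nxt, "}"])
                i += 2
                continue
        out.append(tok)
        i += 1
    return " ".join(out)

# Two different sources for the same rendered expression
# end up with one ground-truth string.
print(normalize_latex("x^2+a_i"))
print(normalize_latex("x ^ {2} + a_{i}"))
```

With a canonical form like this, a model is not penalized at test time for predicting one valid spelling of an expression when the ground truth happens to use another.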
Pages: 76963-76974
Page count: 12