Tree-Based Representation and Generation of Natural and Mathematical Language

被引:0
|
作者
Scarlatos, Alexander [1 ]
Lan, Andrew [1 ]
机构
[1] Univ Massachusetts, Amherst, MA 01003 USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Mathematical language in scientific communications and educational scenarios is important yet relatively understudied compared to natural languages. Recent works on mathematical language focus either on representing stand-alone mathematical expressions, especially in their natural tree format, or mathematical reasoning in pre-trained natural language models. Existing works on jointly modeling and generating natural and mathematical languages simply treat mathematical expressions as text, without accounting for the rigid structural properties of mathematical expressions. In this paper, we propose a series of modifications to existing language models to jointly represent and generate text and math: representing mathematical expressions as sequences of node tokens in their operator tree format, using math symbol and tree position embeddings to preserve the semantic and structural properties of mathematical expressions, and using a constrained decoding method to generate mathematically valid expressions. We ground our modifications in GPT-2, resulting in a model MathGPT, and demonstrate that it outperforms baselines on mathematical expression generation tasks.
引用
收藏
页码:3714 / 3730
页数:17
相关论文
共 50 条
  • [1] Natural Language Inference by Tree-Based Convolution and Heuristic Matching
    Mou, Lili
    Men, Rui
    Li, Ge
    Xu, Yan
    Zhang, Lu
    Yan, Rui
    Jin, Zhi
    PROCEEDINGS OF THE 54TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2016), VOL 2, 2016, : 130 - 136
  • [2] A TREE-BASED STATISTICAL LANGUAGE MODEL FOR NATURAL-LANGUAGE SPEECH RECOGNITION
    BAHL, LR
    BROWN, PF
    DESOUZA, PV
    MERCER, RL
    IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1989, 37 (07): : 1001 - 1008
  • [3] Tree-based picture generation
    Drewes, F
    THEORETICAL COMPUTER SCIENCE, 2000, 246 (1-2) : 1 - 51
  • [4] Tree-based BLSTM for mathematical expression recognition
    Zhang, Ting
    Mouchere, Harold
    Viard-Gaudin, Christian
    2017 14TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), VOL 1, 2017, : 914 - 919
  • [5] Tree-based generation of languages of fractals
    Drewes, F
    THEORETICAL COMPUTER SCIENCE, 2001, 262 (1-2) : 377 - 414
  • [6] An algebra for tree-based music generation
    Drewes, Frank
    Hogberg, Johanna
    ALGEBRAIC INFORMATICS, 2007, 4728 : 172 - 188
  • [7] TREE-BASED SEMANTIC ANALYSIS METHOD FOR NATURAL LANGUAGE PHRASE TO FORMAL QUERY CONVERSION
    Litvin, A. A.
    Yu, Velychko V.
    Kaverynskyi, V. V.
    RADIO ELECTRONICS COMPUTER SCIENCE CONTROL, 2021, (02) : 105 - 113
  • [8] Context-Aware Tree-Based Convolutional Neural Networks for Natural Language Inference
    Meng, Zhao
    Mou, Lili
    Li, Ge
    Jin, Zhi
    KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT, KSEM 2016, 2016, 9983 : 515 - 526
  • [9] Tree-Based Generation of Restricted Graph Languages
    Bjorklund, Henrik
    Bjorklund, Johanna
    Ericson, Petter
    INTERNATIONAL JOURNAL OF FOUNDATIONS OF COMPUTER SCIENCE, 2024, 35 (01N02) : 215 - 243
  • [10] SOME APPLICATIONS OF TREE-BASED MODELING TO SPEECH AND LANGUAGE
    RILEY, MD
    SPEECH AND NATURAL LANGUAGE, 1989, : 339 - 352