Language Modeling with a General Second-Order RNN

被引:0
|
作者
Maupome, Diego [1 ]
Meurs, Marie-Jean [1 ]
机构
[1] Univ Quebec Montreal UQAM, Montreal, PQ, Canada
基金
加拿大自然科学与工程研究理事会; 加拿大健康研究院;
关键词
Recurrent Neural Networks; Language Modeling;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Different Recurrent Neural Network (RNN) architectures update their state in different manners as the input sequence is processed. RNNs including a multiplicative interaction between their current state and the current input, second-order ones, show promising performance in language modeling. In this paper, we introduce a second-order RNNs that generalizes existing ones. Evaluating on the Penn Treebank dataset, we analyze how its different components affect its performance in character-lever recurrent language modeling. We perform our experiments controlling the parameter counts of models. We find that removing the first-order terms does not hinder performance. We perform further experiments comparing the effects of the relative size of the state space and the multiplicative interaction space on performance. Our expectation was that a larger states would benefit language models built on longer documents, and larger multiplicative interaction states would benefit ones built on larger input spaces. However, our results suggest that this is not the case and the optimal relative size is the same for both document tokenizations used.
引用
收藏
页码:4749 / 4753
页数:5
相关论文
共 50 条
  • [31] The regular-language semantics of second-order idealized ALGOL
    Ghica, DR
    McCusker, G
    THEORETICAL COMPUTER SCIENCE, 2003, 309 (1-3) : 469 - 502
  • [32] ON CONSISTENCY OF A FRAISSE-HYPOTHESIS ON DEFINABILITY IN A LANGUAGE OF SECOND-ORDER
    MAREK, W
    COMPTES RENDUS HEBDOMADAIRES DES SEANCES DE L ACADEMIE DES SCIENCES SERIE A, 1973, 276 (18): : 1169 - 1172
  • [33] A Linear Proof Language for Second-Order Intuitionistic Linear Logic
    Diaz-Caro, Alejandro
    Dowek, Gilles
    Ivnisky, Malena
    Malherbe, Octavio
    LOGIC, LANGUAGE, INFORMATION, AND COMPUTATION, WOLLIC 2024, 2024, 14672 : 18 - 35
  • [34] Order conditions for RKN methods solving general second-order oscillatory systems
    You, Xiong
    Zhao, Jinxi
    Yang, Hongli
    Fang, Yonglei
    Wu, Xinyuan
    NUMERICAL ALGORITHMS, 2014, 66 (01) : 147 - 176
  • [35] Order conditions for RKN methods solving general second-order oscillatory systems
    Xiong You
    Jinxi Zhao
    Hongli Yang
    Yonglei Fang
    Xinyuan Wu
    Numerical Algorithms, 2014, 66 : 147 - 176
  • [36] Solvability of the Dirichlet problem for a general second-order elliptic equation
    Dumanyan, V. Zh.
    SBORNIK MATHEMATICS, 2011, 202 (07) : 1001 - 1020
  • [37] A general Raychaudhuri's equation for second-order differential equations
    Jerie, M
    Prince, GE
    JOURNAL OF GEOMETRY AND PHYSICS, 2000, 34 (3-4) : 226 - 241
  • [38] Solvability of the Dirichlet problem for the general second-order elliptic equation
    V. Zh. Dumanyan
    Doklady Mathematics, 2011, 83 : 30 - 33
  • [39] Dimension reduction for second-order systems by general orthogonal polynomials
    Xiao, Zhi-Hua
    Jiang, Yao-Lin
    MATHEMATICAL AND COMPUTER MODELLING OF DYNAMICAL SYSTEMS, 2014, 20 (04) : 414 - 432
  • [40] A group classification of the general second-order coupled diffusion system
    Molati, M.
    Mahomed, F. M.
    JOURNAL OF PHYSICS A-MATHEMATICAL AND THEORETICAL, 2010, 43 (41)