Language Modeling with a General Second-Order RNN

Cited by: 0
Authors
Maupome, Diego [1 ]
Meurs, Marie-Jean [1 ]
Affiliations
[1] Univ Quebec Montreal UQAM, Montreal, PQ, Canada
Funding
Natural Sciences and Engineering Research Council of Canada; Canadian Institutes of Health Research;
Keywords
Recurrent Neural Networks; Language Modeling;
DOI
Not available
CLC classification number
TP39 [Computer Applications];
Discipline classification codes
081203 ; 0835 ;
Abstract
Different Recurrent Neural Network (RNN) architectures update their state in different manners as the input sequence is processed. RNNs that include a multiplicative interaction between their current state and the current input, known as second-order RNNs, have shown promising performance in language modeling. In this paper, we introduce a second-order RNN that generalizes existing ones. Evaluating on the Penn Treebank dataset, we analyze how its different components affect its performance in character-level recurrent language modeling. Our experiments control for the parameter counts of the models. We find that removing the first-order terms does not hinder performance. We then compare the effects of the relative sizes of the state space and the multiplicative interaction space on performance. We expected larger state spaces to benefit language models built on longer documents, and larger multiplicative interaction spaces to benefit models built on larger input spaces. However, our results suggest that this is not the case: the optimal relative size is the same for both document tokenizations used.
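The abstract describes a recurrent cell whose update combines a bilinear (second-order) interaction between the previous state and the current input with optional first-order terms. The following is a minimal PyTorch sketch of such a cell; the class name, tensor shapes, and the use of a full (unfactored) interaction tensor are illustrative assumptions, not the authors' exact formulation.

```python
import torch
import torch.nn as nn

class SecondOrderRNNCell(nn.Module):
    """Illustrative second-order RNN cell (not the paper's exact model)."""

    def __init__(self, input_size, hidden_size, use_first_order=True):
        super().__init__()
        # Third-order tensor for the multiplicative (bilinear) interaction:
        # output unit k gets h_{t-1}^T A[k] x_t
        self.A = nn.Parameter(0.01 * torch.randn(hidden_size, hidden_size, input_size))
        self.use_first_order = use_first_order
        if use_first_order:
            # First-order terms, which the paper reports can be removed
            # without hindering performance
            self.W = nn.Linear(input_size, hidden_size, bias=False)
            self.U = nn.Linear(hidden_size, hidden_size, bias=False)
        self.bias = nn.Parameter(torch.zeros(hidden_size))

    def forward(self, x_t, h_prev):
        # Second-order term: batched bilinear form over state and input
        second_order = torch.einsum('bh,khi,bi->bk', h_prev, self.A, x_t)
        pre_act = second_order + self.bias
        if self.use_first_order:
            pre_act = pre_act + self.W(x_t) + self.U(h_prev)
        return torch.tanh(pre_act)

# Example: one recurrent step over a batch of 4 embedded characters
cell = SecondOrderRNNCell(input_size=50, hidden_size=128, use_first_order=False)
h = torch.zeros(4, 128)    # zero initial state
x = torch.randn(4, 50)     # current input vectors
h = cell(x, h)             # updated state, shape (4, 128)
```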
Pages: 4749-4753
Number of pages: 5