Dynamic Feed-Forward LSTM

被引:0
|
作者
Piao, Chengkai [1 ]
Wang, Yuchen [1 ]
Wei, Jinmao [1 ]
机构
[1] Nankai Univ, 38 Tongyan Rd, Tianjin 300350, Peoples R China
基金
中国国家自然科学基金;
关键词
Dynamic Process; Feed Forward; LSTM; Full Context; ATTENTION MECHANISM; BIDIRECTIONAL LSTM; MODEL;
D O I
10.1007/978-3-031-40283-8_17
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We address the insufficient hidden states capabilities and single-direction feeding flaws of existing LSTM caused by its horizontal recurrent steps. To this end, we propose the Dynamic Feed-Forward LSTM (D-LSTM). Specifically, our D-LSTM first expands the capabilities of hidden states by assigning an exclusive state vector to each word. Then, the Dynamic Additive Attention (DAA) method is utilized to adaptively compress local context words into a fixed size vector. Last, a vertical feed-forward process is proposed to search context relations by filtering informative features in the compressed context vector and updating hidden states. With the help of exclusive hidden states, each word can preserve its most correlated context features and hidden states do not interfere with each other. By setting an appropriate context window size for DAA and stacking multiple such layers, the context scope can be gradually expanded from a central word to both sides and achieve the whole sentence at the top layer. Furthermore, the D-LSTM module is compatible with parallel computing and amenable to training via back-propagation for its vertical prorogation. Experimental results on both classification and sequence tagging datasets insist that our models achieve competitive performance compared to existing LSTMs.
引用
收藏
页码:191 / 202
页数:12
相关论文
共 50 条
  • [31] Multivariable inferential feed-forward control
    Zhang, J
    Agustriyanto, R
    INDUSTRIAL & ENGINEERING CHEMISTRY RESEARCH, 2003, 42 (18) : 4186 - 4197
  • [32] Explaining synchrony in feed-forward networks:
    T. Nowotny
    R. Huerta
    Biological Cybernetics, 2003, 89 : 237 - 241
  • [33] Feed-Forward Network for Cancer Detection
    Pei, Shengyu
    Tong, Lang
    Li, Xia
    Jiang, Jin
    Huang, Jingyu
    2017 13TH INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION, FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY (ICNC-FSKD), 2017, : 697 - 701
  • [34] Explaining synchrony in feed-forward networks:
    T. Nowotny
    R. Huerta
    Biological Cybernetics, 2003, 89 (6) : 449 - 449
  • [35] Wind Feed-forward Control of a USV
    Qu, Huajin
    Sarda, Edoardo I.
    Bertaska, Ivan R.
    von Ellenrieder, Karl D.
    OCEANS 2015 - GENOVA, 2015,
  • [36] The Case for Feed-Forward Clock Synchronization
    Ridoux, Julien
    Veitch, Darryl
    Broomhead, Timothy
    IEEE-ACM TRANSACTIONS ON NETWORKING, 2012, 20 (01) : 231 - 242
  • [37] Feed-forward in temperature control of buildings
    Thomas, B
    Soleimani-Mohseni, MS
    Fahlén, P
    ENERGY AND BUILDINGS, 2005, 37 (07) : 755 - 761
  • [38] Feed-forward and the evolution of social behavior
    Slobodchikoff, CN
    BEHAVIORAL AND BRAIN SCIENCES, 2000, 23 (02) : 265 - +
  • [39] A Novel Feed-Forward Controller for PMSMs
    Altun, Yusuf
    Gulez, Kayhan
    Mumcu, Tarik Veli
    Kizilkaya, M. Ozgur
    2013 3RD INTERNATIONAL CONFERENCE ON ELECTRIC POWER AND ENERGY CONVERSION SYSTEMS (EPECS), 2013,
  • [40] Quantum Feed-Forward Control of Light
    Andersen, Ulrik L.
    Filip, Radim
    PROGRESS IN OPTICS, VOL 53, 2009, 53 : 365 - 414