Bidirectional Multi-Stack RNNs with Attention for Machine Translation

Authors
Chen, Zhiren [1]
Qiu, Ziyu [2]
Chen, Nuo [3]
Affiliations
[1] Univ Toronto, Dept Mech & Ind Engn, Toronto, ON, Canada
[2] Dalhousie Univ, Fac Comp Sci, Halifax, NS B3H 4R2, Canada
[3] Dalhousie Univ, Dept Mech Engn, Adv Control & Mechatron Lab, Halifax, NS B3H 4R2, Canada
DOI
10.1109/ONCON60463.2023.10430889
Chinese Library Classification (CLC) number
T [Industrial Technology];
Discipline classification code
08;
Abstract
This project developed a novel attention algorithm for multi-stack bidirectional encoder-decoder RNN (including GRU and LSTM) sequence-to-sequence models, aimed particularly at language translation tasks. The attention mechanism uses matrix rearrangement and multiplication to compute how significant each vector in the encoder output is to the vectors in the current decoder hidden states when predicting each word. Our approach achieved 98% of the performance of a fine-tuned pretrained T5-small with 30% to 50% fewer parameters, depending on vocabulary size. This makes our model a good choice for single-processor training, limited processing resources, limited memory, small datasets, or tasks not supported by pretrained transformers.
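The abstract does not spell out the algorithm, so the following is only a minimal sketch of the general idea and not the authors' method. It assumes a scaled dot-product score and a decoder hidden state stored as one row per layer and direction (forward then backward for each layer), which is rearranged into one query per layer before being multiplied against the encoder outputs. The function name `stacked_attention`, the tensor layout, and the scaling factor are all illustrative assumptions.

```python
import numpy as np


def softmax(x, axis=-1):
    """Numerically stable softmax along the given axis."""
    shifted = x - x.max(axis=axis, keepdims=True)
    exp = np.exp(shifted)
    return exp / exp.sum(axis=axis, keepdims=True)


def stacked_attention(encoder_outputs, decoder_hidden):
    """Hypothetical attention over a multi-stack bidirectional RNN.

    encoder_outputs: (src_len, 2 * hidden) top-layer outputs, with the
        forward and backward halves already concatenated.
    decoder_hidden: (num_layers * 2, hidden) current hidden states, assumed
        to be ordered layer 0 forward, layer 0 backward, layer 1 forward, ...
    Returns (weights, context) with shapes (num_layers, src_len) and
    (num_layers, 2 * hidden).
    """
    num_states, hidden = decoder_hidden.shape
    num_layers = num_states // 2

    # Rearranging step: merge the two directions of each layer into one
    # query vector so it matches the width of the encoder outputs.
    queries = decoder_hidden.reshape(num_layers, 2 * hidden)

    # Multiplication step: one score per (layer, source position) pair.
    # The sqrt scaling is an assumption borrowed from dot-product attention.
    scores = queries @ encoder_outputs.T / np.sqrt(2 * hidden)

    # Normalize over source positions, then mix the encoder vectors.
    weights = softmax(scores, axis=-1)
    context = weights @ encoder_outputs
    return weights, context


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    enc = rng.normal(size=(5, 8))  # 5 source tokens, hidden size 4 per direction
    dec = rng.normal(size=(4, 4))  # 2 layers x 2 directions
    weights, context = stacked_attention(enc, dec)
    print(weights.shape, context.shape)
```

Each row of `weights` sums to 1 and gives, for one decoder layer, the relative significance of every source position. How the per-layer context vectors are then combined to predict the next word is not described in the abstract and is left out here.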
Pages: 6