Bidirectional Multi-Stack RNNs with Attention for Machine Translation

Cited by: 0
Authors
Chen, Zhiren [1]
Qiu, Ziyu [2]
Chen, Nuo [3]
Affiliations
[1] Univ Toronto, Dept Mech & Ind Engn, Toronto, ON, Canada
[2] Dalhousie Univ, Fac Comp Sci, Halifax, NS B3H 4R2, Canada
[3] Dalhousie Univ, Dept Mech Engn, Adv Control & Mechatron Lab, Halifax, NS B3H 4R2, Canada
DOI
10.1109/ONCON60463.2023.10430889
Chinese Library Classification
T [Industrial Technology];
Subject classification code
08;
Abstract
This project developed a novel attention algorithm for multi-stack bidirectional encoder-decoder RNN (including GRU and LSTM) sequence-to-sequence models, particularly for language translation tasks. When predicting each word, the attention mechanism uses matrix rearrangement and multiplication to compute how significant each vector in the encoder output is to the vectors in the current decoder hidden states. Our approach achieved 98% of the performance of a fine-tuned pretrained T5-small with 30% to 50% fewer parameters, depending on vocabulary size. This makes our model an ideal choice for single-processor training, limited processor resources, limited memory, small datasets, or tasks not supported by pretrained transformers.
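The abstract does not spell out the rearrangement steps, so the following is only a minimal sketch of generic dot-product attention over a stacked decoder, not the authors' algorithm. The function names (`softmax`, `attend`), the toy vectors, and the choice to sum scores across decoder layers are all assumptions made for illustration.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attend(encoder_outputs, decoder_hidden):
    """Generic dot-product attention (an assumption, not the paper's method).

    encoder_outputs: list of T vectors of length H, one per source token.
    decoder_hidden:  list of L vectors of length H, one per stacked layer.
    Returns (weights, context): T attention weights and an H-dim context vector.
    """
    # Score each encoder vector against every decoder layer's hidden state,
    # then sum the per-layer dot products (assumed way to combine the stack).
    scores = [
        sum(sum(e * h for e, h in zip(enc, hid)) for hid in decoder_hidden)
        for enc in encoder_outputs
    ]
    weights = softmax(scores)
    hidden_size = len(encoder_outputs[0])
    # Context vector: attention-weighted sum of the encoder vectors.
    context = [
        sum(w * enc[i] for w, enc in zip(weights, encoder_outputs))
        for i in range(hidden_size)
    ]
    return weights, context


# Toy example: 3 source tokens, hidden size 2, 2 stacked decoder layers.
encoder_outputs = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
decoder_hidden = [[1.0, 0.0], [0.5, 0.5]]
weights, context = attend(encoder_outputs, decoder_hidden)
```

In this toy case the third encoder vector aligns best with both decoder layers, so it receives the largest weight. In a bidirectional encoder the output vectors would typically concatenate forward and backward states, which a real implementation would have to reconcile with the decoder's hidden size.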
Pages: 6