Bidirectional Multi-Stack RNNs with Attention for Machine Translation

Cited by: 0
Authors
Chen, Zhiren [1]
Qiu, Ziyu [2]
Chen, Nuo [3]
Affiliations
[1] Univ Toronto, Dept Mech & Ind Engn, Toronto, ON, Canada
[2] Dalhousie Univ, Fac Comp Sci, Halifax, NS B3H 4R2, Canada
[3] Dalhousie Univ, Dept Mech Engn, Adv Control & Mechatron Lab, Halifax, NS B3H 4R2, Canada
DOI
10.1109/ONCON60463.2023.10430889
Chinese Library Classification number
T [Industrial Technology]
Discipline classification code
08
Abstract
This project developed a novel attention algorithm for multi-stack bidirectional encoder-decoder RNN (including GRU and LSTM) sequence-to-sequence models, particularly for language translation tasks. The attention mechanism uses matrix rearrangement and multiplication to compute the significance of each vector in the encoder output to the vectors in the current decoder hidden states when predicting each word. Our approach achieved 98% of the performance of a fine-tuned pretrained T5-small with 30% to 50% fewer parameters, depending on vocabulary size. This makes our model well suited to single-processor training, limited processing resources, limited memory, small datasets, or tasks not supported by pretrained transformers.
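The abstract does not spell out how the matrices are rearranged, so the following is only a minimal sketch of generic dot-product attention, not the authors' algorithm. It assumes a bidirectional encoder whose output at each source position has size 2 × hidden, and a two-layer decoder whose stacked hidden states are flattened into a single query of the same size. The function names `softmax` and `attend`, the two-layer assumption, and the toy numbers are all illustrative.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attend(encoder_outputs, decoder_hidden):
    """Dot-product attention for one decoding step (illustrative only).

    encoder_outputs: seq_len vectors, each of size 2 * hidden
                     (forward and backward states concatenated).
    decoder_hidden:  num_layers vectors, each of size hidden.
                     ASSUMPTION: num_layers == 2, so flattening the stack
                     yields a query of size 2 * hidden.
    """
    # "Rearrange": flatten the stacked decoder states into one query vector.
    query = [x for layer in decoder_hidden for x in layer]
    if any(len(vec) != len(query) for vec in encoder_outputs):
        raise ValueError("query and encoder vectors must have equal size")

    # "Multiply": one dot product per source position.
    scores = [sum(q * e for q, e in zip(query, vec)) for vec in encoder_outputs]
    weights = softmax(scores)

    # Context vector: attention-weighted sum of the encoder outputs.
    context = [
        sum(w * vec[i] for w, vec in zip(weights, encoder_outputs))
        for i in range(len(query))
    ]
    return weights, context


# Toy input: 3 source positions, hidden size 2, 2 decoder layers.
encoder_outputs = [
    [1.0, 0.0, 0.0, 1.0],
    [0.0, 1.0, 1.0, 0.0],
    [0.5, 0.5, 0.5, 0.5],
]
decoder_hidden = [[2.0, 0.0], [0.0, 2.0]]
weights, context = attend(encoder_outputs, decoder_hidden)
```

In this toy case the first source position aligns best with the query, so it receives the largest weight. A practical model would compute the same quantities with batched tensor operations and would typically add learned projections, which are omitted here.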
Pages: 6