Bidirectional Multi-Stack RNNs with Attention for Machine Translation

被引:0
|
作者
Chen, Zhiren [1 ]
Qiu, Ziyu [2 ]
Chen, Nuo [3 ]
机构
[1] Univ Toronto, Dept Mech & Ind Engn, Toronto, ON, Canada
[2] Dalhousie Univ, Fac Comp Sci, Halifax, NS B3H 4R2, Canada
[3] Dalhousie Univ, Dept Mech Engn, Adv Control & Mechatron Lab, Halifax, NS B3H 4R2, Canada
关键词
D O I
10.1109/ONCON60463.2023.10430889
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
This project developed a novel attention algorithm for multi-stack bidirectional encoder-decoder RNN (including GRU and LSTM) sequence-to-sequence models, particularly for language translation tasks. The attention mechanism utilizes matrix rearranging and multiplication to compute the significance of the vectors in the encoder output to the vectors in the current decoder hidden states when predicting each word. Our approach achieved 98% of the performance of fine-tuned pretrained T5-small, with 30% to 50% fewer parameters depending on vocabulary size, making our model an ideal choice in cases of single-processor training, low processor resource, limited memory, small dataset, or tasks not supported by pretrained transformers.
引用
收藏
页数:6
相关论文
共 50 条
  • [21] A multi-stack based phylogenetic tree building method
    Busa-Fekete, Robert
    Kocsor, Andras
    Bagyinka, Csaba
    BIOINFORMATICS RESEARCH AND APPLICATIONS, PROCEEDINGS, 2007, 4463 : 49 - +
  • [22] Global Reachability in Bounded Phase Multi-stack Pushdown Systems
    Seth, Anil
    COMPUTER AIDED VERIFICATION, PROCEEDINGS, 2010, 6174 : 615 - 628
  • [23] Degraded Mode Operation Of Multi-Stack Fuel Cell Systems
    Cardenas, David Camilo Toquica
    Marx, Neigel
    Boulon, Loic
    Gustin, Frederic
    Hissel, Daniel
    2014 IEEE VEHICLE POWER AND PROPULSION CONFERENCE (VPPC), 2014,
  • [24] Effect of different temperature distribution on multi-stack BGA package
    Tung, Lun Hao
    Ng, Fei Chong
    Abas, Aizat
    Abdullah, M. Z.
    Samsudin, Zambri
    Ali, Mohd Yusuf Tura
    MICROELECTRONICS INTERNATIONAL, 2021, 38 (02) : 33 - 45
  • [25] Coordinated Control Technology for Multi-stack Fuel Cell System
    Li, Duankai
    Zhang, Guorui
    PROCEEDINGS OF THE 10TH HYDROGEN TECHNOLOGY CONVENTION, VOL 3, WHTC 2023, 2024, 395 : 159 - 165
  • [26] Analysis of Impacting Multi-stack Standard Cells on Chip Implementation
    Chang, Kyungjoon
    Kim, Taewhan
    2022 19TH INTERNATIONAL SOC DESIGN CONFERENCE (ISOCC), 2022, : 119 - 120
  • [27] Degraded mode operation of multi-stack fuel cell systems
    Marx, Neigel
    Cardenas, David Camilo Toquica
    Boulon, Loic
    Gustin, Frederic
    Hissel, Daniel
    IET ELECTRICAL SYSTEMS IN TRANSPORTATION, 2016, 6 (01) : 3 - 11
  • [28] FLEXIBLE CIRCUIT BOARD PACKAGE EMBEDDED WITH MULTI-STACK DIES
    Ueta, Nobuki
    Sato, Shunsuke
    Sato, Masakazu
    Nakao, Yoshio
    Magnuson, Joshua
    Ishizuka, Rocky
    PROCEEDINGS OF THE 2020 DESIGN OF MEDICAL DEVICES CONFERENCE (DMD2020), 2020,
  • [29] Thinned wafer multi-stack 3DI technology
    Ohba, Takayuki
    Maeda, Nobuhide
    Kitada, Hideki
    Fujimoto, Koji
    Suzuki, Kousuke
    Nakamura, Tomoji
    Kawai, Akihito
    Arai, Kazuhisa
    MICROELECTRONIC ENGINEERING, 2010, 87 (03) : 485 - 490
  • [30] Properties of a multi-stack type piezoelectric energy harvesting device
    Jeong, Soon-Jong
    Kim, Min-Soo
    Lee, Dae-Su
    Song, Jae-Sung
    INTEGRATED FERROELECTRICS, 2008, 98 (01) : 208 - 215