A LSTM-Based Bidirectional Translation Model for Optimizing Rare Words and Terminologies

Cited by: 0
Authors
Huang, Xing [1]
Tan, Huobin [1]
Lin, Guangyan [1]
Tian, Yongfen [1]
Affiliations
[1] Beihang Univ, Sch Software, Beijing, Peoples R China
Keywords
bidirectional translation; RTR; mutual learning;
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Neural translation models have advanced greatly in recent years, and many studies have proposed effective solutions to their deficiencies. However, it remains difficult to obtain the best results for rare words and terminology, which are marked as unknown words because of the limited size of the dictionary. This paper presents a bidirectional translation model that translates between two languages and improves the handling of rare words and terminology. First, we use word2vec to obtain a word similarity model. By using this similarity model to replace the rare words in the training and test data, we address the problems caused by rare words. In addition, every term is treated as a rare word in this model, so that terminology is also translated well. Then, by introducing mutual learning into the symmetric LSTM, the translation accuracy between the two languages is improved. Experimental results show that this method achieves the expected goals in effectiveness and accuracy.
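The rare-word replacement step described in the abstract can be illustrated with a minimal sketch. The abstract does not say how a replacement is selected, so this sketch assumes the simplest reading: each out-of-vocabulary word is swapped for its nearest in-vocabulary neighbor by cosine similarity. The function names (cosine, nearest_in_vocab, replace_rare_words) are hypothetical, and the small hand-written vectors stand in for embeddings that a trained word2vec model would supply.

```python
import math

def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0

def nearest_in_vocab(word, vocab, embeddings):
    """Return the in-vocabulary word most similar to `word`, or None."""
    if word not in embeddings:
        return None
    candidates = [w for w in vocab if w in embeddings and w != word]
    if not candidates:
        return None
    return max(candidates, key=lambda w: cosine(embeddings[word], embeddings[w]))

def replace_rare_words(tokens, vocab, embeddings, unk="<unk>"):
    """Swap each out-of-vocabulary token for a similar in-vocabulary word.

    Tokens with no embedding fall back to the unknown-word marker.
    """
    out = []
    for tok in tokens:
        if tok in vocab:
            out.append(tok)
        else:
            out.append(nearest_in_vocab(tok, vocab, embeddings) or unk)
    return out

# Toy embeddings (an assumption for illustration, not real word2vec output).
EMBEDDINGS = {
    "doctor": [0.9, 0.1, 0.0],
    "cardiologist": [0.85, 0.2, 0.05],
    "river": [0.0, 0.9, 0.3],
    "visited": [0.1, 0.1, 0.9],
}
# The translation model's limited dictionary; "cardiologist" is rare.
VOCAB = {"the", "doctor", "river", "visited"}

sentence = ["the", "cardiologist", "visited", "zzyzx"]
replaced = replace_rare_words(sentence, VOCAB, EMBEDDINGS)
print(replaced)  # ['the', 'doctor', 'visited', '<unk>']
```

Under the same assumption, a domain term would be handled by keeping it out of VOCAB so that it goes through the same replacement path as any other rare word.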
Pages: 185-189
Number of pages: 5