Korean Neural Machine Translation Using Hierarchical Word Structure

Cited by: 0
Authors
Park, Jeonghyeok
Zhao, Hai [1]
Affiliations
[1] Shanghai Jiao Tong Univ, Key Lab Shanghai Educ Commiss Intelligent Interac, Dept Comp Sci & Engn, Shanghai, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Machine Translation; Hierarchical Word Structure; Korean Language
DOI
10.1109/ialp51396.2020.9310510
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Korean neural machine translation (NMT) can suffer significantly from low-resource conditions. We therefore propose an enhancement method that exploits the hierarchical structure of Korean words in the source-side representation. To the best of our knowledge, this is the first such attempt for Korean NMT. Every Korean word can be decomposed into character-level and jamo-level (sub-character) units. We therefore merge the character- and jamo-level representations with the word embeddings to capture the compositional meaning of words. The merged representations are then fed into the NMT model. Our simple and novel method achieves BLEU improvements of up to 0.6 over word-based NMT baselines on Korean-to-Chinese and Korean-to-English translation tasks.
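The word, character, and jamo levels mentioned in the abstract can be illustrated with a minimal sketch. It relies only on the standard Unicode layout of precomposed Hangul syllables (U+AC00 to U+D7A3). The names `to_jamo` and `word_hierarchy` are hypothetical and are not taken from the paper; how the authors embed and merge the three levels is not described in this record, so the sketch stops at decomposition.

```python
# Hypothetical helper names; this is an illustration, not the authors' code.
# Precomposed Hangul syllables are laid out arithmetically in Unicode:
#   code point = 0xAC00 + (initial * 21 + medial) * 28 + final
HANGUL_BASE = 0xAC00
NUM_MEDIALS = 21
NUM_FINALS = 28  # includes the "no final consonant" slot at index 0
NUM_SYLLABLES = 19 * NUM_MEDIALS * NUM_FINALS  # 11172

INITIALS = [chr(0x1100 + i) for i in range(19)]           # leading consonants
MEDIALS = [chr(0x1161 + i) for i in range(NUM_MEDIALS)]   # vowels
FINALS = [""] + [chr(0x11A8 + i) for i in range(27)]      # trailing consonants


def to_jamo(char):
    """Split one Hangul syllable into its jamo; return other characters as-is."""
    offset = ord(char) - HANGUL_BASE
    if not 0 <= offset < NUM_SYLLABLES:
        return [char]
    initial, rest = divmod(offset, NUM_MEDIALS * NUM_FINALS)
    medial, final = divmod(rest, NUM_FINALS)
    jamo = [INITIALS[initial], MEDIALS[medial]]
    if final:  # index 0 means the syllable has no final consonant
        jamo.append(FINALS[final])
    return jamo


def word_hierarchy(word):
    """Return the word together with its character- and jamo-level units."""
    characters = list(word)
    return {
        "word": word,
        "characters": characters,
        "jamo": [to_jamo(c) for c in characters],
    }


if __name__ == "__main__":
    # U+D55C U+AD6D is the two-syllable word for "Korea".
    print(word_hierarchy("\ud55c\uad6d"))
```

In a model along the lines the abstract describes, each of the three levels would presumably get its own embedding table before merging; that step is left out here because the merge operation is not specified in this record.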
Pages: 294-298
Page count: 5
Related papers
50 in total
  • [1] Korean Neural Machine Translation Using Hierarchical Word Structure
    Park, Jeonghyeok
    Zhao, Hai
    [J]. 2020 International Conference on Asian Language Processing, IALP 2020, 2020, : 294 - 298
  • [2] Effect of Word Sense Disambiguation on Neural Machine Translation: A Case Study in Korean
    Quang-Phuoc Nguyen
    Anh-Dung Vo
    Shin, Joon-Choul
    Ock, Cheol-Young
    [J]. IEEE ACCESS, 2018, 6 : 38512 - 38523
  • [3] Korean-Vietnamese Neural Machine Translation System With Korean Morphological Analysis and Word Sense Disambiguation
    Quang-Phuoc Nguyen
    Vo, Anh-Dung
    Shin, Joon-Choul
    Phuoc Tran
    Ock, Cheol-Young
    [J]. IEEE ACCESS, 2019, 7 : 32602 - 32616
  • [4] Ancient Korean Neural Machine Translation
    Park, Chanjun
    Lee, Chanhee
    Yang, Yeongwook
    Lim, Heuiseok
    [J]. IEEE ACCESS, 2020, 8 : 116617 - 116625
  • [5] Word-Level Quality Estimation for Korean-English Neural Machine Translation
    Eo, Sugyeong
    Park, Chanjun
    Moon, Hyeonseok
    Seo, Jaehyung
    Lim, Heuiseok
    [J]. IEEE ACCESS, 2022, 10 : 44964 - 44973
  • [6] Incorporating Statistical Machine Translation Word Knowledge Into Neural Machine Translation
    Wang, Xing
    Tu, Zhaopeng
    Zhang, Min
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2018, 26 (12) : 2255 - 2266
  • [7] Priming Ancient Korean Neural Machine Translation
    Park, Chanjun
    Lee, Seolhwa
    Seo, Jaehyung
    Moon, Hyeonseok
    Eo, Sugyeong
    Lim, Heuiseok
    [J]. LREC 2022: THIRTEENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2022, : 22 - 28
  • [8] On the Word Alignment from Neural Machine Translation
    Li, Xintong
    Li, Guanlin
    Liu, Lemao
    Meng, Max
    Shi, Shuming
    [J]. 57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), 2019, : 1293 - 1303
  • [9] Content Word Aware Neural Machine Translation
    Chen, Kehai
    Wang, Rui
    Utiyama, Masao
    Sumita, Eiichiro
    [J]. 58TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2020), 2020, : 358 - 364
  • [10] Word Position Aware Translation Memory for Neural Machine Translation
    He, Qiuxiang
    Huang, Guoping
    Liu, Lemao
    Li, Li
    [J]. NATURAL LANGUAGE PROCESSING AND CHINESE COMPUTING (NLPCC 2019), PT I, 2019, 11838 : 367 - 379