A Classifier-Based Preordering Approach for English-Vietnamese Statistical Machine Translation

被引:0
|
作者
Viet Hong Tran [1 ,2 ]
Huyen Thuong Vu [2 ,3 ]
Vinh Van Nguyen [2 ]
Minh Le Nguyen [4 ]
机构
[1] Univ Econ & Tech Ind, Hanoi, Vietnam
[2] Vietnam Natl Univ, Univ Engn & Technol, Hanoi, Vietnam
[3] ThuyLoi Univ, Hanoi, Vietnam
[4] Japan Adv Inst Sci & Technol, Nomi, Japan
关键词
Natural language processing; Machine translation; Phrase-based statistical machine translation;
D O I
10.1007/978-3-319-75487-1_7
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Reordering is of essential importance problem for phrase based statistical machine translation (SMT). In this paper, we propose an approach to automatically learn reordering rules as preprocessing step based on a dependency parser in phrase-based statistical machine translation for English to Vietnamese. We used dependency parsing and rules extracting from training the features-rich discriminative classifiers for reordering source-side sentences. We evaluated our approach on English-Vietnamese machine translation tasks, and showed that it outperform the baseline phrase-based SMT system.
引用
收藏
页码:74 / 87
页数:14
相关论文
共 50 条
  • [21] A Vietnamese-English Neural Machine Translation System
    Thien Hai Nguyen
    Nguyen, Tuan-Duy H.
    Duy Phung
    Duy Tran-Cong Nguyen
    Hieu Minh Tran
    Manh Luong
    Tin Duy Vo
    Hung Hai Bui
    Dinh Phung
    Dat Quoc Nguyen
    INTERSPEECH 2022, 2022, : 5543 - 5544
  • [22] Using Classifier-Based Nominal Imputation to Improve Machine Learning
    Su, Xiaoyuan
    Greiner, Russell
    Khoshgoftaar, Taghi M.
    Napolitano, Amri
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PT I: 15TH PACIFIC-ASIA CONFERENCE, PAKDD 2011, 2011, 6634 : 124 - 135
  • [23] English to Bodo Phrase-Based Statistical Machine Translation
    Islam, Md Saiful
    Purkayastha, Bipul Syam
    ADVANCED COMPUTING AND COMMUNICATION TECHNOLOGIES, 2018, 562 : 207 - 217
  • [24] English - Afaan Oromoo Machine Translation: An Experiment Using a Statistical Approach
    Adugna, Sisay
    Eisele, Andreas
    LREC 2010 - SEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2010,
  • [25] Machine Translation Approach for Vietnamese Diacritic Restoration
    Thi Ngoc Diep Do
    Duy Binh Nguyen
    Dang Khoa Mac
    Do Dat Tran
    2013 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP 2013), 2013, : 103 - 106
  • [26] Theoretical based approach to English to Sinhala machine translation
    Hettige, B.
    Karunananda, A. S.
    2009 INTERNATIONAL CONFERENCE ON INDUSTRIAL AND INFORMATION SYSTEMS, 2009, : 380 - 385
  • [27] A CLASSIFIER-BASED DECODING APPROACH FOR LARGE SCALE DISTRIBUTED CODING
    Viswanatha, Kumar
    Ramaswamy, Sharadh
    Saxena, Ankur
    Rose, Kenneth
    2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 1513 - 1516
  • [28] Integrating Pronunciation into Chinese-Vietnamese Statistical Machine Translation
    Anh Tran Huu
    Huang, Heyan
    Guo, Yuhang
    Shi, Shumin
    Jian, Ping
    TSINGHUA SCIENCE AND TECHNOLOGY, 2018, 23 (06) : 715 - 723
  • [29] Integrating Pronunciation into Chinese-Vietnamese Statistical Machine Translation
    Anh Tran Huu
    Heyan Huang
    Yuhang Guo
    Shumin Shi
    Ping Jian
    Tsinghua Science and Technology, 2018, 23 (06) : 715 - 723
  • [30] Classifier-Based Pattern Selection Approach for Relation Instance Extraction
    Mandya, Angrosh
    Bollegala, Danushka
    Coenen, Frans
    Atkinson, Katie
    COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING (CICLING 2017), PT I, 2018, 10761 : 418 - 434