N-gram-based machine translation

被引:85
|
作者
Marino, Jose B. [1 ]
Banchs, Rafael E. [1 ]
Crego, Josep M. [1 ]
de Gispert, Adria [1 ]
Lambert, Patrik [1 ]
Fonollosa, Jose A. R. [1 ]
Costa-jussa, Marta R. [1 ]
机构
[1] Univ Politecn Cataluna, Dept Signal Theory & Commun, ES-08034 Barcelona, Spain
关键词
D O I
10.1162/coli.2006.32.4.527
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This article describes in detail an n-gram approach to statistical machine translation. This approach consists of a log-linear combination of a translation model based on n-grams of bilingual units, which are referred to as tuples, along with four specific feature functions. Translation performance, which happens to be in the state of the art, is demonstrated with Spanish-to-English and English-to-Spanish translations of the European Parliament Plenary Sessions (EPPS).
引用
收藏
页码:527 / 549
页数:23
相关论文
共 50 条
  • [1] Comparison and system combination of n-gram-based and syntax-based machine translation systems
    Khalilov, Maxim
    Fonollosa, Jose A. R.
    [J]. PROCESAMIENTO DEL LENGUAJE NATURAL, 2008, (41): : 259 - 266
  • [2] The Operation Sequence ModelCombining N-Gram-Based and Phrase-Based Statistical Machine Translation
    Durrani, Nadir
    Schmid, Helmut
    Fraser, Alexander
    Koehn, Philipp
    Schuetze, Hinrich
    [J]. COMPUTATIONAL LINGUISTICS, 2015, 41 (02) : 185 - 214
  • [3] n-Gram-Based Text Compression
    Nguyen, Vu H.
    Nguyen, Hien T.
    Duong, Hieu N.
    Snasel, Vaclav
    [J]. COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2016, 2016
  • [4] n-gram-based approach to composer recognition
    Wolkowicz, Jacek
    Kulka, Zbigniew
    Keselj, Vlado
    [J]. ARCHIVES OF ACOUSTICS, 2008, 33 (01) : 43 - 55
  • [5] Reordering experiments for N-gram-based SMT
    Crego, Josep M.
    Marino, Jose B.
    [J]. 2006 IEEE SPOKEN LANGUAGE TECHNOLOGY WORKSHOP, 2006, : 242 - +
  • [6] n-Gram-based indexing for Korean text retrieval
    Lee, JH
    Cho, HY
    Park, HR
    [J]. INFORMATION PROCESSING & MANAGEMENT, 1999, 35 (04) : 427 - 441
  • [7] N-gram-based detection of new malicious code
    Abou-Assaleh, T
    Cercone, N
    Keselj, V
    Sweidan, R
    [J]. PROCEEDINGS OF THE 28TH ANNUAL INTERNATIONAL COMPUTER SOFTWARE AND APPLICATION CONFERENCE, WORKSHOP AND FAST ABSTRACTS, 2004, : 41 - 42
  • [8] Generation, implementation, and appraisal of an N-gram-based stemming algorithm
    Pande, Bhagwati P.
    Tamta, Pawan
    Dhami, Hoshiyar S.
    [J]. DIGITAL SCHOLARSHIP IN THE HUMANITIES, 2019, 34 (03) : 558 - 568
  • [9] Research on N-Gram-Based Mongolian Information Retrieval Unit
    Yue Jun-ying
    Gao Guang-lai
    Lin Min
    [J]. PROCEEDINGS OF THE 2015 INTERNATIONAL CONFERENCE ON ELECTROMECHANICAL CONTROL TECHNOLOGY AND TRANSPORTATION, 2015, 41 : 439 - 445
  • [10] n-Gram-based classification and unsupervised hierarchical clustering of genome sequences
    Tomovic, A
    Janicic, P
    Keselj, V
    [J]. COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2006, 81 (02) : 137 - 153