Incorporating Linguistic Information to Statistical Word-Level Alignment

被引:0
|
作者
Cendejas, Eduardo [1 ]
Barcelo, Grettel [1 ]
Gelbukh, Alexander [1 ]
Sidorov, Grigori [1 ]
机构
[1] Natl Polytech Inst, Ctr Res Comp, Mexico City, DF, Mexico
关键词
Parallel texts; word alignment; linguistic information; dictionary; cognates; semantic domains; morphological information;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Parallel texts are enriched by alignment algorithms, thus establishing a relationship between the structures of the implied languages. Depending on the alignment level, the enrichment can be performed on paragraphs, sentences or words, of the expressed content in the source language and its translation. There are two main approaches to perform word-level alignment: statistical or linguistic. Due to the dissimilar grammar rules the languages have, the statistical algorithms usually give lower precision. That is why the development of this type of algorithms is generally aimed at a specific language pair using linguistic techniques. A hybrid alignment system based on the combination of the two traditional approaches is presented in this paper. It provides user-friendly configuration and is adaptable to the computational environment. The system uses linguistic resources and procedures such as identification of cognates, morphological information, syntactic trees, dictionaries, and semantic domains. We show that the system outperforms existing algorithms.
引用
收藏
页码:387 / 394
页数:8
相关论文
共 50 条
  • [21] Hardware Trojan Detection Acceleration Based on Word-Level Statistical Properties Management
    Li, He
    Liu, Qiang
    PROCEEDINGS OF THE 2014 INTERNATIONAL CONFERENCE ON FIELD-PROGRAMMABLE TECHNOLOGY (FPT), 2014, : 153 - 160
  • [22] Is Word-Level Recursion Actually Recursion?
    Miller, Taylor L.
    Sande, Hannah
    LANGUAGES, 2021, 6 (02)
  • [23] Lifting propositional interpolants to the word-level
    Kroening, Daniel
    Weissenbacher, Georg
    FMCAD 2007: FORMAL METHODS IN COMPUTER AIDED DESIGN, PROCEEDINGS, 2007, : 85 - 89
  • [24] Word-level Speech Recognition with a Letter to Word Encoder
    Collobert, Ronan
    Hannun, Awni
    Synnaeve, Gabriel
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 119, 2020, 119
  • [25] The Phonetics of Paiwan Word-Level Prosody
    Chen, Chun-Mei
    LANGUAGE AND LINGUISTICS, 2009, 10 (03) : 593 - 625
  • [26] WARP: Word-level Adversarial ReProgramming
    Hambardzumyan, Karen
    Khachatrian, Hrant
    May, Jonathan
    59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (ACL-IJCNLP 2021), VOL 1, 2021, : 4921 - 4933
  • [27] A word-level graph manipulation package
    Höreth S.
    International Journal on Software Tools for Technology Transfer, 2001, 3 (02) : 182 - 192
  • [28] Word-Level Symbolic Trajectory Evaluation
    Chakraborty, Supratik
    Khasidashvili, Zurab
    Seger, Carl-Johan H.
    Gajavelly, Rajkumar
    Haldankar, Tanmay
    Chhatani, Dinesh
    Mistry, Rakesh
    COMPUTER AIDED VERIFICATION, CAV 2015, PT II, 2015, 9207 : 128 - 143
  • [29] Word-level Speech Recognition with a Letter to Word Encoder
    Collobert, Ronan
    Hannun, Awni
    Synnaeve, Gabriel
    25TH AMERICAS CONFERENCE ON INFORMATION SYSTEMS (AMCIS 2019), 2019,
  • [30] Formal verification of word-level specifications
    Höreth, S
    Drechsler, R
    DESIGN, AUTOMATION AND TEST IN EUROPE CONFERENCE AND EXHIBITION 1999, PROCEEDINGS, 1999, : 52 - 58