Incorporating Linguistic Information to Statistical Word-Level Alignment

被引:0
|
作者
Cendejas, Eduardo [1 ]
Barcelo, Grettel [1 ]
Gelbukh, Alexander [1 ]
Sidorov, Grigori [1 ]
机构
[1] Natl Polytech Inst, Ctr Res Comp, Mexico City, DF, Mexico
关键词
Parallel texts; word alignment; linguistic information; dictionary; cognates; semantic domains; morphological information;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Parallel texts are enriched by alignment algorithms, thus establishing a relationship between the structures of the implied languages. Depending on the alignment level, the enrichment can be performed on paragraphs, sentences or words, of the expressed content in the source language and its translation. There are two main approaches to perform word-level alignment: statistical or linguistic. Due to the dissimilar grammar rules the languages have, the statistical algorithms usually give lower precision. That is why the development of this type of algorithms is generally aimed at a specific language pair using linguistic techniques. A hybrid alignment system based on the combination of the two traditional approaches is presented in this paper. It provides user-friendly configuration and is adaptable to the computational environment. The system uses linguistic resources and procedures such as identification of cognates, morphological information, syntactic trees, dictionaries, and semantic domains. We show that the system outperforms existing algorithms.
引用
下载
收藏
页码:387 / 394
页数:8
相关论文
共 50 条
  • [1] The where and when of linguistic word-level prosody
    Arciuli, Joanne
    Slowiaczek, Louisa M.
    NEUROPSYCHOLOGIA, 2007, 45 (11) : 2638 - 2642
  • [2] Chinese Clinical Named Entity Recognition with Word-Level Information Incorporating Dictionaries
    Lu, Ningjie
    Zheng, Jun
    Wu, Wen
    Yang, Yan
    Chen, Kaiwei
    Hu, Wenxin
    2019 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2019,
  • [3] DUSTer: A method for unraveling cross-language divergences for statistical word-level alignment
    Dorr, BJ
    Pearl, L
    Hwa, R
    Habash, N
    MACHINE TRANSLATION: FROM RESEARCH TO REAL USERS, 2002, 2499 : 31 - 43
  • [4] The neural response at the fundamental frequency of speech is modulated by word-level acoustic and linguistic information
    Kegler, Mikolaj
    Weissbart, Hugo
    Reichenbach, Tobias
    FRONTIERS IN NEUROSCIENCE, 2022, 16
  • [5] Hybrid Algorithm for Word-Level Alignment of Parallel Texts
    Cendejas, Eduardo
    Barcelo, Grettel
    Gelbukh, Alexander
    Sidorov, Grigori
    NATURAL LANGUAGE PROCESSING AND INFORMATION SYSTEMS, 2010, 5723 : 293 - 294
  • [6] Enhancing the Bilingual Concordancer TransSearch with Word-Level Alignment
    Bourdaillet, Julien
    Huet, Stephane
    Gotti, Fabrizio
    Lapalme, Guy
    Langlais, Philippe
    ADVANCES IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2009, 5549 : 27 - +
  • [7] Processing of word-level and paragraph-level information in schizotypy
    Niznikiewicz, MA
    Hun, SD
    Nestor, PG
    Dodd, C
    Shenton, ME
    McCarley, RW
    SCHIZOPHRENIA RESEARCH, 2003, 60 (01) : 256 - 257
  • [8] Estimating word-level quality of statistical machine translation output using monolingual information alone
    Tezcan, Arda
    Hoste, Veronique
    Macken, Lieve
    NATURAL LANGUAGE ENGINEERING, 2020, 26 (01) : 73 - 94
  • [9] Using Word-Level Information in Formal Hardware Verification
    R. Drechsler
    Automation and Remote Control, 2004, 65 : 963 - 977