A Discriminative Neural Model for Cross-Lingual Word Alignment

被引:0
|
作者
Stengel-Estrin, Elias [1 ]
Su, Tzu-Ray [1 ]
Post, Matt [1 ]
Van Durme, Benjamin [1 ]
机构
[1] Johns Hopkins Univ, Baltimore, MD 21218 USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We introduce a novel discriminative word alignment model, which we integrate into a Transformer-based machine translation model. In experiments based on a small number of labeled examples (similar to 1.7K-5K sentences) we evaluate its performance intrinsically on both English-Chinese and English-Arabic alignment, where we achieve major improvements over unsupervised baselines (11-27 F1). We evaluate the model extrinsically on data projection for Chinese NER, showing that our alignments lead to higher performance when used to project NER tags from English to Chinese. Finally, we perform an ablation analysis and an annotation experiment that jointly support the utility and feasibility of future manual alignment elicitation.
引用
收藏
页码:910 / 920
页数:11
相关论文
共 50 条
  • [1] MultiMirror: Neural Cross-lingual Word Alignment for MultilingualWord Sense Disambiguation
    Procopio, Luigi
    Barba, Edoardo
    Martelli, Federico
    Navigli, Roberto
    [J]. PROCEEDINGS OF THE THIRTIETH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2021, 2021, : 3915 - 3921
  • [2] WORD SEGMENTATION THROUGH CROSS-LINGUAL WORD-TO-PHONEME ALIGNMENT
    Stahlberg, Felix
    Schlippe, Tim
    Vogel, Stephan
    Schultz, Tanja
    [J]. 2012 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY (SLT 2012), 2012, : 85 - 90
  • [3] Cross-Lingual Word Embeddings
    Søgaard, Anders
    Vulić, Ivan
    Ruder, Sebastian
    Faruqui, Manaal
    [J]. Synthesis Lectures on Human Language Technologies, 2019, 12 (02): : 1 - 132
  • [4] Cross-Lingual Word Embeddings
    Corro, Caio Filippo
    [J]. TRAITEMENT AUTOMATIQUE DES LANGUES, 2019, 60 (01): : 46 - 48
  • [5] Cross-Lingual Word Embeddings
    Agirre, Eneko
    [J]. COMPUTATIONAL LINGUISTICS, 2020, 46 (01) : 245 - 248
  • [6] Cross-Lingual Taxonomy Alignment with Bilingual Biterm Topic Model
    Wu, Tianxing
    Qi, Guilin
    Wang, Haofen
    Xu, Kang
    Cui, Xuan
    [J]. THIRTIETH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2016, : 287 - 293
  • [7] Neural topic-enhanced cross-lingual word embeddings for CLIR
    Zhou, Dong
    Qu, Wei
    Li, Lin
    Tang, Mingdong
    Yang, Aimin
    [J]. INFORMATION SCIENCES, 2022, 608 : 809 - 824
  • [8] A Variational Hierarchical Model for Neural Cross-Lingual Summarization
    Liang, Yunlong
    Meng, Fandong
    Zhou, Chulun
    Xu, Jinan
    Chen, Yufeng
    Su, Jinsong
    Zhou, Jie
    [J]. PROCEEDINGS OF THE 60TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), VOL 1: (LONG PAPERS), 2022, : 2088 - 2099
  • [9] Cross-lingual morphological inflection with explicit alignment
    Coltekin, Cagri
    [J]. 16TH SIGMORPHON WORKSHOP ON COMPUTATIONAL RESEARCH IN PHONETICS PHONOLOGY, AND MORPHOLOGY (SIGMORPHON 2019), 2019, : 71 - 79
  • [10] Cross-lingual Entity Alignment with Incidental Supervision
    Chen, Muhao
    Shi, Weijia
    Zhou, Ben
    Roth, Dan
    [J]. 16TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EACL 2021), 2021, : 645 - 658