Unsupervised Multilingual Alignment using Wasserstein Barycenter

被引:0
|
作者
Lian, Xin [1 ,3 ]
Jain, Kshitij [2 ]
Truszkowski, Jakub [2 ]
Poupart, Pascal [1 ,2 ,3 ]
Yu, Yaoliang [1 ,3 ]
机构
[1] Univ Waterloo, Waterloo, ON, Canada
[2] Borealis AI, Waterloo, ON, Canada
[3] Vector Inst, Toronto, ON, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We study unsupervised multilingual alignment, the problem of finding word-to-word translations between multiple languages without using any parallel data. One popular strategy is to reduce multilingual alignment to the much simplified bilingual setting, by picking one of the input languages as the pivot language that we transit through. However, it is well-known that transiting through a poorly chosen pivot language (such as English) may severely degrade the translation quality, since the assumed transitive relations among all pairs of languages may not be enforced in the training process. Instead of going through a rather arbitrarily chosen pivot language, we propose to use the Wasserstein barycenter as a more informative "mean" language: it encapsulates information from all languages and minimizes all pairwise transportation costs. We evaluate our method on standard benchmarks and demonstrate state-of-the-art performances.
引用
收藏
页码:3702 / 3708
页数:7
相关论文
共 50 条
  • [41] Geometric mean flows and the Cartan barycenter on the Wasserstein space over positive definite matrices
    Hiai, Fumio
    Lim, Yongdo
    LINEAR ALGEBRA AND ITS APPLICATIONS, 2017, 533 : 118 - 131
  • [42] Solving Soft Clustering Ensemble via k-Sparse Discrete Wasserstein Barycenter
    Qin, Ruizhe
    Li, Mengying
    Ding, Hu
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021,
  • [43] Multilingual Unsupervised Dependency Parsing with Unsupervised POS Tags
    Marecek, David
    ADVANCES IN ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING, MICAI 2015, PT I, 2015, 9413 : 72 - 82
  • [44] Wasserstein barycenter regression: application to the joint dynamics of regional GDP and life expectancy in Italy
    Levantesi, Susanna
    Nigri, Andrea
    Pagnottoni, Paolo
    Spelta, Alessandro
    ASTA-ADVANCES IN STATISTICAL ANALYSIS, 2024,
  • [45] Wasserstein-Based Graph Alignment
    Maretic, Hermina Petric
    El Gheche, Mireille
    Minder, Matthias
    Chierchia, Giovanni
    Frossard, Pascal
    IEEE TRANSACTIONS ON SIGNAL AND INFORMATION PROCESSING OVER NETWORKS, 2022, 8 : 353 - 363
  • [46] Wasserstein barycenter regression for estimating the joint dynamics of renewable and fossil fuel energy indices
    De Giuli, Maria Elena
    Spelta, Alessandro
    COMPUTATIONAL MANAGEMENT SCIENCE, 2023, 20 (01)
  • [47] Upper and lower risk bounds for estimating the Wasserstein barycenter of random measures on the real line
    Bigot, Jeremie
    Gouet, Raul
    Klein, Thierry
    Lopez, Alfredo
    ELECTRONIC JOURNAL OF STATISTICS, 2018, 12 (02): : 2253 - 2289
  • [48] Spatial-aware Network using Wasserstein Distance for Unsupervised Domain Adaptation
    Long, Liu
    Bin, Luo
    Jiang, Fan
    2020 CHINESE AUTOMATION CONGRESS (CAC 2020), 2020, : 4591 - 4594
  • [49] Sliced Wasserstein Discrepancy for Unsupervised Domain Adaptation
    Lee, Chen-Yu
    Batra, Tanmay
    Baig, Mohammad Haris
    Ulbricht, Daniel
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 10277 - 10287
  • [50] Learning Unsupervised Multilingual Word Embeddings with Incremental Multilingual Hubs
    Heyman, Geert
    Verreet, Bregt
    Vulic, Ivan
    Moens, Marie-Francine
    2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, 2019, : 1890 - 1902