Unsupervised Multilingual Alignment using Wasserstein Barycenter

被引:0
|
作者
Lian, Xin [1 ,3 ]
Jain, Kshitij [2 ]
Truszkowski, Jakub [2 ]
Poupart, Pascal [1 ,2 ,3 ]
Yu, Yaoliang [1 ,3 ]
机构
[1] Univ Waterloo, Waterloo, ON, Canada
[2] Borealis AI, Waterloo, ON, Canada
[3] Vector Inst, Toronto, ON, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We study unsupervised multilingual alignment, the problem of finding word-to-word translations between multiple languages without using any parallel data. One popular strategy is to reduce multilingual alignment to the much simplified bilingual setting, by picking one of the input languages as the pivot language that we transit through. However, it is well-known that transiting through a poorly chosen pivot language (such as English) may severely degrade the translation quality, since the assumed transitive relations among all pairs of languages may not be enforced in the training process. Instead of going through a rather arbitrarily chosen pivot language, we propose to use the Wasserstein barycenter as a more informative "mean" language: it encapsulates information from all languages and minimizes all pairwise transportation costs. We evaluate our method on standard benchmarks and demonstrate state-of-the-art performances.
引用
收藏
页码:3702 / 3708
页数:7
相关论文
共 50 条
  • [1] Unsupervised Alignment of Embeddings with Wasserstein Procrustes
    Grave, Edouard
    Joulin, Armand
    Berthet, Quentin
    22ND INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 89, 2019, 89
  • [2] Unsupervised Graph Alignment with Wasserstein Distance Discriminator
    Gao, Ji
    Huang, Xiao
    Li, Jundong
    KDD '21: PROCEEDINGS OF THE 27TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2021, : 426 - 435
  • [3] Dimensionality Reduction for Wasserstein Barycenter
    Izzo, Zachary
    Silwal, Sandeep
    Zhou, Samson
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [4] A CANONICAL BARYCENTER VIA WASSERSTEIN REGULARIZATION
    Kim, Young-Heon
    Pass, Brendan
    SIAM JOURNAL ON MATHEMATICAL ANALYSIS, 2018, 50 (02) : 1817 - 1828
  • [5] On Robust Wasserstein Barycenter: The Model and Algorithm
    Wang, Xu
    Huang, Jiawei
    Yang, Qingyuan
    Zhang, Jinpeng
    PROCEEDINGS OF THE 2024 SIAM INTERNATIONAL CONFERENCE ON DATA MINING, SDM, 2024, : 235 - 243
  • [6] Barycenter in Wasserstein Spaces: Existence and Consistency
    Le Gouic, Thibaut
    Loubes, Jean-Michel
    GEOMETRIC SCIENCE OF INFORMATION, GSI 2015, 2015, 9389 : 104 - 108
  • [7] Bilingual alignment transfers to multilingual alignment for unsupervised parallel text mining
    Tien, Chih-chan
    Steinert-Threlkeld, Shane
    PROCEEDINGS OF THE 60TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), VOL 1: (LONG PAPERS), 2022, : 8696 - 8706
  • [8] WASSERSTEIN BARYCENTER TRANSPORT FOR ACOUSTIC ADAPTATION
    Montesuma, Eduardo F.
    Mboula, Fred-Maurice Ngole
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 3405 - 3409
  • [9] Wasserstein Iterative Networks for Barycenter Estimation
    Korotin, Alexander
    Egiazarian, Vage
    Li, Lingxiao
    Burnaev, Evgeny
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [10] HYPERSPECTRAL AND MULTISPECTRAL WASSERSTEIN BARYCENTER FOR IMAGE FUSION
    Mifdal, Jamila
    Coll, Bartomeu
    Courty, Nicolas
    Froment, Jacques
    Vedel, Beatrice
    2017 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS), 2017, : 3373 - 3376