Unsupervised Multilingual Alignment using Wasserstein Barycenter

被引:0
|
作者
Lian, Xin [1 ,3 ]
Jain, Kshitij [2 ]
Truszkowski, Jakub [2 ]
Poupart, Pascal [1 ,2 ,3 ]
Yu, Yaoliang [1 ,3 ]
机构
[1] Univ Waterloo, Waterloo, ON, Canada
[2] Borealis AI, Waterloo, ON, Canada
[3] Vector Inst, Toronto, ON, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We study unsupervised multilingual alignment, the problem of finding word-to-word translations between multiple languages without using any parallel data. One popular strategy is to reduce multilingual alignment to the much simplified bilingual setting, by picking one of the input languages as the pivot language that we transit through. However, it is well-known that transiting through a poorly chosen pivot language (such as English) may severely degrade the translation quality, since the assumed transitive relations among all pairs of languages may not be enforced in the training process. Instead of going through a rather arbitrarily chosen pivot language, we propose to use the Wasserstein barycenter as a more informative "mean" language: it encapsulates information from all languages and minimizes all pairwise transportation costs. We evaluate our method on standard benchmarks and demonstrate state-of-the-art performances.
引用
收藏
页码:3702 / 3708
页数:7
相关论文
共 50 条
  • [31] An inexact PAM method for computing Wasserstein barycenter with unknown supports
    Yitian Qian
    Shaohua Pan
    Computational and Applied Mathematics, 2021, 40
  • [32] Stochastic saddle-point optimization for the Wasserstein barycenter problem
    Daniil Tiapkin
    Alexander Gasnikov
    Pavel Dvurechensky
    Optimization Letters, 2022, 16 : 2145 - 2175
  • [33] Wasserstein Unsupervised Reinforcement Learning
    He, Shuncheng
    Jiang, Yuhang
    Zhang, Hongchang
    Shao, Jianzhun
    Ji, Xiangyang
    THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 6884 - 6892
  • [34] Scalable Computations of Wasserstein Barycenter via Input Convex Neural Networks
    Fan, Jiaojiao
    Taghvaei, Amirhossein
    Chen, Yongxin
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
  • [35] BRULE: Barycenter-Regularized Unsupervised Landmark Extraction
    Bespalov, Iaroslav
    Buzun, Nazar
    Dylov, Dmitry, V
    PATTERN RECOGNITION, 2022, 131
  • [36] Understanding the complexities of the fine structure of interest rates: a Wasserstein barycenter learning approach
    Mari, Carlo
    Baldassari, Cristiano
    Neural Computing and Applications, 2024, 36 (31) : 19291 - 19305
  • [37] Interior-point Methods Strike Back: Solving the Wasserstein Barycenter Problem
    Ge, Dongdong
    Wang, Haoyue
    Xiong, Zikai
    Ye, Yinyu
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [38] Wasserstein Barycenter Matching for Graph Size Generalization of Message Passing Neural Networks
    Chu, Xu
    Jin, Yujie
    Wang, Xin
    Zhang, Shanghang
    Wang, Yasha
    Zhu, Wenwu
    Mei, Hong
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 202, 2023, 202
  • [39] A Riemannian submersion-based approach to the Wasserstein barycenter of positive definite matrices
    Li, Mingming
    Sun, Huafei
    Li, Didong
    MATHEMATICAL METHODS IN THE APPLIED SCIENCES, 2020, 43 (07) : 4927 - 4939
  • [40] Unsupervised Multilingual Word Embeddings
    Chen, Xilun
    Cardie, Claire
    2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018), 2018, : 261 - 270