Unsupervised Bilingual Lexicon Induction via Latent Variable Models

被引:0
|
作者
Dou, Zi-Yi [1 ]
Zhou, Zhi-Hao [1 ]
Huang, Shujian [1 ]
机构
[1] Nanjing Univ, State Key Lab Novel Software Technol, Nanjing, Peoples R China
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Bilingual lexicon extraction has been studied for decades and most previous methods have relied on parallel corpora or bilingual dictionaries. Recent studies have shown that it is possible to build a bilingual dictionary by aligning monolingual word embedding spaces in an unsupervised way. With the recent advances in generative models, we propose a novel approach which builds cross-lingual dictionaries via latent variable models and adversarial training with no parallel corpora. To demonstrate the effectiveness of our approach, we evaluate our approach on several language pairs and the experimental results show that our model could achieve competitive and even superior performance compared with several state-of-the-art models.
引用
收藏
页码:621 / 626
页数:6
相关论文
共 50 条
  • [1] A Discriminative Latent-Variable Model for Bilingual Lexicon Induction
    Ruder, Sebastian
    Cotterell, Ryan
    Kementchedjhieva, Yova
    Sogaard, Anders
    [J]. 2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018), 2018, : 458 - 468
  • [2] A Bilingual Adversarial Autoencoder for Unsupervised Bilingual Lexicon Induction
    Bai, Xuefeng
    Cao, Hailong
    Chen, Kehai
    Zhao, Tiejun
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2019, 27 (10) : 1639 - 1648
  • [3] Bilingual Lexicon Induction via Unsupervised Bitext Construction and Word Alignment
    Shi, Haoyue
    Zettlemoyer, Luke
    Wang, Sida, I
    [J]. 59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING, VOL 1 (ACL-IJCNLP 2021), 2021, : 813 - 826
  • [4] Adversarial Training for Unsupervised Bilingual Lexicon Induction
    Zhang, Meng
    Liu, Yang
    Luan, Huanbo
    Sun, Maosong
    [J]. PROCEEDINGS OF THE 55TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2017), VOL 1, 2017, : 1959 - 1970
  • [5] Bilingual word embedding fusion for robust unsupervised bilingual lexicon induction
    Cao, Hailong
    Zhao, Tiejun
    Wang, Weixuan
    Peng, Wei
    [J]. INFORMATION FUSION, 2023, 97
  • [6] Point Set Registration for Unsupervised Bilingual Lexicon Induction
    Cao, Hailong
    Zhao, Tiejun
    [J]. PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2018, : 3991 - 3997
  • [7] Bilingual Lexicon Induction through Unsupervised Machine Translation
    Artetxe, Mikel
    Labaka, Gorka
    Agirre, Eneko
    [J]. 57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), 2019, : 5002 - 5007
  • [8] Dual Word Embedding for Robust Unsupervised Bilingual Lexicon Induction
    Cao, Hailong
    Li, Liguo
    Zhu, Conghui
    Yang, Muyun
    Zhao, Tiejun
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2023, 31 : 2606 - 2615
  • [9] Unsupervised Learning with Contrastive Latent Variable Models
    Severson, Kristen A.
    Ghosh, Soumya
    Ng, Kenney
    [J]. THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 4862 - 4869
  • [10] Unsupervised Bilingual Lexicon Induction from Mono-Lingual Multimodal Data
    Chen, Shizhe
    Jin, Qin
    Hauptmann, Alexander
    [J]. THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 8207 - 8214