Improving bilingual word embeddings mapping with monolingual context information

Cited by: 1
Authors
Zhu, Shaolin [2]
Mi, Chenggang [1]
Li, Tianqi [2]
Zhang, Fuhua [2]
Zhang, Zhifeng [2]
Sun, Yu [2]
Affiliations
[1] Northwestern Polytechnical University, Xi'an 710129, People's Republic of China
[2] Zhengzhou University of Light Industry, Zhengzhou 450002, People's Republic of China
Funding
National Natural Science Foundation of China
Keywords
Bilingual word embeddings; Low-resource; Unsupervised method
DOI
10.1007/s10590-021-09274-0
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Bilingual word embeddings (BWEs) play an important role in many natural language processing (NLP) tasks, especially cross-lingual tasks such as machine translation (MT) and cross-language information retrieval. Most existing methods for training BWEs rely on bilingual supervision. However, bilingual resources are not available for many low-resource language pairs. Although some studies have addressed this issue with unsupervised methods, they do not use monolingual contextual data to improve the performance of low-resource BWEs. To address these issues, we propose an unsupervised method that improves BWEs using optimized monolingual context information, without any parallel corpora. In particular, we first build a bilingual word embedding mapping model between two languages by aligning their monolingual word embedding spaces through unsupervised adversarial training. To further improve the quality of these mappings, we use monolingual context information to optimize them during training. Experimental results show that our method significantly outperforms the baseline systems, including on four low-resource language pairs.
Pages: 503-518
Page count: 16
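The abstract describes aligning two monolingual embedding spaces with a learned mapping. As a rough illustration of what such a mapping does once it exists, the sketch below applies a linear map W to source-language vectors and looks up the nearest target-language vector by cosine similarity. It does not implement the paper's adversarial training or its monolingual-context optimization; instead, W is fitted with orthogonal Procrustes on known vector pairs, a commonly used closed-form alignment that is swapped in here purely for illustration. The function names (`normalize_rows`, `procrustes`, `translate`) and the synthetic data are assumptions, not taken from the paper.

```python
import numpy as np


def normalize_rows(m):
    """Scale every row vector to unit length."""
    return m / np.linalg.norm(m, axis=1, keepdims=True)


def procrustes(src_vecs, tgt_vecs):
    """Orthogonal W minimizing ||src_vecs @ W.T - tgt_vecs||_F.

    Illustrative stand-in: the paper learns its mapping without paired
    data, whereas this closed form needs row-aligned pairs.
    """
    u, _, vt = np.linalg.svd(tgt_vecs.T @ src_vecs)
    return u @ vt


def translate(src_vecs, tgt_vecs, W):
    """Index of the nearest target vector (cosine) for each mapped source vector."""
    mapped = normalize_rows(src_vecs @ W.T)
    targets = normalize_rows(tgt_vecs)
    return np.argmax(mapped @ targets.T, axis=1)


if __name__ == "__main__":
    # Synthetic data: the "target" space is a rotated copy of the "source" space.
    rng = np.random.default_rng(0)
    src = rng.normal(size=(50, 8))
    rotation, _ = np.linalg.qr(rng.normal(size=(8, 8)))
    tgt = src @ rotation.T

    W = procrustes(src, tgt)
    print(translate(src, tgt, W)[:5])
```

On real embeddings the two spaces are only approximately related by a rotation, so retrieval of this kind is noisy; that gap is where a method such as the one described in the abstract would aim to help.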