Improving bilingual word embeddings mapping with monolingual context information

被引:1
|
作者
Zhu, Shaolin [2 ]
Mi, Chenggang [1 ]
Li, Tianqi [2 ]
Zhang, Fuhua [2 ]
Zhang, Zhifeng [2 ]
Sun, Yu [2 ]
机构
[1] Northwestern Polytech Univ, Xian 710129, Peoples R China
[2] Zhengzhou Univ Light Ind, Zhengzhou 450002, Peoples R China
基金
中国国家自然科学基金;
关键词
Bilingual word embeddings; Low-resource; Unsupervised emthod;
D O I
10.1007/s10590-021-09274-0
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Bilingual word embeddings (BWEs) play a very important role in many natural language processing (NLP) tasks, especially cross-lingual tasks such as machine translation (MT) and cross-language information retrieval. Most existing methods to train BWEs are based on bilingual supervision. However, bilingual resources are not available for many low-resource language pairs. Although some studies addressed this issue with unsupervised methods, monolingual contextual data are not used to improve the performance of low-resource BWEs. To address these issues, we propose an unsupervised method to improve BWEs using optimized monolingual context information without any parallel corpora. In particular, we first build a bilingual word embeddings mapping model between two languages by aligning monolingual word embedding spaces based on unsupervised adversarial training. To further improve the performance of these mappings, we use monolingual context information to optimize them during the course. Experimental results show that our method outperforms other baseline systems significantly, including results for four low-resource language pairs.
引用
收藏
页码:503 / 518
页数:16
相关论文
共 50 条
  • [41] Preschool English teachers gaining bilingual competencies in a monolingual context
    Dikilitas, Kenan
    Mumford, Simon E.
    [J]. SYSTEM, 2020, 91
  • [42] Words in context: Compensation for phonological assimilation in monolingual and bilingual toddlers
    Singh, Leher
    Cheng, Qiqi
    [J]. FIRST LANGUAGE, 2023, 43 (04) : 407 - 430
  • [43] Novel Word Learning in Children Who Are Bilingual: Comparison to Monolingual Peers
    Alt, Mary
    Arizmendi, Genesis Dominique
    Gray, Shelley
    Hogan, Tiffany Patrice
    Green, Samuel
    Cowan, Nelson
    [J]. JOURNAL OF SPEECH LANGUAGE AND HEARING RESEARCH, 2019, 62 (07): : 2332 - 2360
  • [44] Phoneme awareness, vocabulary and word decoding in monolingual and bilingual Dutch children
    Janssen, Marije
    Bosman, Anna M. T.
    Leseman, Paul P. M.
    [J]. JOURNAL OF RESEARCH IN READING, 2013, 36 (01) : 1 - 13
  • [45] Bilingual word recognition in a sentence context
    Van Assche, Eva
    Duyck, Wouter
    Hartsuiker, Robert J.
    [J]. FRONTIERS IN PSYCHOLOGY, 2012, 3
  • [46] Effect of speaker certainty on novel word learning in monolingual and bilingual children
    Buac, Milijana
    Tauzin-Larche, Aurelie
    Weisberg, Emily
    Kaushanskaya, Margarita
    [J]. BILINGUALISM-LANGUAGE AND COGNITION, 2019, 22 (04) : 883 - 895
  • [47] Functional neuroanatomy of English word reading in early bilingual and monolingual adults
    Brignoni-Perez, Edith
    Jamal, Nasheed, I
    Eden, Guinevere F.
    [J]. HUMAN BRAIN MAPPING, 2022, 43 (14) : 4310 - 4325
  • [48] The Role of Audiovisual Speech in Fast-Mapping and Novel Word Retention in Monolingual and Bilingual 24-Month-Olds
    Weatherhead, Drew
    Arredondo, Maria M.
    Nacar Garcia, Loreto
    Werker, Janet F.
    [J]. BRAIN SCIENCES, 2021, 11 (01) : 1 - 17
  • [49] Enhancing Semantic Representations of Bilingual Word Embeddings with Syntactic Dependencies
    Xu, Linli
    Ouyang, Wenjun
    Ren, Xiaoying
    Wang, Yang
    Jiang, Liang
    [J]. PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2018, : 4517 - 4524
  • [50] Applications of Tf-idf Concept to Improve Monolingual and Cross-Language Information Retrieval based on Word Embeddings
    Sari, Syandra
    Adriani, Mirna
    [J]. PROCEEDINGS OF THE 1ST INTERNATIONAL CONFERENCE ON ADVANCED INFORMATION SCIENCE AND SYSTEM, AISS 2019, 2019,