Translation of word vectors by exploiting translatable context pairs

被引:0
|
作者
Ishiwatari S. [1 ]
Kaji N. [2 ]
Yoshinaga N. [3 ]
Toyoda M. [3 ]
Kitsuregawa M. [3 ,4 ]
机构
[1] Graduate School of Information Science and Technology, University of Tokyo
[2] Institute of Industrial Science, University of Tokyo
[3] National Institute of Informatics, University of Tokyo
来源
| 1600年 / Japanese Society for Artificial Intelligence卷 / 31期
基金
日本学术振兴会;
关键词
Bilingual dictionary induction; Cross-lingual projection; Distributional semantics; Word representation;
D O I
10.1527/tjsai.AI30-A
中图分类号
学科分类号
摘要
While vector-based representations of word meanings (word vectors) have been widely used in a variety of natural language processing applications, they are not meant for capturing the similarity between words in different languages. This prevents using word vectors in multilingual-applications such as cross-lingual information retrieval and machine translation. To solve this problem, we propose a method that learns a cross-lingual projection of word representations from one language into another. Our method utilizes translatable context pairs obtained from a bilingual dictionary and surface similarity as bonus terms of the objective function. In the experiments, we evaluated the effectiveness of the proposed method in four languages, Japanese, Chinese, English and Spanish. Experiments shows that our method outperformed existing methods without any additional supervisions. © 2016, Japanese Society for Artificial Intelligence. All rights reserved.
引用
收藏
页数:10
相关论文
共 50 条
  • [41] Exploiting multiple word embeddings and one-hot character vectors for aspect-based sentiment analysis
    Pham, Duc-Hong
    Le, Anh-Cuong
    INTERNATIONAL JOURNAL OF APPROXIMATE REASONING, 2018, 103 : 1 - 10
  • [42] A Character-based Approach to Distributional Semantic Models: Exploiting Kanji Characters for Constructing Japanese Word Vectors
    Utsumi, Akira
    LREC 2014 - NINTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2014, : 4444 - 4450
  • [43] Austen and Translation: National Characters, Translatable Heroines, and the Heroine as Translator
    Saglia, Diego
    NOVEL-A FORUM ON FICTION, 2013, 46 (01) : 73 - 92
  • [44] Exploiting mTOR Signaling: A Novel Translatable Treatment Strategy for Traumatic Optic Neuropathy?
    Morgan-Warren, Peter J.
    Berry, Martin
    Ahmed, Zubair
    Scott, Robert A. H.
    Logan, Ann
    INVESTIGATIVE OPHTHALMOLOGY & VISUAL SCIENCE, 2013, 54 (10) : 6903 - 6916
  • [45] Exploiting shared Chinese characters in Chinese word segmentation optimization for Chinese-Japanese Machine Translation
    Chu, Chenhui
    Nakazawa, Toshiaki
    Kawahara, Daisuke
    Kurohashi, Sadao
    Proceedings of the 16th Annual Conference of the European Association for Machine Translation, EAMT 2012, 2012, : 35 - 42
  • [46] Context Reinstatement Effects in Children's Cued Recall of Strongly and Weakly Associated Word Pairs
    Dietze, Paul M.
    Sharman, Stefanie J.
    Powell, Martine B.
    Thomson, Donald M.
    EDUCATIONAL AND DEVELOPMENTAL PSYCHOLOGIST, 2011, 28 (02): : 91 - 100
  • [47] Predicting word vectors for microtext
    Chaturvedi, Iti
    Satapathy, Ranjan
    Lynch, Curtis
    Cambria, Erik
    EXPERT SYSTEMS, 2024, 41 (08)
  • [48] Personalized Semantic Word Vectors
    Ebrahimi, Javid
    Dou, Dejing
    CIKM'16: PROCEEDINGS OF THE 2016 ACM CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, 2016, : 1925 - 1928
  • [49] Attenuating Bias in Word Vectors
    Dev, Sunipa
    Phillips, Jeff M.
    22ND INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 89, 2019, 89 : 879 - 887
  • [50] WORD GOODNESS AND WORD-FREQUENCY IN HOMOPHONE PAIRS
    FRINCKE, G
    JOHNSON, RC
    PSYCHOLOGICAL REPORTS, 1960, 7 (03) : 470 - 470