Using k-Way Co-Occurrences for Learning Word Embeddings

Cited by: 0
Authors
Bollegala, Danushka [1]
Yoshida, Yuichi [2]
Kawarabayashi, Ken-ichi [2, 3]
Affiliations
[1] Univ Liverpool, Liverpool L69 3BX, Merseyside, England
[2] Natl Inst Informat, Chiyoda Ku, 2-1-2 Hitotsubashi, Tokyo 1018430, Japan
[3] Japan Sci & Technol Agcy, ERATO, Kawarabayashi Large Graph Project, Kawaguchi, Saitama, Japan
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Co-occurrences between two words provide useful insights into the semantics of those words. Consequently, numerous prior works on word embedding learning have used co-occurrences between two words as the training signal for learning word embeddings. However, in natural language texts it is common for multiple words to be related and to co-occur in the same context. We extend the notion of co-occurrences to cover k-way (k ≥ 2) co-occurrences among a set of k words. Specifically, we prove a theoretical relationship between the joint probability of k (≥ 2) words and the sum of ℓ2 norms of their embeddings. Next, we propose a learning objective, motivated by our theoretical result, that utilises k-way co-occurrences for learning word embeddings. Our experimental results show that the derived theoretical relationship does indeed hold empirically and that, despite data sparsity, for some smaller values of k (≤ 5), k-way embeddings perform comparably to or better than 2-way embeddings in a range of tasks.
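To make the notion of a k-way co-occurrence concrete, the following is a minimal Python sketch that counts how often each unordered set of k distinct words appears together in the same context. It is an illustration only, not the authors' implementation: treating each sentence as one context, the function name count_k_way, and the toy corpus are all assumptions made here, since the abstract does not specify how contexts are delimited.

```python
from collections import Counter
from itertools import combinations


def count_k_way(sentences, k):
    """Count unordered k-word sets that co-occur within a sentence.

    Assumption: each sentence is one context, and a k-way co-occurrence
    is any set of k distinct words found in that context.
    """
    counts = Counter()
    for sentence in sentences:
        # Sort the distinct words so each set has one canonical tuple.
        vocab = sorted(set(sentence))
        counts.update(combinations(vocab, k))
    return counts


# Hypothetical toy corpus, for illustration only.
corpus = [
    ["cat", "sat", "mat"],
    ["cat", "sat"],
]

pair_counts = count_k_way(corpus, 2)    # ordinary 2-way co-occurrences
triple_counts = count_k_way(corpus, 3)  # 3-way co-occurrences
```

The number of candidate sets in a context of n distinct words grows as n choose k while each individual set is observed less often, which is one way to see the data sparsity that the abstract mentions for larger k.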
Pages: 5037 - 5044
Page count: 8
Related papers
50 in total
  • [21] Large-scale Learning of Sign Language by Watching TV (Using Co-occurrences)
    Pfister, Tomas
    Charles, James
    Zisserman, Andrew
    PROCEEDINGS OF THE BRITISH MACHINE VISION CONFERENCE 2013, 2013
  • [22] Mapping research topics using word-reference co-occurrences: A method and an exploratory case study
    Van Den Besselaar, Peter
    Heimeriks, Gaston
    SCIENTOMETRICS, 2006, 68 (03) : 377 - 393
  • [23] Mapping research topics using word-reference co-occurrences: A method and an exploratory case study
    Peter van den Besselaar
    Gaston Heimeriks
    Scientometrics, 2006, 68 : 377 - 393
  • [25] A new measure for query disambiguation using term co-occurrences
    Wakaki, Hiromi
    Masada, Tomonari
    Takasu, Atsuhiro
    Adachi, Jun
    INTELLIGENT DATA ENGINEERING AND AUTOMATED LEARNING - IDEAL 2006, PROCEEDINGS, 2006, 4224 : 904 - 911
  • [26] Performance of information retrieval models using term co-occurrences
    Desjardins, G.
    Godin, R.
    Proulx, R.
    DATA MINING VIII: DATA, TEXT AND WEB MINING AND THEIR BUSINESS APPLICATIONS, 2007, 38 : 183 - +
  • [27] The Relation Dimension in the Identification and Classification of Lexically Restricted Word Co-Occurrences in Text Corpora
    Shvets, Alexander
    Wanner, Leo
    MATHEMATICS, 2022, 10 (20)
  • [28] Maintenance of self-consistency of coding tables by statistical analysis of word co-occurrences
    Surjan, G
    Heja, G
    MEDICAL INFORMATICS EUROPE '99, 1999, 68 : 887 - 890
  • [29] Song Clustering Using Peer-to-Peer Co-occurrences
    Shavitt, Yuval
    Weinsberg, Udi
    2009 11TH IEEE INTERNATIONAL SYMPOSIUM ON MULTIMEDIA (ISM 2009), 2009, : 471 - 476
  • [30] Co-occurrences of EDCs and PPCPs in Surface Water Using Chemometrics
    Hagemann, Mark
    Park, Minji
    Srinivasan, Varun
    Reckhow, David A.
    Lavine, Michael
    Rosenfeldt, Erik
    Stanford, Benjamin D.
    Park, Mi-Hyun
    JOURNAL AMERICAN WATER WORKS ASSOCIATION, 2016, 108 (04) : E205 - E220