Using k-Way Co-Occurrences for Learning Word Embeddings

被引:0
|
作者
Bollegala, Danushka [1 ]
Yoshida, Yuichi [2 ]
Kawarabayashi, Ken-ichi [2 ,3 ]
机构
[1] Univ Liverpool, Liverpool L69 3BX, Merseyside, England
[2] Natl Inst Informat, Chiyoda Ku, 2-1-2 Hitotsubashi, Tokyo 1018430, Japan
[3] Japan Sci & Technol Agcy, ERATO, Kawarabayashi Large Graph Project, Kawaguchi, Saitama, Japan
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Co-occurrences between two words provide useful insights into the semantics of those words. Consequently, numerous prior work on word embedding learning has used co-occurrences between two words as the training signal for learning word embeddings. Flowever, in natural language texts it is common for multiple words to be related and cooccurring in the same context. We extend the notion of co-occurrences to cover k(>= 2)-way co-occurrences among a set of k-words. Specifically, we prove a theoretical relationship between the joint probability of k(>= 2) words, and the sum of l(2) norms of their embeddings. Next, we propose a learning objective motivated by our theoretical result that utilises k-way Co-occurrences for learning word embeddings. Our experimental results show that the derived theoretical relationship does indeed hold empirically, and despite data sparsity, for some smaller k(<= 5) values, k-way embeddings perform comparably or better than 2-way embeddings in a range of tasks.
引用
收藏
页码:5037 / 5044
页数:8
相关论文
共 50 条
  • [31] Detecting Possibly Incorrect Medication Using an Unlikely Co-occurrences Measure
    Jacob, Ionut Emil
    Jimoh, Hameed
    Al Mamun, Abdullah
    PROCEEDINGS OF THE 9TH INTERNATIONAL CONFERENCE ON ELECTRONICS, COMPUTERS AND ARTIFICIAL INTELLIGENCE - ECAI 2017, 2017,
  • [32] An improved method for the identification of areas of endemism using species co-occurrences
    Giokas, Sinos
    Sfenthourakis, Spyros
    JOURNAL OF BIOGEOGRAPHY, 2008, 35 (05) : 893 - 902
  • [33] Clustering Word Co-Occurrences with Color Keywords based on Twitter Feeds in Japanese and German Culture
    Marutschke, Daniel Moritz
    Krysanova, Sasha
    Ogawa, Hitoshi
    2015 INTERNATIONAL CONFERENCE ON CULTURE AND COMPUTING (CULTURE COMPUTING), 2015, : 191 - 192
  • [34] Using codispersion analysis to characterize spatial patterns in species co-occurrences
    Buckley, Hannah L.
    Case, Bradley S.
    Ellison, Aaron M.
    ECOLOGY, 2016, 97 (01) : 32 - 39
  • [35] Evaluation of Deep Species Distribution Models Using Environment and Co-occurrences
    Deneu, Benjamin
    Servajean, Maximilien
    Botella, Christophe
    Joly, Alexis
    EXPERIMENTAL IR MEETS MULTILINGUALITY, MULTIMODALITY, AND INTERACTION (CLEF 2019), 2019, 11696 : 213 - 225
  • [36] RELATIONSHIP BETWEEN CITATION INDEXING AND WORD INDEXING - STUDY OF CO-OCCURRENCES OF TITLE WORDS AND CITED REFERENCES
    SMALL, HG
    PROCEEDINGS OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE, 1973, 10 : 217 - 218
  • [37] Initializing the VA medication reference terminology using UMLS Metathesaurus co-occurrences
    Carter, JS
    Brown, SH
    Erlbaum, MS
    Gregg, W
    Elkin, PL
    Speroff, T
    Tuttle, MS
    AMIA 2002 SYMPOSIUM, PROCEEDINGS: BIOMEDICAL INFORMATICS: ONE DISCIPLINE, 2002, : 116 - 120
  • [38] Where Divergent Ideas Converge: Answers to AUT Found on Short List of Word Co-Occurrences Terms
    Klein, Ariel
    Badia, Toni
    CREATIVITY RESEARCH JOURNAL, 2024, 36 (01) : 138 - 154
  • [39] Bayesian Multi-label Learning with Sparse Features and Labels, and Label Co-occurrences
    Zhao, He
    Raiy, Piyush
    Du, Lan
    Buntine, Wray
    INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 84, 2018, 84
  • [40] Learning Hidden Markov Models from Pairwise Co-occurrences with Application to Topic Modeling
    Huang, Kejun
    Fu, Xiao
    Sidiropoulos, Nicholas D.
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 80, 2018, 80