Using k-Way Co-Occurrences for Learning Word Embeddings

被引:0
|
作者
Bollegala, Danushka [1 ]
Yoshida, Yuichi [2 ]
Kawarabayashi, Ken-ichi [2 ,3 ]
机构
[1] Univ Liverpool, Liverpool L69 3BX, Merseyside, England
[2] Natl Inst Informat, Chiyoda Ku, 2-1-2 Hitotsubashi, Tokyo 1018430, Japan
[3] Japan Sci & Technol Agcy, ERATO, Kawarabayashi Large Graph Project, Kawaguchi, Saitama, Japan
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Co-occurrences between two words provide useful insights into the semantics of those words. Consequently, numerous prior work on word embedding learning has used co-occurrences between two words as the training signal for learning word embeddings. Flowever, in natural language texts it is common for multiple words to be related and cooccurring in the same context. We extend the notion of co-occurrences to cover k(>= 2)-way co-occurrences among a set of k-words. Specifically, we prove a theoretical relationship between the joint probability of k(>= 2) words, and the sum of l(2) norms of their embeddings. Next, we propose a learning objective motivated by our theoretical result that utilises k-way Co-occurrences for learning word embeddings. Our experimental results show that the derived theoretical relationship does indeed hold empirically, and despite data sparsity, for some smaller k(<= 5) values, k-way embeddings perform comparably or better than 2-way embeddings in a range of tasks.
引用
收藏
页码:5037 / 5044
页数:8
相关论文
共 50 条
  • [1] ViCo: Word Embeddings from Visual Co-occurrences
    Gupta, Tanmay
    Schwing, Alexander
    Hoiem, Derek
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 7424 - 7433
  • [2] Word discrimination based on bigram co-occurrences
    El-Nasan, A
    Veeramachaneni, S
    Nagy, G
    SIXTH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION, PROCEEDINGS, 2001, : 149 - 153
  • [3] Word co-occurrences as a principle of an algorithm for extraction of terminology
    Zunker, G
    Rapp, R
    COGNITIVE ASPECTS OF LANGUAGE, 1996, 360 : 293 - 298
  • [4] Visualizing Textbook Concepts: Beyond Word Co-occurrences
    Sastry, Chandramouli Shama
    Jagaluru, Darshan Siddesh
    Mahesh, Kavi
    COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING (CICLING 2017), PT I, 2018, 10761 : 363 - 376
  • [5] Retrieving collocations by co-occurrences and word order constraints
    Shimohata, S
    Sugio, T
    Nagata, J
    35TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 8TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, PROCEEDINGS OF THE CONFERENCE, 1997, : 476 - 481
  • [6] Extended Strategies for Document Clustering with Word Co-occurrences
    Wei, Yang
    Wei, Jinmao
    Yang, Zhenglu
    WEB TECHNOLOGIES AND APPLICATIONS (APWEB 2015), 2015, 9313 : 461 - 472
  • [7] Using Explicit Word Co-occurrences to Improve Term-Based Text Retrieval
    Ferilli, Stefano
    Biba, Marenglen
    Basile, Teresa M. A.
    Esposito, Floriana
    DIGITAL LIBRARIES, 2010, 91 : 125 - +
  • [8] Mapping research topics through word-reference co-occurrences
    Heimeriks, G
    Van den Besselaar, P
    ISSI 2005: Proceedings of the 10th International Conference of the International Society for Scientometrics and Informetrics, Vols 1 and 2, 2005, : 575 - 584
  • [9] Studying the Temporal Dynamics of Word Co-Occurrences: An Application to Event Detection
    Preotiuc-Pietro, Daniel
    Srijith, P. K.
    Hepple, Mark
    Cohn, Trevor
    LREC 2016 - TENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2016, : 4380 - 4387
  • [10] Video Classification using Semantic Concept Co-occurrences
    Assari, Shayan Modiri
    Zamir, Amir Roshan
    Shah, Mubarak
    2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, : 2529 - 2536