Investigating the Extent to which Distributional Semantic Models Capture a Broad Range of Semantic Relations

被引:3
|
作者
Brown, Kevin S. [1 ,2 ,8 ,9 ]
Yee, Eiling [3 ]
Joergensen, Gitte [3 ]
Troyer, Melissa [4 ]
Saltzman, Elliot [5 ]
Rueckl, Jay [3 ]
Magnuson, James S. [3 ,6 ,7 ]
McRae, Ken [4 ]
机构
[1] Oregon State Univ, Dept Pharmaceut Sci, Corvallis, OR USA
[2] Oregon State Univ, Sch Chem Biol & Environm Engn, Corvallis, OR USA
[3] Univ Connecticut, Dept Psychol Sci, Mansfield, CT USA
[4] Univ Western Ontario, Dept Psychol, London, ON, Canada
[5] Boston Univ, Dept Phys Therapy, Boston, MA USA
[6] Basque Ctr Cognit Brain & Language, BCBL, San Sebastian, Spain
[7] Basque Fdn Sci, Ikerbasque, Bilbao, Spain
[8] Oregon State Univ, Dept Pharmaceut Sci, Pharm Bldg 317, 1601 SW Jefferson Ave, Corvallis, OR 97331 USA
[9] Oregon State Univ, Sch Chem Biol & Environm Engn, Pharm Bldg 317, 1601 SW Jefferson Ave, Corvallis, OR 97331 USA
基金
加拿大自然科学与工程研究理事会;
关键词
Distributional semantic models; Semantic relations; Thematic fit; Event-based relations; Function; shape; and color relations; FEATURE PRODUCTION NORMS; LARGE SET; MEMORY; REPRESENTATION; ACTIVATION; CONTEXT; VERBS;
D O I
10.1111/cogs.13291
中图分类号
B84 [心理学];
学科分类号
04 ; 0402 ;
摘要
Distributional semantic models (DSMs) are a primary method for distilling semantic information from corpora. However, a key question remains: What types of semantic relations among words do DSMs detect? Prior work typically has addressed this question using limited human data that are restricted to semantic similarity and/or general semantic relatedness. We tested eight DSMs that are popular in current cognitive and psycholinguistic research (positive pointwise mutual information; global vectors; and three variations each of Skip-gram and continuous bag of words (CBOW) using word, context, and mean embeddings) on a theoretically motivated, rich set of semantic relations involving words from multiple syntactic classes and spanning the abstract-concrete continuum (19 sets of ratings). We found that, overall, the DSMs are best at capturing overall semantic similarity and also can capture verb-noun thematic role relations and noun-noun event-based relations that play important roles in sentence comprehension. Interestingly, Skip-gram and CBOW performed the best in terms of capturing similarity, whereas GloVe dominated the thematic role and event-based relations. We discuss the theoretical and practical implications of our results, make recommendations for users of these models, and demonstrate significant differences in model performance on event-based relations.
引用
收藏
页数:42
相关论文
共 50 条
  • [1] Discovering Distributional Thesauri Semantic Relations
    Stoykova, Velislava
    [J]. KDWEB 2017: KNOWLEDGE DISCOVERY ON THE WEB, 2017, 1959
  • [2] Evaluation of Distributional Semantic Models for the Extraction of Semantic Relations for Named Rivers from a Small Specialized Corpus
    Rojas Garcia, Juan
    Faber, Pamela
    [J]. PROCESAMIENTO DEL LENGUAJE NATURAL, 2019, (63): : 51 - 58
  • [3] How Well Do Distributional Models Capture Different Types of Semantic Knowledge?
    Rubinstein, Dana
    Levi, Effi
    Schwartz, Roy
    Rappoport, Ari
    [J]. PROCEEDINGS OF THE 53RD ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL) AND THE 7TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (IJCNLP), VOL 2, 2015, : 726 - 730
  • [4] Choosing which to use? A study of distributional models for nominal lexical semantic classification
    Romeo, Lauren
    Lebani, Gianluca E.
    Bel, Nuria
    Lenci, Alessandro
    [J]. LREC 2014 - NINTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2014, : 4366 - 4373
  • [5] Estimating the average need of semantic knowledge from distributional semantic models
    Hollis, Geoff
    [J]. MEMORY & COGNITION, 2017, 45 (08) : 1350 - 1370
  • [6] Estimating the average need of semantic knowledge from distributional semantic models
    Geoff Hollis
    [J]. Memory & Cognition, 2017, 45 : 1350 - 1370
  • [7] Distributional Semantic Models for Affective Text Analysis
    Malandrakis, Nikolaos
    Potamianos, Alexandros
    Iosif, Elias
    Narayanan, Shrikanth
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2013, 21 (11): : 2379 - 2392
  • [8] Regularized Training of Compositional Distributional Semantic Models
    Yang, Xuefeng
    Mao, Kezhi
    Zhao, Rui
    [J]. 2015 10TH INTERNATIONAL CONFERENCE ON INFORMATION, COMMUNICATIONS AND SIGNAL PROCESSING (ICICS), 2015,
  • [9] A Complex Network Approach to Distributional Semantic Models
    Utsumi, Akira
    [J]. PLOS ONE, 2015, 10 (08):
  • [10] Exploring terminological relations between multi-word terms in distributional semantic models
    Wang, Yizhe
    Daille, Beatrice
    Hathout, Nabil
    [J]. TERMINOLOGY, 2023,