A comparative evaluation and analysis of three generations of Distributional Semantic Models

被引:22
|
作者
Lenci, Alessandro [1 ]
Sahlgren, Magnus [2 ]
Jeuniaux, Patrick [3 ]
Gyllensten, Amaru Cuba [4 ]
Miliani, Martina [1 ,5 ]
机构
[1] Univ Pisa, Pisa, Italy
[2] AI Sweden, Stockholm, Sweden
[3] Inst Natl Criminalist & Criminol, Brussels, Belgium
[4] RISE, Stockholm, Sweden
[5] Univ Stranieri Siena, Siena, Italy
关键词
Distributional semantics; Evaluation; Contextual embeddings; Representational Similarity Analysis; WORD COOCCURRENCE STATISTICS; REPRESENTATIONS;
D O I
10.1007/s10579-021-09575-z
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Distributional semantics has deeply changed in the last decades. First, predict models stole the thunder from traditional count ones, and more recently both of them were replaced in many NLP applications by contextualized vectors produced by neural language models. Although an extensive body of research has been devoted to Distributional Semantic Model (DSM) evaluation, we still lack a thorough comparison with respect to tested models, semantic tasks, and benchmark datasets. Moreover, previous work has mostly focused on task-driven evaluation, instead of exploring the differences between the way models represent the lexical semantic space. In this paper, we perform a large-scale evaluation of type distributional vectors, either produced by static DSMs or obtained by averaging the contextualized vectors generated by BERT. First of all, we investigate the performance of embeddings in several semantic tasks, carrying out an in-depth statistical analysis to identify the major factors influencing the behavior of DSMs. The results show that (i) the alleged superiority of predict based models is more apparent than real, and surely not ubiquitous and (ii) static DSMs surpass BERT representations in most out-of-context semantic tasks and datasets. Furthermore, we borrow from cognitive neuroscience the methodology of Representational Similarity Analysis (RSA) to inspect the semantic spaces generated by distributional models. RSA reveals important differences related to the frequency and part-of-speech of lexical items.
引用
收藏
页码:1269 / 1313
页数:45
相关论文
共 50 条
  • [42] Conceptual Change and Distributional Semantic Models: an Exploratory Study on Pitfalls and Possibilities
    Sommerauer, Pia
    Fokkens, Antske
    [J]. 1ST INTERNATIONAL WORKSHOP ON COMPUTATIONAL APPROACHES TO HISTORICAL LANGUAGE CHANGE, 2019, : 223 - 233
  • [43] Comparing explicit and predictive distributional semantic models endowed with syntactic contexts
    Pablo Gamallo
    [J]. Language Resources and Evaluation, 2017, 51 : 727 - 743
  • [44] Can prediction-based distributional semantic models predict typicality?
    Heyman, Tom
    Heyman, Geert
    [J]. QUARTERLY JOURNAL OF EXPERIMENTAL PSYCHOLOGY, 2019, 72 (08): : 2084 - 2109
  • [45] Evaluating Distributional Semantic Models with Russian Noun-Adjective Compositions
    Panicheva, Polina
    Protopopova, Ekaterina
    Bukia, Grigoriy
    Mitrofanova, Olga
    [J]. ANALYSIS OF IMAGES, SOCIAL NETWORKS AND TEXTS, AIST 2016, 2017, 661 : 236 - 247
  • [46] Three generations in minimally extended standard models
    Frampton, Paul H.
    Ho, Chiu Man
    Kephart, Thomas W.
    [J]. PHYSICS LETTERS B, 2012, 715 (1-3) : 275 - 277
  • [47] COMPARATIVE EVALUATION OF SPATIAL CONTEXT TECHNIQUES FOR SEMANTIC IMAGE ANALYSIS
    Papadopoulos, G. Th.
    Saathoff, C.
    Grzegorzek, M.
    Mezaris, V.
    Kompatsiaris, I.
    Staab, S.
    Strintzis, M. G.
    [J]. 2009 10TH INTERNATIONAL WORKSHOP ON IMAGE ANALYSIS FOR MULTIMEDIA INTERACTIVE SERVICES, 2009, : 161 - +
  • [48] Comparative analysis of three mathematical models of hydrogen ignition
    Bedarev, IA
    Fedorov, AV
    [J]. COMBUSTION EXPLOSION AND SHOCK WAVES, 2006, 42 (01) : 19 - 26
  • [49] Comparative Analysis of Three GARCH Models Based on MCMC
    Gao, Yan
    Zhang, Chengjun
    Zhang, Liyan
    [J]. INFORMATION COMPUTING AND APPLICATIONS, PT II, 2011, 244 : 494 - +
  • [50] Comparative Analysis of Three Mathematical Models of Hydrogen Ignition
    I. A. Bedarev
    A. V. Fedorov
    [J]. Combustion, Explosion and Shock Waves, 2006, 42 : 19 - 26