The principal components of meaning, revisited

Cited by: 1
Authors
Westbury, Chris [1 ]
Yang, Michelle [2 ]
Anderson, Kris [1 ]
Affiliations
[1] Univ Alberta, Dept Psychol, P220 Biol Sci Bldg, Edmonton, AB T6G 2E9, Canada
[2] McGill Univ, Dept Psychol, 2001 McGill Coll Ave, Montreal, PQ H3A 1G1, Canada
Funding
Natural Sciences and Engineering Research Council of Canada
Keywords
Semantics; Word meaning; Word-embedding models; Word2vec; Lexical co-occurrence; Principal components analysis; COOCCURRENCE MODELS; ENGLISH;
DOI
10.3758/s13423-024-02551-y
Chinese Library Classification
B841 [Research methods in psychology]
Discipline classification code
040201
Abstract
Osgood, Suci, and Tannenbaum were the first to attempt to identify the principal components of semantics using dimensional reduction of a high-dimensional model of semantics constructed from human judgments of word relatedness. Modern word-embedding models analyze patterns of words to construct higher dimensional models of semantics that can be similarly subjected to dimensional reduction. Hollis and Westbury characterized the first eight principal components (PCs) of a word-embedding model by correlating them with several well-known lexical measures, such as logged word frequency, age of acquisition, valence, arousal, dominance, and concreteness. Their results showed some clear differentiation of interpretation between the PCs. Here, we extend this work by analyzing a larger word-embedding matrix using semantic measures initially derived from subjective inspection of the PCs. We then use quantitative analysis to confirm the utility of these subjective measures for predicting PC values and cross-validate them on two word-embedding matrices developed on distinct corpora. Several semantic and word class measures are strongly predictive of early PC values, including first-person and second-person verbs, personal relevance of abstract and concrete words, affect terms, and names of places and people. The predictors of the lowest magnitude PCs generalized well to word-embedding matrices constructed from separate corpora, including matrices constructed using different word-embedding methods. The predictive categories we describe are consistent with Wittgenstein's argument that an autonomous level of social interaction grounds linguistic meaning.
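As a rough illustration of the general approach the abstract describes (extracting a principal component from a word-embedding matrix and correlating its scores with a lexical measure), the sketch below may help. It is not the authors' code: the embeddings and the "measure" are synthetic, all function and variable names are hypothetical, and the first PC is estimated with plain power iteration using only the Python standard library.

```python
import math
import random


def center_columns(rows):
    """Subtract each column's mean so PCA operates on centered data."""
    n, d = len(rows), len(rows[0])
    means = [sum(r[j] for r in rows) / n for j in range(d)]
    return [[r[j] - means[j] for j in range(d)] for r in rows]


def first_pc(rows, iters=200):
    """Estimate the first PC loading vector of centered rows by power iteration."""
    d = len(rows[0])
    v = [1.0 / math.sqrt(d)] * d
    for _ in range(iters):
        scores = [sum(x * w for x, w in zip(r, v)) for r in rows]          # X v
        w = [sum(s * r[j] for s, r in zip(scores, rows)) for j in range(d)]  # X^T (X v)
        norm = math.sqrt(sum(x * x for x in w))
        v = [x / norm for x in w]
    return v


def pearson(a, b):
    """Pearson correlation between two equal-length sequences."""
    ma, mb = sum(a) / len(a), sum(b) / len(b)
    cov = sum((x - ma) * (y - mb) for x, y in zip(a, b))
    va = sum((x - ma) ** 2 for x in a)
    vb = sum((y - mb) ** 2 for y in b)
    return cov / math.sqrt(va * vb)


# Synthetic stand-in data (an assumption, not real embeddings or norms):
# each toy "word" has a hidden measure that drives one direction of its vector.
rng = random.Random(0)
direction = [0.5, -0.2, 0.8, 0.1, 0.3]
measure = [rng.uniform(-1.0, 1.0) for _ in range(200)]
embeddings = [
    [3.0 * m * d + rng.gauss(0.0, 0.1) for d in direction] for m in measure
]

X = center_columns(embeddings)
loadings = first_pc(X)
pc1_scores = [sum(x * w for x, w in zip(row, loadings)) for row in X]
r = pearson(pc1_scores, measure)
print(f"correlation between PC1 scores and the synthetic measure: {r:.3f}")
```

With real data, the rows would be word vectors from a trained embedding model and the measure would come from published norms; a numerical library would normally replace the hand-written power iteration.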
Pages: 203-225
Page count: 23