Predicting Concreteness and Imageability of Words Within and Across Languages via Word Embeddings

Cited by: 0
Authors
Ljubešić, Nikola [1]
Fišer, Darja [2]
Peti-Stantić, Anita [3]
Affiliations
[1] Jozef Stefan Inst, Dept Knowledge Technol, Jamova Cesta 39, SI-1000 Ljubljana, Slovenia
[2] Univ Ljubljana, Dept Translat, Fac Arts, Askerceva 2, SI-1000 Ljubljana, Slovenia
[3] Univ Zagreb, Fac Humanities & Social Sci, Ivana Lucica 3, HR-10000 Zagreb, Croatia
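Keywords
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification code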
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
The notions of concreteness and imageability, traditionally important in psycholinguistics, are gaining significance in semantically oriented natural language processing tasks. In this paper, we investigate the predictability of these two concepts via supervised learning, using word embeddings as explanatory variables. We perform predictions both within and across languages by exploiting collections of cross-lingual embeddings aligned to a single vector space. We show that the notions of concreteness and imageability are highly predictable both within and across languages, with a moderate loss of up to 20% in correlation when predicting across languages. We further show that cross-lingual transfer via word embeddings is more efficient than simple transfer via bilingual dictionaries.
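The setup described in the abstract can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the data are synthetic random vectors standing in for embeddings aligned to a shared space, the ratings are generated from a hidden linear direction, and ridge regression is used as a simple stand-in regressor rather than whatever model the authors trained. The helper names (`fit_ridge`, `make_language`, `spearman`) are hypothetical.

```python
import numpy as np


def fit_ridge(X, y, alpha=1.0):
    """Closed-form ridge regression with an unpenalised intercept."""
    x_mean, y_mean = X.mean(axis=0), y.mean()
    Xc = X - x_mean
    gram = Xc.T @ Xc + alpha * np.eye(X.shape[1])
    w = np.linalg.solve(gram, Xc.T @ (y - y_mean))
    b = y_mean - x_mean @ w
    return w, b


def predict(X, w, b):
    """Apply the fitted linear model to embedding rows."""
    return X @ w + b


def spearman(a, b):
    """Spearman correlation as Pearson correlation of ranks (assumes no ties)."""
    rank_a = np.argsort(np.argsort(a))
    rank_b = np.argsort(np.argsort(b))
    return float(np.corrcoef(rank_a, rank_b)[0, 1])


def make_language(rng, n_words, direction, noise):
    """Synthetic 'language': random vectors in a shared space, with ratings
    that depend linearly on one hidden direction (a toy assumption)."""
    X = rng.normal(size=(n_words, direction.shape[0]))
    y = X @ direction + noise * rng.normal(size=n_words)
    return X, y


rng = np.random.default_rng(0)
direction = rng.normal(size=20)  # hidden "concreteness" axis, shared by both languages

X_src, y_src = make_language(rng, 500, direction, noise=0.5)  # rated source language
X_tgt, y_tgt = make_language(rng, 300, direction, noise=0.5)  # target language

# Within-language: train on 400 source words, evaluate on the held-out 100.
w, b = fit_ridge(X_src[:400], y_src[:400])
rho_within = spearman(predict(X_src[400:], w, b), y_src[400:])

# Cross-language: train on all source words, evaluate on target-language words.
w, b = fit_ridge(X_src, y_src)
rho_cross = spearman(predict(X_tgt, w, b), y_tgt)

print(f"within-language Spearman: {rho_within:.3f}")
print(f"cross-language Spearman:  {rho_cross:.3f}")
```

Because both synthetic languages share the same hidden direction by construction, this toy example does not reproduce the cross-lingual loss in correlation reported in the abstract; it only shows the train-on-one-language, predict-on-another workflow that an aligned vector space makes possible.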
Pages: 217-222
Page count: 6
Related papers
13 in total
  • [1] Concreteness, context availability, and imageability ratings and word associations for abstract, concrete, and emotion words
    Altarriba, J
    Bauer, LM
    Benvenuto, C
    [J]. BEHAVIOR RESEARCH METHODS INSTRUMENTS & COMPUTERS, 1999, 31 (04) : 578 - 602
  • [2] Concreteness, context availability, and imageability ratings and word associations for abstract, concrete, and emotion words
    Jeanette Altarriba
    Lisa M. Bauer
    Claudia Benvenuto
    [J]. Behavior Research Methods, Instruments, & Computers, 1999, 31 : 578 - 602
  • [3] The Minho Word Pool: Norms for imageability, concreteness, and subjective frequency for 3,800 Portuguese words
    Soares, Ana Paula
    Costa, Ana Santos
    Machado, Joao
    Comesana, Montserrat
    Oliveira, Helena Mendes
    [J]. BEHAVIOR RESEARCH METHODS, 2017, 49 (03) : 1065 - 1081
  • [4] The Minho Word Pool: Norms for imageability, concreteness, and subjective frequency for 3,800 Portuguese words
    Ana Paula Soares
    Ana Santos Costa
    João Machado
    Montserrat Comesaña
    Helena Mendes Oliveira
    [J]. Behavior Research Methods, 2017, 49 : 1065 - 1081
  • [5] Number of meanings and concreteness: Consequences of ambiguity within and across languages
    Tokowicz, Natasha
    Kroll, Judith F.
    [J]. LANGUAGE AND COGNITIVE PROCESSES, 2007, 22 (05) : 727 - 779
  • [6] Improving Word Embeddings via Combining with Complementary Languages
    Li, Changliang
    Xu, Bo
    Wu, Gaowei
    Zhuang, Tao
    Wang, Xiuying
    Ge, Wendong
    [J]. ADVANCES IN ARTIFICIAL INTELLIGENCE, CANADIAN AI 2014, 2014, 8436 : 313 - 318
  • [7] Decoding the essence of two-character Chinese words: Unveiling valence, arousal, concreteness, familiarity, and imageability through word norming
    Chan, Yuen-Lai
    Tse, Chi-Shing
    [J]. BEHAVIOR RESEARCH METHODS, 2024, 56 (07) : 7574 - 7601
  • [8] Perceptual modality norms for 1,121 Italian words: A comparison with concreteness and imageability scores and an analysis of their impact in word processing tasks
    Alessandra Vergallito
    Marco Alessandro Petilli
    Marco Marelli
    [J]. Behavior Research Methods, 2020, 52 : 1599 - 1616
  • [9] Perceptual modality norms for 1,121 Italian words: A comparison with concreteness and imageability scores and an analysis of their impact in word processing tasks
    Vergallito, Alessandra
    Petilli, Marco Alessandro
    Marelli, Marco
    [J]. BEHAVIOR RESEARCH METHODS, 2020, 52 (04) : 1599 - 1616
  • [10] Comparable Corpora Within and Across Languages, Word Frequency Lists and the KELLY Project
    Kilgarriff, Adam
    [J]. LREC 2010 - SEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2010 : 1 - 5