Building Static Embeddings from Contextual Ones: Is It Useful for Building Distributional Thesauri?

Authors
Ferret, Olivier [1]
Affiliations
[1] Univ Paris Saclay, CEA, List, F-91120 Palaiseau, France
Keywords
static word embeddings; contextual word embeddings; semantic similarity; distributional thesauri
DOI
Not available
Chinese Library Classification number
TP39 [Computer applications]
Discipline classification code
081203; 0835
Abstract
While contextual language models are now dominant in Natural Language Processing, the token-level representations they build are not suitable for all uses. In this article, we propose a new method for building word-level, or type-level, embeddings from contextual models. This method combines the generalization and the aggregation of token representations. We evaluate it on a large set of English nouns from the perspective of building distributional thesauri for extracting semantic similarity relations. Moreover, we analyze the differences between static embeddings and type-level embeddings according to features such as word frequency or the type of semantic relations these embeddings account for, showing that the properties of these two types of embeddings can be complementary and can be exploited to further improve distributional thesauri.
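As a rough illustration of the aggregation idea mentioned in the abstract, the sketch below averages the token-level vectors of each word into a single type-level vector and ranks the other words by cosine similarity to form a thesaurus entry. This is not the author's method: mean pooling and cosine ranking are assumptions chosen for simplicity, the generalization step is omitted because the abstract does not specify it, and the toy vectors and the function names (`mean_vector`, `build_type_embeddings`, `thesaurus_entry`) are hypothetical.

```python
import math


def mean_vector(vectors):
    """Average a list of equal-length vectors component by component."""
    dim = len(vectors[0])
    return [sum(v[i] for v in vectors) / len(vectors) for i in range(dim)]


def cosine(a, b):
    """Cosine similarity between two non-zero vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def build_type_embeddings(token_vectors):
    """Aggregate token-level vectors into one type-level vector per word.

    Assumption: plain mean pooling stands in for the aggregation step.
    """
    return {word: mean_vector(vecs) for word, vecs in token_vectors.items()}


def thesaurus_entry(word, type_embeddings, k=2):
    """Return the k words most similar to `word`, with their cosine scores."""
    target = type_embeddings[word]
    scored = [
        (other, cosine(target, vec))
        for other, vec in type_embeddings.items()
        if other != word
    ]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:k]


# Hypothetical toy data: each word maps to the vectors of its occurrences.
# In practice these would come from a contextual model's token outputs.
token_vectors = {
    "car": [[1.0, 0.0, 0.0], [0.8, 0.2, 0.0]],
    "automobile": [[0.9, 0.1, 0.0]],
    "banana": [[0.0, 0.0, 1.0], [0.0, 0.2, 0.8]],
}

type_embeddings = build_type_embeddings(token_vectors)
neighbors = thesaurus_entry("car", type_embeddings)
```

With this toy data, `neighbors` lists "automobile" ahead of "banana" for the entry "car", since the averaged vectors of "car" and "automobile" point in the same direction.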
Pages: 2583-2590
Number of pages: 8