Contextual and Non-Contextual Word Embeddings: an in-depth Linguistic Investigation

Cited by: 0
Authors
Miaschi, Alessio [1, 2]
Dell'Orletta, Felice [2]
Affiliations
[1] Univ Pisa, Dept Comp Sci, Pisa, Italy
[2] Ist Linguist Computaz Antonio Zampolli, ItaliaNLP Lab, Pisa, Italy
Keywords
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
In this paper, we compare the linguistic knowledge encoded in the internal representations of a contextual language model (BERT) and a context-independent one (Word2vec). We use a wide set of probing tasks, each of which corresponds to a distinct sentence-level feature extracted from different levels of linguistic annotation. We show that, although BERT is capable of understanding the full context of each word in an input sequence, the implicit knowledge encoded in its aggregated sentence representations is still comparable to that of a context-independent model. We also find that BERT is able to encode sentence-level properties even within single-word embeddings, obtaining results comparable or even superior to those obtained with sentence representations.
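The probing setup summarized in the abstract can be illustrated with a minimal sketch. Everything below is an assumption made for illustration and is not taken from the paper: the toy vocabulary, the random vectors standing in for static word embeddings, mean pooling as the aggregation strategy, sentence length as the probed feature, and a linear regressor trained by gradient descent as the probe. The abstract does not specify the authors' actual features, aggregation methods, or probing model.

```python
import random

DIM = 8  # hypothetical embedding size, chosen only for this toy example


def make_toy_embeddings(vocab, dim=DIM, seed=0):
    """Return fixed random vectors standing in for static word embeddings."""
    rng = random.Random(seed)
    return {w: [rng.uniform(-1.0, 1.0) for _ in range(dim)] for w in vocab}


def mean_pool(tokens, embeddings):
    """Aggregate word vectors into one sentence vector by averaging."""
    vectors = [embeddings[t] for t in tokens]
    return [sum(col) / len(vectors) for col in zip(*vectors)]


def predict(x, weights, bias):
    """Linear probe output for one sentence vector."""
    return sum(w * xi for w, xi in zip(weights, x)) + bias


def train_linear_probe(features, targets, epochs=500, lr=0.05):
    """Fit a linear regressor with batch gradient descent on squared error."""
    weights = [0.0] * len(features[0])
    bias = 0.0
    n = len(features)
    for _ in range(epochs):
        grad_w = [0.0] * len(weights)
        grad_b = 0.0
        for x, y in zip(features, targets):
            err = predict(x, weights, bias) - y
            for i, xi in enumerate(x):
                grad_w[i] += err * xi / n
            grad_b += err / n
        weights = [w - lr * g for w, g in zip(weights, grad_w)]
        bias -= lr * grad_b
    return weights, bias


# Toy corpus (assumption): each sentence is a list of tokens.
sentences = [
    "the cat sleeps".split(),
    "the dog chases the cat".split(),
    "a bird sings".split(),
    "the dog sleeps on the mat".split(),
]
vocab = sorted({tok for sent in sentences for tok in sent})
embeddings = make_toy_embeddings(vocab)

# One sentence vector per sentence; the probed feature is sentence length.
features = [mean_pool(sent, embeddings) for sent in sentences]
targets = [float(len(sent)) for sent in sentences]

weights, bias = train_linear_probe(features, targets)
predictions = [predict(x, weights, bias) for x in features]
```

In a real comparison, the toy vectors would be replaced by representations taken from each model, and the probe would be scored on held-out sentences, so that the scores indicate how much of the feature each representation encodes.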
Pages: 110-119
Number of pages: 10