All That Glitters is Not Gold: A Gold Standard of Adjective-Noun Collocations for German

被引:0
|
作者
Strakatova, Yana [1 ]
Falk, Neele [1 ]
Fuhrmann, Isabel [2 ]
Hinrichs, Erhard [1 ]
Rossmann, Daniela [1 ]
机构
[1] Univ Tubingen, Tubingen, Germany
[2] Berlin Brandenburg Acad Sci, Berlin, Germany
关键词
MultiWord Expressions & Collocations; Semantics; Statistical and Machine Learning Methods; WORD;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
In this paper we present the GerCo dataset of adjective-noun collocations for German, such as alter Freund `old friend' and tiefe Liebe `deep love'. The annotation has been performed by experts based on the annotation scheme introduced in this paper. The resulting dataset contains 4,732 positive and negative instances of collocations and covers all the 16 semantic classes of adjectives as defined in the German wordnet GermaNet. The dataset can serve as a reliable empirical basis for comparing different theoretical frameworks concerned with collocations or as material for data-driven approaches to the studies of collocations including different machine learning experiments. This paper addresses the latter issue by using the GerCo dataset for evaluating different models on the task of automatic collocation identification. We compare lexical association measures with static and contextualized word embeddings. The experiments showthatword embeddings outperformmethods based on statistical association measures by a wide margin.
引用
收藏
页码:4368 / 4378
页数:11
相关论文
共 50 条
  • [1] Semantic Modelling of Adjective-Noun Collocations Using FrameNet
    Strakatova, Yana
    Hinrichs, Erhard
    JOINT WORKSHOP ON MULTIWORD EXPRESSIONS AND WORDNET (MWE-WN 2019), 2019, : 104 - 113
  • [2] Learning collocations: Effects of online tools on teaching English adjective-noun collocations
    Basal, Ahmet
    BRITISH JOURNAL OF EDUCATIONAL TECHNOLOGY, 2019, 50 (01) : 342 - 356
  • [3] Measured GFR as "Gold Standard"-All that Glitters Is Not Gold?
    Hsu, Chi-yuan
    Bansal, Nisha
    CLINICAL JOURNAL OF THE AMERICAN SOCIETY OF NEPHROLOGY, 2011, 6 (08): : 1813 - 1814
  • [5] The Gold Standard of Pediatric Hemodynamic Monitoring: Not All That Glitters Is Gold
    Ricci, Zaccaria
    Iacobelli, Roberta
    Romagnoli, Stefano
    ANESTHESIA AND ANALGESIA, 2023, 136 (06): : E26 - E27
  • [6] Challenges and Strategies of Translating COVID-19 Adjective-Noun and Noun-Noun Collocations from English into Arabic
    Olimat, Sameer Naser
    Mahadin, Dana
    Al-Khawaldeh, Nisreen Naji
    Almahasees, Zakaryia
    EURASIAN JOURNAL OF APPLIED LINGUISTICS, 2022, 8 (03): : 120 - 133
  • [7] How word choice matters: An analysis of adjective-noun collocations in a corpus of learner essays
    Takac, Visnja Pavicic
    Lukac, Morana
    JEZIKOSLOVLJE, 2013, 14 (2-3): : 385 - 402
  • [8] ALL THAT GLITTERS IS NOT GOLD
    HEUSER, RR
    CATHETERIZATION AND CARDIOVASCULAR DIAGNOSIS, 1994, 33 (04): : 330 - 330
  • [9] All that glitters is gold
    Agisim, Gary
    Kenny, Richard
    Magee, Sara
    Patel, Bhalchandra
    JOURNAL OF COSMETIC SCIENCE, 2007, 58 (05) : 592 - 593
  • [10] All that glitters is not gold
    Schwedt, G
    CHEMIE IN UNSERER ZEIT, 2005, 39 (05) : 358 - 359