Corpus-based metonymy analysis

被引:20
|
作者
Markert, K [1 ]
Nissim, M [1 ]
机构
[1] Univ Edinburgh, Div Informat, Sch Informat, Edinburgh EH8 9LW, Midlothian, Scotland
关键词
D O I
10.1207/S15327868MS1803_04
中图分类号
H0 [语言学];
学科分类号
030303 ; 0501 ; 050102 ;
摘要
In this article, we make the case for corpus-based metonymy analysis and show that many interesting linguistic and statistical questions can only be answered by working with real texts. To facilitate such studies, we present a method for annotating metonymies in domain- and genre-independent text. We advocate an annotation scheme that builds on regularities in metonymic usage, that takes underspecification in metonymic reference into account, and that is organized hierarchically. We combine previous metonymy classification proposals with insights from a corpus study to present a fully worked-out annotation scheme for location names, illustrating the previously mentioned principles. We present several experiments measuring annotation agreement and show that the annotation scheme is reliable and has wide coverage. We also provide a gold standard for annotations of this kind consisting of 2,000 annotated occurrences of country names in the British National Corpus. We use the resulting corpus to study metonymy distributions and the factors that influence the choice of literal versus metonymic readings in real texts.
引用
收藏
页码:175 / 188
页数:14
相关论文
共 50 条