Corpus-based acquisition of collocational prepositional phrases

被引:0
|
作者
Bouma, G [1 ]
Villada, B [1 ]
机构
[1] RU Groningen, Alfa Informat, NL-9700 AS Groningen, Netherlands
关键词
D O I
暂无
中图分类号
H0 [语言学];
学科分类号
030303 ; 0501 ; 050102 ;
摘要
Collocational prepositional phrases like ten koste van (at the expense of), met het oog op (with an eye on), and onder het mom van (under the pretext of) are patterns of the form P-NP-P, which have a non-compositional semantics and which are syntactically rigid or idiosyncratic. We present a number of linguistic tests which set such items apart from regularly built prepositional phrases. To find candidate strings which should be included in a computational lexicon as collocational prepositional phrases, we extract all instances of the relevant pattern from a corpus annotated with POS tags, Next, we introduce a number of statistical tests (mutual information, log-likelihood, and chi(2)) to find those instances which behave like strong collocations. The strongest collocations according to the statistical tests are compared with lists of such items presented elsewhere, and were evaluated by human judges.
引用
收藏
页码:23 / 37
页数:15
相关论文
共 50 条