A divide-and-conquer strategy for shallow parsing of German

Authors
Neumann, G [1]
Braun, C [1]
Piskorski, J [1]
Affiliation
[1] DFKI GmbH, D-66123 Saarbrucken, Germany
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
We present a divide-and-conquer strategy based on finite-state technology for shallow parsing of real-world German texts. In a first phase, only the topological structure of a sentence (i.e., verb groups and subclauses) is determined. In a second phase, the phrasal grammars are applied to the contents of the different fields of the main clauses and subclauses. Shallow parsing is supported by suitably configured preprocessing, including morphological and on-line compound analysis, efficient POS filtering, and named entity recognition. The whole approach proved to be very useful for processing free word order languages like German. For the divide-and-conquer parsing strategy in particular, we obtained an f-measure of 87.14% on unseen data.
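
The two-phase idea in the abstract can be illustrated with a small Python sketch. It is not the authors' system: the STTS-style tag names, the functions `split_fields`, `chunk_field` and `shallow_parse`, and the toy noun-phrase pattern are all assumptions made for illustration, and subclause detection, morphology, compound analysis and named entity recognition are left out. Phase 1 separates verb groups from the fields between them; phase 2 applies a finite-state phrase pattern inside each field only.

```python
import re
from typing import List, Tuple

Token = Tuple[str, str]  # (word, POS tag); the tag names are illustrative

def split_fields(tokens: List[Token]) -> List[Tuple[str, List[Token]]]:
    """Phase 1 (simplified): separate verb groups from the fields between them."""
    segments = []
    current: List[Token] = []
    current_is_verb = False
    for word, tag in tokens:
        is_verb = tag.startswith("V")  # assumption: verb tags start with "V"
        if current and is_verb != current_is_verb:
            segments.append(("VG" if current_is_verb else "FIELD", current))
            current = []
        current.append((word, tag))
        current_is_verb = is_verb
    if current:
        segments.append(("VG" if current_is_verb else "FIELD", current))
    return segments

# Toy finite-state NP pattern over a space-terminated tag string:
# optional article, any number of adjectives, one or more nouns or names.
NP_PATTERN = re.compile(r"(?:\bART )?(?:\bADJA )*(?:\b(?:NN|NE) )+")

def chunk_field(field: List[Token]) -> List[str]:
    """Phase 2 (simplified): apply the NP pattern inside a single field."""
    tag_string = "".join(tag + " " for _, tag in field)
    chunks = []
    for match in NP_PATTERN.finditer(tag_string):
        # Each tag ends with one space, so counting spaces gives token offsets.
        start = tag_string[:match.start()].count(" ")
        end = start + match.group(0).count(" ")
        chunks.append(" ".join(word for word, _ in field[start:end]))
    return chunks

def shallow_parse(tokens: List[Token]) -> List[Tuple[str, List[str]]]:
    """Run phase 1 on the sentence, then phase 2 on each field separately."""
    result = []
    for kind, segment in split_fields(tokens):
        if kind == "VG":
            result.append(("VG", [" ".join(word for word, _ in segment)]))
        else:
            result.append(("FIELD", chunk_field(segment)))
    return result

sentence = [
    ("Der", "ART"), ("Mann", "NN"), ("hat", "VAFIN"),
    ("den", "ART"), ("großen", "ADJA"), ("Hund", "NN"), ("gesehen", "VVPP"),
]
parsed = shallow_parse(sentence)
```

Because the phrase pattern only ever sees one field at a time, a chunk can never span a verb group, which is the practical benefit of fixing the topological structure first.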
Pages: 239-246
Number of pages: 8