A Dataset for Analysing Complex Document Layouts in the Digital Humanities and Its Evaluation with Krippendorff's Alpha

被引:2
|
作者
Tschirschwitz, David [1 ]
Klemstein, Franziska [1 ]
Stein, Benno [1 ]
Rodehorst, Volker [1 ]
机构
[1] Bauhaus Univ, Weimar, Germany
来源
关键词
Document layout analysis; Digital humanities; Instance segmentation; Inter-annotator-agreement;
D O I
10.1007/978-3-031-16788-1_22
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We introduce a new research resource in the form of a high-quality, domain-specific dataset for analysing the document layout of historical documents. The dataset provides an instance segmentation ground truth with 19 classes based on historical layout structures that stem (a) from the publication production process and the respective genres (life sciences, architecture, art, decorative arts, etc.) and, (b) from selected text registers (such as monograph, trade journal, illustrated magazine). Altogether, the dataset contains more than 52,000 instances annotated by experts. A baseline has been tested with the well-known Mask R-CNN and compared to the state-of-the-art model VSR [55]. Inspired by evaluation practices from the field of Natural Language Processing (NLP), we have developed a new method for evaluating annotation consistency. Our method is based on Krippendorff's alpha (K-alpha), a statistic for quantifying the so-called "inter-annotator-agreement". In particular, we propose an adaptation of K-alpha that treats annotations as a multipartite graph for assessing the agreement of a variable number of annotators. The method is adjustable with regard to evaluation strictness, and it can be used in 2D or 3D as well as for a variety of tasks such as semantic segmentation, instance segmentation, and 3D point cloud segmentation.
引用
收藏
页码:354 / 374
页数:21
相关论文
共 23 条