Annotating Entities with Fine-Grained Types in Austrian Court Decisions

Cited by: 1
Authors
Revenko, Artem [1]
Breit, Anna [1]
Mireles, Victor [1]
Moreno-Schneider, Julian [2]
Sageder, Christian [3]
Karampatakis, Sotirios [1]
Affiliations
[1] Semant Web Co GmbH, Vienna, Austria
[2] DFKI GmbH, Kaiserslautern, Germany
[3] Cybly GmbH, Salzburg, Austria
Keywords
Named Entity Recognition; Entity Typing; Legal Corpus; Natural Language Processing
DOI
10.3233/SSW210041
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
The use of Named Entity Recognition tools on domain-specific corpora is often hampered by insufficient training data. We investigate an approach to produce fine-grained named entity annotations of a large corpus of Austrian court decisions from a small manually annotated training data set. We first apply a general-purpose Named Entity Recognition model to produce annotations of common coarse-grained types. Next, a small sample of these annotations is manually inspected by domain experts to produce an initial fine-grained training data set. To use the small manually annotated data set efficiently, we formulate named entity typing as a binary classification task: for each originally annotated occurrence of an entity and for each fine-grained type, we verify whether the entity belongs to that type. For this purpose we train a transformer-based classifier. We randomly sample 547 predictions and evaluate them manually. The incorrect predictions are then used to improve the classifier: the corrected annotations are added to the training set. The experiments show that re-training with even a very small number (5 or 10) of originally incorrect predictions can significantly improve classifier performance. Finally, we train the classifier on all available data and re-annotate the whole data set.
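The binary formulation described in the abstract can be illustrated with a minimal sketch. It only shows how one annotated mention might be expanded into one yes/no instance per fine-grained type; it does not reproduce the authors' classifier or data. The type names in `FINE_TYPES`, the classes `Mention` and `BinaryExample`, and the function `to_binary_examples` are all hypothetical and chosen for illustration.

```python
from dataclasses import dataclass
from typing import List, Optional

# Hypothetical fine-grained type inventory; the actual types used in the
# paper are not listed in this record.
FINE_TYPES = ["court", "company", "public_authority", "judge", "party"]


@dataclass(frozen=True)
class Mention:
    text: str                        # surface form of the entity
    context: str                     # sentence in which the mention occurs
    coarse_type: str                 # type from the general-purpose NER model
    fine_type: Optional[str] = None  # expert label, if one is available


@dataclass(frozen=True)
class BinaryExample:
    mention: str
    context: str
    candidate_type: str
    label: Optional[int]  # 1 = belongs to type, 0 = does not, None = unlabeled


def to_binary_examples(mention: Mention,
                       fine_types: List[str] = FINE_TYPES) -> List[BinaryExample]:
    """Expand one mention into one binary instance per fine-grained type."""
    examples = []
    for candidate in fine_types:
        if mention.fine_type is None:
            label = None  # to be predicted by a classifier
        else:
            label = int(candidate == mention.fine_type)
        examples.append(
            BinaryExample(mention.text, mention.context, candidate, label)
        )
    return examples


if __name__ == "__main__":
    m = Mention(
        text="Oberster Gerichtshof",
        context="Der Oberste Gerichtshof hat entschieden.",
        coarse_type="ORG",
        fine_type="court",
    )
    for ex in to_binary_examples(m):
        print(ex.candidate_type, ex.label)
```

Under this assumed setup, a single expert label yields one positive and several negative training instances, which is one plausible reason the formulation makes good use of a small annotated set. A corrected prediction could be added to the training set the same way, by passing the corrected `Mention` through `to_binary_examples` again.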
Pages: 139-153
Number of pages: 15