Overview of HIPE-2022: Named Entity Recognition and Linking in Multilingual Historical Documents

被引:5
|
作者
Ehrmann, Maud [1 ]
Romanello, Matteo [2 ]
Najem-Meyer, Sven [1 ]
Doucet, Antoine [3 ]
Clematide, Simon [4 ]
机构
[1] EPFL, Digital Humanities Lab, Vaud, Switzerland
[2] Univ Lausanne, Lausanne, Switzerland
[3] Univ La Rochelle, La Rochelle, France
[4] Univ Zurich, Dept Computat Linguist, Zurich, Switzerland
关键词
Named entity recognition and classification; Entity linking; Historical texts; Information extraction; Digitised newspapers; Digital humanities;
D O I
10.1007/978-3-031-13643-6_26
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents an overview of the second edition of HIPE (Identifying Historical People, Places and other Entities), a shared task on named entity recognition and linking in multilingual historical documents. Following the success of the first CLEF-HIPE-2020 evaluation lab, HIPE-2022 confronts systems with the challenges of dealing with more languages, learning domain-specific entities, and adapting to diverse annotation tag sets. This shared task is part of the ongoing efforts of the natural language processing and digital humanities communities to adapt and develop appropriate technologies to efficiently retrieve and explore information from historical texts. On such material, however, named entity processing techniques face the challenges of domain heterogeneity, input noisiness, dynamics of language, and lack of resources. In this context, the main objective of HIPE-2022, run as an evaluation lab of the CLEF 2022 conference, is to gain new insights into the transferability of named entity processing approaches across languages, time periods, document types, and annotation tag sets. Tasks, corpora, and results of participating teams are presented.
引用
收藏
页码:423 / 446
页数:24
相关论文
共 50 条
  • [21] Comparison of named entity recognition methodologies in biomedical documents
    Hye-Jeong Song
    Byeong-Cheol Jo
    Chan-Young Park
    Jong-Dae Kim
    Yu-Seop Kim
    BioMedical Engineering OnLine, 17
  • [22] Named Entity Recognition for Digitised Historical Texts
    Grover, Claire
    Givon, Sharon
    Tobin, Richard
    Ball, Julian
    SIXTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, LREC 2008, 2008, : 1343 - 1346
  • [23] A Comprehensive Study of Open-Source Libraries for Named Entity Recognition on Handwritten Historical Documents
    Monroc, Claire Bizon
    Miret, Blanche
    Bonhomme, Marie-Laurence
    Kermorvant, Christopher
    DOCUMENT ANALYSIS SYSTEMS, DAS 2022, 2022, 13237 : 429 - 444
  • [24] Tuning Multilingual Transformers for Named Entity Recognition on Slavic Languages
    Arkhipov, Mikhail
    Trofimova, Maria
    Kuratov, Yuri
    Sorokin, Alexey
    7TH WORKSHOP ON BALTO-SLAVIC NATURAL LANGUAGE PROCESSING (BSNLP'2019), 2019, : 89 - 93
  • [25] On the Strength of Character Language Models for Multilingual Named Entity Recognition
    Yu, Xiaodong
    Mayhew, Stephen
    Sammons, Mark
    Roth, Dan
    2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018), 2018, : 3073 - 3077
  • [26] Named Entity Recognition for Entity Linking: WhatWorks and What's Next
    Tedeschi, Simone
    Conia, Simone
    Cecconi, Francesco
    Navigli, Roberto
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2021, 2021, : 2584 - 2596
  • [27] GERBIL - Benchmarking Named Entity Recognition and Linking consistently
    Roeder, Michael
    Usbeck, Ricardo
    Ngomo, Axel-Cyrille Ngonga
    SEMANTIC WEB, 2018, 9 (05) : 605 - 625
  • [28] Named Entity Recognition, Linking and Generation for Greek Legislation
    Angelidis, Iosif
    Chalkidis, Ilias
    Koubarakis, Manolis
    LEGAL KNOWLEDGE AND INFORMATION SYSTEMS (JURIX 2018), 2018, 313 : 1 - 10
  • [29] An Approach to Named Entity Extraction from Mongolian Historical Documents
    Batjargal, Biligsaikhan
    Khaltarkhuu, Garmaabazar
    Maeda, Akira
    2015 INTERNATIONAL CONFERENCE ON CULTURE AND COMPUTING (CULTURE COMPUTING), 2015, : 205 - 206
  • [30] Overview of NLPCC2022 Shared Task 5 Track 2: Named Entity Recognition
    Cai, Borui
    Zhang, He
    Liu, Fenghong
    Liu, Ming
    Zong, Tianrui
    Chen, Zhe
    Li, Yunfeng
    NATURAL LANGUAGE PROCESSING AND CHINESE COMPUTING, NLPCC 2022, PT II, 2022, 13552 : 336 - 341