Overview of HIPE-2022: Named Entity Recognition and Linking in Multilingual Historical Documents

被引:5
|
作者
Ehrmann, Maud [1 ]
Romanello, Matteo [2 ]
Najem-Meyer, Sven [1 ]
Doucet, Antoine [3 ]
Clematide, Simon [4 ]
机构
[1] EPFL, Digital Humanities Lab, Vaud, Switzerland
[2] Univ Lausanne, Lausanne, Switzerland
[3] Univ La Rochelle, La Rochelle, France
[4] Univ Zurich, Dept Computat Linguist, Zurich, Switzerland
关键词
Named entity recognition and classification; Entity linking; Historical texts; Information extraction; Digitised newspapers; Digital humanities;
D O I
10.1007/978-3-031-13643-6_26
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents an overview of the second edition of HIPE (Identifying Historical People, Places and other Entities), a shared task on named entity recognition and linking in multilingual historical documents. Following the success of the first CLEF-HIPE-2020 evaluation lab, HIPE-2022 confronts systems with the challenges of dealing with more languages, learning domain-specific entities, and adapting to diverse annotation tag sets. This shared task is part of the ongoing efforts of the natural language processing and digital humanities communities to adapt and develop appropriate technologies to efficiently retrieve and explore information from historical texts. On such material, however, named entity processing techniques face the challenges of domain heterogeneity, input noisiness, dynamics of language, and lack of resources. In this context, the main objective of HIPE-2022, run as an evaluation lab of the CLEF 2022 conference, is to gain new insights into the transferability of named entity processing approaches across languages, time periods, document types, and annotation tag sets. Tasks, corpora, and results of participating teams are presented.
引用
收藏
页码:423 / 446
页数:24
相关论文
共 50 条
  • [41] Firefly Algorithm Based Multilingual Named Entity Recognition for Indian Languages
    Biswas, Sitanath
    Dash, Sujata
    Acharya, Sweta
    ADVANCED INFORMATICS FOR COMPUTING RESEARCH, ICAICR 2018, PT I, 2019, 955 : 540 - 552
  • [42] Eaglet - a Named Entity Recognition and Entity Linking Gold Standard Checking Tool
    Jha, Kunal
    Roeder, Michael
    Ngomo, Axel-Cyrille Ngonga
    SEMANTIC WEB: ESWC 2017 SATELLITE EVENTS, 2017, 10577 : 149 - 154
  • [43] Named Entity Recognition and Linking in Tweets Based on Linguistic Similarity
    Pipitone, Arianna
    Tirone, Giuseppe
    Pirrone, Roberto
    AI*IA 2017 ADVANCES IN ARTIFICIAL INTELLIGENCE, 2017, 10640 : 101 - 113
  • [44] Named Entity Recognition an Aid to Improve Multilingual Entity Filling In Language-Independent Approach
    Bhagavatula, Mahathi
    Santosh, G. S. K.
    Varma, Vasudeva
    PROCEEDINGS OF THE FIRST WORKSHOP ON INFORMATION AND KNOWLEDGE MANAGEMENT FOR DEVELOPING REGION, 2012, : 3 - 9
  • [45] Deep Learning for Named-Entity Linking with Transfer Learning for Legal Documents
    Elnaggar, Ahmed
    Otto, Robin
    Matthes, Florian
    PROCEEDINGS OF 2018 ARTIFICIAL INTELLIGENCE AND CLOUD COMPUTING CONFERENCE (AICCC 2018), 2018, : 23 - 28
  • [46] A Benchmark of Named Entity Recognition Approaches in Historical Documents Application to 19th Century French Directories
    Abadie, N.
    Carlinet, E.
    Chazalon, J.
    Dumenieu, B.
    DOCUMENT ANALYSIS SYSTEMS, DAS 2022, 2022, 13237 : 445 - 460
  • [47] Named entity recognition for Chinese judgment documents based on BiLSTM and CRF
    Wenming Huang
    Dengrui Hu
    Zhenrong Deng
    Jianyun Nie
    EURASIP Journal on Image and Video Processing, 2020
  • [48] Cross-Model Named Entity Recognition in Pictures for Procurement Documents
    Yang, Sai
    Liu, Xin
    Yu, Shaowen
    Computer Engineering and Applications, 2024, 60 (03) : 213 - 219
  • [49] ArRaNER: A novel named entity recognition model for biomedical literature documents
    R. Ramachandran
    K. Arutchelvan
    The Journal of Supercomputing, 2022, 78 : 16498 - 16511
  • [50] Named entity recognition for Chinese judgment documents based on BiLSTM and CRF
    Huang, Wenming
    Hu, Dengrui
    Deng, Zhenrong
    Nie, Jianyun
    EURASIP JOURNAL ON IMAGE AND VIDEO PROCESSING, 2020, 2020 (01)