Are End-to-End Systems Really Necessary for NER on Handwritten Document Images?

被引:4
|
作者
Tueselmann, Oliver [1 ]
Wolf, Fabian [1 ]
Fink, Gernot A. [1 ]
机构
[1] TU Dortmund Univ, Dept Comp Sci, D-44227 Dortmund, Germany
关键词
Named entity recognition; Document image analysis; Information retrieval; Handwritten documents; RECOGNITION;
D O I
10.1007/978-3-030-86331-9_52
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Named entities (NEs) are fundamental in the extraction of information from text. The recognition and classification of these entities into predefined categories is called Named Entity Recognition (NER) and plays a major role in Natural Language Processing. However, only a few works consider this task with respect to the document image domain. The approaches are either based on a two-stage or end-to-end architecture. A two-stage approach transforms the document image into a textual representation and determines the NEs using a textual NER. The end-to-end approach, on the other hand, avoids the explicit recognition step at text level and determines the NEs directly on image level. Current approaches that try to tackle the task of NER on segmented word images use end-to-end architectures. This is motivated by the assumption that handwriting recognition is too erroneous to allow for an effective application of textual NLP methods. In this work, we present a two-stage approach and compare it against state-of-the-art end-to-end approaches. Due to the lack of datasets and evaluation protocols, such a comparison is currently difficult. Therefore, we manually annotated the known IAM and George Washington datasets with NE labels and publish them along with optimized splits and an evaluation protocol. Our experiments show, contrary to the common belief, that a two-stage model can achieve higher scores on all tested datasets.
引用
收藏
页码:808 / 822
页数:15
相关论文
共 50 条
  • [1] End-to-end Piece-wise Unwarping of Document Images
    Das, Sagnik
    Singh, Kunwar Yashraj
    Wu, Jon
    Bas, Erhan
    Mahadevan, Vijay
    Bhotika, Rahul
    Samaras, Dimitris
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 4248 - 4257
  • [2] End-to-end system for printed Amazigh script recognition in document images
    Aharrane, Nabil
    Dahmouni, Abdellatif
    Ensah, Karim El Moutaouakil
    Satori, Khalid
    2017 3RD INTERNATIONAL CONFERENCE ON ADVANCED TECHNOLOGIES FOR SIGNAL AND IMAGE PROCESSING (ATSIP), 2017, : 313 - 318
  • [3] An end-to-end framework for the detection of mathematical expressions in scientific document images
    Phong, Bui Hai
    Hoang, Thang Manh
    Le, Thi-Lan
    EXPERT SYSTEMS, 2022, 39 (01)
  • [4] Applying End-to-End Trainable Approach on Stroke Extraction in Handwritten Math Expressions Images
    Moussa, Elmokhtar Mohamed
    Lelore, Thibault
    Mouchere, Harold
    DOCUMENT ANALYSIS AND RECOGNITION, ICDAR 2021, PT III, 2021, 12823 : 445 - 458
  • [5] End-To-End Deep-Learning-Based Tamil Handwritten Document Recognition and Classification Model
    Vinotheni, C.
    Pandian, S. Lakshmana
    IEEE ACCESS, 2023, 11 : 43195 - 43204
  • [6] An end-to-end administrative document analysis system
    Hamza, Hatem
    Belaid, Yolande
    Belaid, Abdel
    Chaudhuri, Bidyut B.
    PROCEEDINGS OF THE 8TH IAPR INTERNATIONAL WORKSHOP ON DOCUMENT ANALYSIS SYSTEMS, 2008, : 175 - 182
  • [7] DocFormer: End-to-End Transformer for Document Understanding
    Appalaraju, Srikar
    Jasani, Bhavan
    Kota, Bhargava Urala
    Xie, Yusheng
    Manmatha, R.
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 973 - 983
  • [8] External Knowledge Acquisition for End-to-End Document-Oriented Dialog Systems
    Lai, Tuan M.
    Castellucci, Giuseppe
    Kuzi, Saar
    Ji, Heng
    Rokhlenko, Oleg
    17TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EACL 2023, 2023, : 3633 - 3647
  • [9] End-to-End Optical Character Recognition for Bengali Handwritten Words
    Safir, Farisa Benta
    Ohi, Abu Quwsar
    Mridha, M. F.
    Monowar, Muhammad Mostafa
    Hamid, Md Abdul
    2021 IEEE NATIONAL COMPUTING COLLEGES CONFERENCE (NCCC 2021), 2021, : 1067 - +
  • [10] End-to-End Handwritten Text Detection and Transcription in Full Pages
    Carbonell, Manuel
    Mas, Joan
    Villegas, Mauricio
    Fornes, Alicia
    Llados, Josep
    2019 INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION WORKSHOPS (ICDARW), VOL 5, 2019, : 29 - 34