A Comparative Study of Information Extraction Strategies Using an Attention-Based Neural Network

被引:3
|
作者
Tarride, Solene [1 ,2 ]
Lemaitre, Aurelie [1 ]
Couasnon, Bertrand [1 ]
Tardivel, Sophie [2 ]
机构
[1] Univ Rennes, CNRS, IRISA, Rennes, France
[2] Doptim, Cesson Sevigne, France
来源
关键词
Document image analysis; Historical documents; Information extraction; Handwriting recognition; Named entity recognition;
D O I
10.1007/978-3-031-06555-2_43
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This article focuses on information extraction in historical handwritten marriage records. Traditional approaches rely on a sequential pipeline of two consecutive tasks: handwriting recognition is applied before named entity recognition. More recently, joint approaches that handle both tasks at the same time have been investigated, yielding state-of-the-art results. However, as these approaches have been used in different experimental conditions, they have not been fairly compared yet. In this work, we conduct a comparative study of sequential and joint approaches based on the same attention-based architecture, in order to quantify the gain that can be attributed to the joint learning strategy. We also investigate three new joint learning configurations based on multi-task or multi-scale learning. Our study shows that relying on a joint learning strategy can lead to an 8% increase of the complete recognition score. We also highlight the interest of multi-task learning and demonstrate the benefit of attention-based networks for information extraction. Our work achieves state-of-the-art performance in the ICDAR 2017 Information Extraction competition on the Esposalles database at line-level, without any language modelling or post-processing.
引用
收藏
页码:644 / 658
页数:15
相关论文
共 50 条
  • [1] ATTENTION-BASED NEURAL NETWORK FOR JOINT DIARIZATION AND SPEAKER EXTRACTION
    Chazan, Shlomo E.
    Gannot, Sharon
    Goldberger, Jacob
    [J]. 2018 16TH INTERNATIONAL WORKSHOP ON ACOUSTIC SIGNAL ENHANCEMENT (IWAENC), 2018, : 301 - 305
  • [2] Enhanced streamflow forecasting using attention-based neural network models: a comparative study in MOPEX basins
    Muhammad, Abdullahi Uwaisu
    Muazu, Tasiu
    Ying, Haihua
    Ba, Abdoul Fatakhou
    Tijjani, Sani
    Adam, Jibril Muhammad
    Bello, Aliyu Uthman
    Bala, Muhammad Muhammad
    Ali, Mosaad Hussein
    Dabai, Umar Sani
    Yahaya, Muhammad Sabo
    [J]. MODELING EARTH SYSTEMS AND ENVIRONMENT, 2024, 10 (04) : 5717 - 5734
  • [3] Contour extraction of medical images using an attention-based network
    Lv, Ju Jian
    Chen, Hao Yuan
    Li, Jia Wen
    Lin, Kai Han
    Chen, Rong Jun
    Wang, Lei Jun
    Zeng, Xian Xian
    Ren, Jin Chang
    Zhao, Hui Min
    [J]. BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2023, 84
  • [4] Wind Power Forecasting Using Attention-Based Recurrent Neural Networks: A Comparative Study
    Huang, Bin
    Liang, Yuying
    Qiu, Xiaolin
    [J]. IEEE ACCESS, 2021, 9 : 40432 - 40444
  • [5] Structured Information Extraction of Pathology Reports with Attention-based Graph Convolutional Network
    Wu, Jialun
    Tang, Kaiwen
    Zhang, Haichuan
    Wang, Chunbao
    Li, Chen
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE, 2020, : 2395 - 2402
  • [6] Topical-Relevance Detection Using Attention-Based Neural Network
    Li, Xia
    Yang, Zhanyuan
    Chen, Minping
    Feng, Wenhe
    [J]. 2018 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP), 2018, : 373 - 377
  • [7] Mineral prospectivity mapping using attention-based convolutional neural network
    Li, Quanke
    Chen, Guoxiong
    Luo, Lei
    [J]. ORE GEOLOGY REVIEWS, 2023, 156
  • [8] Attention-based sentiment analysis using convolutional and recurrent neural network
    Usama, Mohd
    Ahmad, Belal
    Song, Enmin
    Hossain, M. Shamim
    Alrashoud, Mubarak
    Muhammad, Ghulam
    [J]. FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2020, 113 : 571 - 578
  • [9] Environment sound classification using an attention-based residual neural network
    Tripathi, Achyut Mani
    Mishra, Aakansha
    [J]. NEUROCOMPUTING, 2021, 460 : 409 - 423
  • [10] A Sentence Summarizer using Recurrent Neural Network and Attention-Based Encoder
    Kuremoto, Takashi
    Tsuruda, Takuji
    Mabu, Shingo
    Obayashi, Masanao
    [J]. PROCEEDINGS OF THE 2017 INTERNATIONAL CONFERENCE ON APPLIED MATHEMATICS, MODELING AND SIMULATION (AMMS 2017), 2017, 153 : 245 - 248