Recognition-Free Question Answering on Handwritten Document Collections

被引:2
|
作者
Tueselmann, Oliver [1 ]
Mueller, Friedrich [1 ]
Wolf, Fabian [1 ]
Fink, Gernot A. [1 ]
机构
[1] TU Dortmund Univ, Dept Comp Sci, D-44227 Dortmund, Germany
关键词
Visual question answering; Information retrieval; Handwritten documents; Document understanding;
D O I
10.1007/978-3-031-21648-0_18
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In recent years, considerable progress has been made in the research area of Question Answering (QA) on document images. Current QA approaches from the Document Image Analysis community are mainly focusing on machine-printed documents and perform rather limited on handwriting. This is mainly due to the reduced recognition performance on handwritten documents. To tackle this problem, we propose a recognition-free QA approach, especially designed for handwritten document image collections. We present a robust document retrieval method, as well as two QA models. Our approaches outperform the state-of-theart recognition-free models on the challenging BenthamQA and HWSQuAD datasets.
引用
收藏
页码:259 / 273
页数:15
相关论文
共 50 条
  • [1] Question answering beyond CLEF document collections
    Costa, Luis
    [J]. Evaluation of Multilingual and Multi-modal Information Retrieval, 2007, 4730 : 405 - 414
  • [2] Towards a segmentation and recognition-free approach for content-based document image retrieval of handwritten queries
    Chatbri, Houssem
    Kameyama, Keisuke
    Kwan, Paul
    [J]. Proceedings 3rd IAPR Asian Conference on Pattern Recognition ACPR 2015, 2015, : 146 - 150
  • [3] Recognition-free Retrieval of Old Arabic Document Images
    Sari, Toufik
    Kefali, Abderrahmane
    [J]. COMPUTACION Y SISTEMAS, 2011, 15 (02): : 195 - 208
  • [4] A Neural Model for Joint Document and Snippet Ranking in Question Answering for Large Document Collections
    Pappas, Dimitris
    Androutsopoulos, Ion
    [J]. 59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (ACL-IJCNLP 2021), VOL 1, 2021, : 3896 - 3907
  • [5] Document Image Retrieval Based on Texture Features: A Recognition-Free Approach
    Alaei, Fahimeh
    Alaei, Alireza
    Pal, Umapada
    Blumenstein, Michael
    [J]. 2016 INTERNATIONAL CONFERENCE ON DIGITAL IMAGE COMPUTING: TECHNIQUES AND APPLICATIONS (DICTA), 2016, : 456 - 462
  • [6] Asking questions on handwritten document collections
    Mathew, Minesh
    Gomez, Lluis
    Karatzas, Dimosthenis
    Jawahar, C., V
    [J]. INTERNATIONAL JOURNAL ON DOCUMENT ANALYSIS AND RECOGNITION, 2021, 24 (03) : 235 - 249
  • [7] Asking questions on handwritten document collections
    Minesh Mathew
    Lluis Gomez
    Dimosthenis Karatzas
    C. V. Jawahar
    [J]. International Journal on Document Analysis and Recognition (IJDAR), 2021, 24 : 235 - 249
  • [8] SESAME - self-supervised framework for extractive question answering over document collections
    Batista, Vitor A.
    Gomes, Diogo S. M.
    Evsukoff, Alexandre
    [J]. JOURNAL OF INTELLIGENT INFORMATION SYSTEMS, 2024,
  • [9] DAN: A Segmentation-Free Document Attention Network for Handwritten Document Recognition
    Coquenet, Denis
    Chatelain, Clement
    Paquet, Thierry
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (07) : 8227 - 8243
  • [10] Individual Recognition-Free Target Enclosure Model
    Kubo, Masao
    Yoshimura, Tatsurou
    Yamaguchi, Akihiro
    Sato, Hiroshi
    [J]. PROCEEDINGS OF THE SEVENTEENTH INTERNATIONAL SYMPOSIUM ON ARTIFICIAL LIFE AND ROBOTICS (AROB 17TH '12), 2012, : 608 - 613