DocExtractNet: A novel framework for enhanced information extraction from business documents

被引:0
|
作者
Yan, Zhengjin [1 ]
Ye, Zheng [1 ]
Ge, Jun [2 ]
Qin, Jun [1 ]
Liu, Jing [1 ]
Cheng, Yu [3 ]
Gurrin, Cathal [4 ]
机构
[1] South Cent Minzu Univ, Coll Comp Sci & Informat Phys Fus Intelligent Comp, Key Lab Natl Ethn Affairs Commiss, Wuhan, Peoples R China
[2] Wuchang Univ Technol, Sch Artificial Intelligence, Wuhan, Peoples R China
[3] Hangzhou Boyan Private Equ Fund Management Partner, Hangzhou, Peoples R China
[4] Dublin City Univ, Dublin, Ireland
关键词
Receipt information extraction; LayoutLMv3; ImageEnhance; PrecisionHints; CrossModalFusion;
D O I
10.1016/j.ipm.2024.104046
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Efficient extraction of critical information from receipt is essential for automating financial processes and supporting timely decision-making in businesses. However, this process faces significant challenges, starting with variations in the quality of scanned receipt images due to differences in scanning equipment, followed by the complexity of diverse receipt formats, and further complicated by handwritten elements and noise, making accurate extraction particularly difficult. Therefore, to address these issues, we propose a model framework called DocExtractNet, based on LayoutLMv3, designed for extracting key information from receipt. Firstly, we introduce the ImageEnhance method to process image modality features, enhancing image clarity and significantly improving recognition accuracy for low-quality images. Then, we implement the PrecisionHints strategy to supplement missing key-value pairs in the text modality, improving data integrity and the model's overall performance. Furthermore, we apply the CrossModalFusion method to combine both image and text features, allowing the model to better understand and extract receipt information. The experimental results on the Finance- Receipts, FUNSD, and CORD datasets show that DocExtractNet significantly improves F1 scores compared to other models, with F1 scores reaching 97.07% for Finance-Receipts, 91.80% for FUNSD, and 97.38% for CORD, highlighting its superior performance in receipt information extraction.
引用
收藏
页数:15
相关论文
共 50 条
  • [21] Semantic information extraction from images of complex documents
    Claudio Antonio Peanho
    Henrique Stagni
    Flavio Soares Correa da Silva
    Applied Intelligence, 2012, 37 : 543 - 557
  • [22] Efficient Temporal Information Extraction from Korean Documents
    Lim, Chae-Gyun
    Choi, Ho-Jin
    2017 18TH IEEE INTERNATIONAL CONFERENCE ON MOBILE DATA MANAGEMENT (IEEE MDM 2017), 2017, : 366 - 370
  • [23] Template mining for information extraction from digital documents
    Chowdhury, GG
    LIBRARY TRENDS, 1999, 48 (01) : 182 - 208
  • [24] Structure recognition and information extraction from tabular documents
    Chandran, S
    Balasubramanian, S
    Gandhi, T
    Prasad, A
    Kasturi, R
    Chhabra, A
    INTERNATIONAL JOURNAL OF IMAGING SYSTEMS AND TECHNOLOGY, 1996, 7 (04) : 289 - 303
  • [25] Information Extraction from Presentation-Oriented Documents
    Ruffolo, Massimo
    Oro, Ermelinda
    ERCIM NEWS, 2012, (89): : 44 - 44
  • [26] Handwritten information extraction from historical census documents
    Nion, Thibauld
    Menasri, Fares
    Louradour, Jerome
    Sibade, Cedric
    Retornaz, Thomas
    Metaireau, Pierre-Yves
    Kermorvant, Christopher
    2013 12TH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), 2013, : 822 - 826
  • [27] Information extraction from documents for automating software testing
    Lutsky, P
    ARTIFICIAL INTELLIGENCE IN ENGINEERING, 2000, 14 (01): : 63 - 69
  • [28] Information Extraction from Handwritten Tables in Historical Documents
    Andres, Jose
    Ramon Prieto, Jose
    Granell, Emilio
    Romero, Veronica
    Andreu Sanchez, Joan
    Vidal, Enrique
    DOCUMENT ANALYSIS SYSTEMS, DAS 2022, 2022, 13237 : 184 - 198
  • [29] Information extraction from the structured part of office documents
    Hao, XL
    Wang, JTL
    Ng, PA
    INFORMATION SCIENCES, 1996, 91 (3-4) : 245 - 274
  • [30] AURORA: An Information Extraction System of Domain-specific Business Documents with Limited Data
    Minh-Tien Nguyen
    Dung Tien Le
    Le Thai Linh
    Nguyen Hong Son
    Do Hoang Thai Duong
    Bui Cong Minh
    Nguyen Hai Phong
    Nguyen Huu Hiep
    CIKM '20: PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT, 2020, : 3437 - 3440