DocExtractNet: A novel framework for enhanced information extraction from business documents

Authors
Yan, Zhengjin [1]
Ye, Zheng [1]
Ge, Jun [2]
Qin, Jun [1]
Liu, Jing [1]
Cheng, Yu [3]
Gurrin, Cathal [4]
Affiliations
[1] South Cent Minzu Univ, Coll Comp Sci & Informat Phys Fus Intelligent Comp, Key Lab Natl Ethn Affairs Commiss, Wuhan, Peoples R China
[2] Wuchang Univ Technol, Sch Artificial Intelligence, Wuhan, Peoples R China
[3] Hangzhou Boyan Private Equ Fund Management Partner, Hangzhou, Peoples R China
[4] Dublin City Univ, Dublin, Ireland
Keywords
Receipt information extraction; LayoutLMv3; ImageEnhance; PrecisionHints; CrossModalFusion
DOI
10.1016/j.ipm.2024.104046
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Efficient extraction of critical information from receipts is essential for automating financial processes and supporting timely decision-making in businesses. However, this process faces significant challenges: the quality of scanned receipt images varies with the scanning equipment, receipt formats are diverse, and handwritten elements and noise make accurate extraction particularly difficult. To address these issues, we propose DocExtractNet, a framework based on LayoutLMv3 and designed for extracting key information from receipts. First, we introduce the ImageEnhance method to process image-modality features, enhancing image clarity and significantly improving recognition accuracy for low-quality images. Second, we implement the PrecisionHints strategy to supplement missing key-value pairs in the text modality, improving data integrity and the model's overall performance. Third, we apply the CrossModalFusion method to combine image and text features, allowing the model to better understand and extract receipt information. Experimental results on the Finance-Receipts, FUNSD, and CORD datasets show that DocExtractNet significantly improves F1 scores compared to other models, reaching 97.07% on Finance-Receipts, 91.80% on FUNSD, and 97.38% on CORD, highlighting its superior performance in receipt information extraction.
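For readers unfamiliar with how F1 is typically scored in key information extraction, the following is a minimal sketch, assuming a micro-averaged F1 over exact-match (field, value) pairs. The abstract does not state the exact scoring protocol, so this may differ from the authors' evaluation; the function name `micro_f1` and the toy receipts are hypothetical.

```python
from collections import Counter


def micro_f1(predicted, gold):
    """Micro-averaged F1 over (field, value) pairs pooled across documents.

    Assumption: a prediction counts as correct only when both the field
    name and the value match a gold pair exactly.
    """
    tp = fp = fn = 0
    for pred_doc, gold_doc in zip(predicted, gold):
        pred_counts = Counter(pred_doc)
        gold_counts = Counter(gold_doc)
        # Multiset intersection: pairs present in both prediction and gold.
        overlap = sum((pred_counts & gold_counts).values())
        tp += overlap
        fp += sum(pred_counts.values()) - overlap
        fn += sum(gold_counts.values()) - overlap
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Hypothetical toy data: two receipts, one wrong date in the first.
gold = [
    [("total", "12.50"), ("date", "2024-01-05")],
    [("total", "8.00"), ("company", "ACME")],
]
predicted = [
    [("total", "12.50"), ("date", "2024-01-06")],
    [("total", "8.00"), ("company", "ACME")],
]
score = micro_f1(predicted, gold)
```

In this toy case three of four predicted pairs match and three of four gold pairs are recovered, so precision and recall are both 0.75.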
Pages: 15