DocExtractNet: A novel framework for enhanced information extraction from business documents

Citations: 0
Authors
Yan, Zhengjin [1]
Ye, Zheng [1]
Ge, Jun [2]
Qin, Jun [1]
Liu, Jing [1]
Cheng, Yu [3]
Gurrin, Cathal [4]
Affiliations
[1] South Cent Minzu Univ, Coll Comp Sci & Informat Phys Fus Intelligent Comp, Key Lab Natl Ethn Affairs Commiss, Wuhan, Peoples R China
[2] Wuchang Univ Technol, Sch Artificial Intelligence, Wuhan, Peoples R China
[3] Hangzhou Boyan Private Equ Fund Management Partner, Hangzhou, Peoples R China
[4] Dublin City Univ, Dublin, Ireland
Keywords
Receipt information extraction; LayoutLMv3; ImageEnhance; PrecisionHints; CrossModalFusion
DOI
10.1016/j.ipm.2024.104046
Chinese Library Classification number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Efficient extraction of critical information from receipts is essential for automating financial processes and supporting timely decision-making in businesses. However, the task faces significant challenges: the quality of scanned receipt images varies with the scanning equipment, receipt formats are diverse and complex, and handwritten elements and noise make accurate extraction particularly difficult. To address these issues, we propose DocExtractNet, a model framework based on LayoutLMv3 and designed for extracting key information from receipts. First, we introduce the ImageEnhance method to process image modality features, enhancing image clarity and significantly improving recognition accuracy for low-quality images. Then, we implement the PrecisionHints strategy to supplement missing key-value pairs in the text modality, improving data integrity and the model's overall performance. Furthermore, we apply the CrossModalFusion method to combine image and text features, allowing the model to better understand and extract receipt information. Experimental results on the Finance-Receipts, FUNSD, and CORD datasets show that DocExtractNet significantly improves F1 scores compared to other models, reaching 97.07% on Finance-Receipts, 91.80% on FUNSD, and 97.38% on CORD, highlighting its superior performance in receipt information extraction.
Pages: 15