Intelligent automation of invoice parsing using computer vision techniques

Cited by: 8
Authors
Chazhoor, Anisha [1]
Sarobin, Vergin Raja [1]
Affiliations
[1] Vellore Inst Technol, Sch Comp Sci & Engn, Chennai, Tamil Nadu, India
Keywords
Document parsing; Computer vision; Object detection; Transfer learning; Optical character recognition; Region-based convolutional neural network
DOI
10.1007/s11042-022-12916-x
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Manual parsing of invoices is a tedious and error-prone task. Because of its academic and business importance, the problem has attracted the attention of machine learning enthusiasts. Automated invoice parsing faces several challenges, including a paucity of useful datasets, widely varying template formats, and poor performance of algorithms in real-life scenarios. The problem can be addressed by running object detection algorithms such as YOLO, SSD and R-CNN over the invoices. These state-of-the-art algorithms are trained to detect the various fields, or entities, present in an invoice. In this paper, a dataset of 315 invoices has been generated using web testing tools. The dataset has been annotated for eight entities: billing address, shipping address, invoice date, invoice number, product name, price, quantity, and total amount. The text boxes detected by the models are converted to machine-encoded text using text extraction methods such as Optical Character Recognition (OCR). Hyperparameter tuning has been performed to improve model accuracy. The models have been evaluated on several metrics, including mean Average Precision (mAP), the Common Objects in Context (COCO) evaluation metrics, and total loss during training and validation. The loss vs. iteration graph has been visualized using TensorBoard. A front-end application encapsulates all the functions of the research paper and allows testing of the various models.
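As a rough illustration of the evaluation step named in the abstract: mAP and the COCO metrics both rest on Intersection over Union (IoU) between a predicted box and an annotated box. The sketch below is not the authors' code. The box coordinates, the `(x_min, y_min, x_max, y_max)` pixel layout, and the helper name `is_true_positive` are assumptions made for illustration; only the entity label comes from the abstract.

```python
from typing import Tuple

# Assumed box layout: (x_min, y_min, x_max, y_max) in pixels.
Box = Tuple[float, float, float, float]


def iou(a: Box, b: Box) -> float:
    """Intersection over Union of two axis-aligned boxes."""
    ix_min, iy_min = max(a[0], b[0]), max(a[1], b[1])
    ix_max, iy_max = min(a[2], b[2]), min(a[3], b[3])
    # Width and height of the overlap are clamped to zero for disjoint boxes.
    inter = max(0.0, ix_max - ix_min) * max(0.0, iy_max - iy_min)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def is_true_positive(pred_label: str, pred_box: Box,
                     gt_label: str, gt_box: Box,
                     threshold: float = 0.5) -> bool:
    """Hypothetical helper: a detection counts as correct when the entity
    label matches and the boxes overlap by at least `threshold` IoU."""
    return pred_label == gt_label and iou(pred_box, gt_box) >= threshold


# Made-up boxes for one of the eight annotated entities.
gt_label, gt_box = "invoice number", (100.0, 50.0, 300.0, 80.0)
pred_label, pred_box = "invoice number", (110.0, 52.0, 310.0, 82.0)

overlap = iou(pred_box, gt_box)
print(f"IoU = {overlap:.3f}")
print(is_true_positive(pred_label, pred_box, gt_label, gt_box, 0.5))
print(is_true_positive(pred_label, pred_box, gt_label, gt_box, 0.95))
```

The same detection can pass at a loose threshold and fail at a strict one, which is why the primary COCO metric averages AP over IoU thresholds from 0.50 to 0.95 in steps of 0.05 rather than reporting a single threshold.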
Pages: 29383-29403
Number of pages: 21
Related papers
50 in total
  • [1] Intelligent automation of invoice parsing using computer vision techniques
    Anisha Chazhoor
    Vergin Raja Sarobin
    [J]. Multimedia Tools and Applications, 2022, 81 : 29383 - 29403
  • [2] Intelligent Traffic Signal Automation Based on Computer Vision Techniques Using Deep Learning
    Ubaid, Muhammad Talha
    Saba, Tanzila
    Draz, Hafiz Umer
    Rehman, Amjad
    Ghani, Muhammad Usman
    Kolivand, Hoshang
    [J]. IT PROFESSIONAL, 2022, 24 (01) : 27 - 33
  • [3] Design and Implementation of an Intelligent Nail Machine with Computer Vision Techniques
    Hung, Chi-Huang
    Wang, Yuan-Kai
    Shih, Pei-Yen
    Huang, Hsin-Yu
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS (ICCE), 2020, : 708 - 709
  • [4] Research on Information Recognition of VAT Invoice Based on Computer Vision
    Zhang, Jiaqiao
    Ren, Fuji
    Ni, Hongiun
    Zhang, Zhenya
    Wang, Kaixuan
    [J]. PROCEEDINGS OF 2019 6TH IEEE INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND INTELLIGENCE SYSTEMS (CCIS), 2019, : 126 - 130
  • [5] Intelligent pesticide recommendation system for cocoa plant using computer vision and deep learning techniques
    Arakeri, Megha
    M P, M. P.
    Kavan, A., V
    Murthy, Kamma Sushreya
    Nishitha, Nagineni Lakshmi
    Lakshmi, Napa
    [J]. ENVIRONMENTAL RESEARCH COMMUNICATIONS, 2024, 6 (07):
  • [6] Computer vision and highway automation
    Dickmanns, ED
    [J]. VEHICLE SYSTEM DYNAMICS, 1999, 31 (5-6) : 325 - 343
  • [7] Hybrid optoelectronic processing and computer vision techniques for intelligent debris analysis
    Wu, QMJ
    Grover, CP
    Dumitras, A
    Liew, D
    Jerbi, A
    [J]. ALGORITHMS, DEVICES, AND SYSTEMS FOR OPTICAL INFORMATION PROCESSING, 1998, 3466 : 94 - 103
  • [8] Industrial exploitation of Computer Vision in logistic automation: Autonomous control of an intelligent forklift truck
    Garibotto, G
    Masciangelo, S
    Bassino, P
    Coelho, C
    Pavan, A
    Marson, M
    Bailey, E
    [J]. 1998 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, VOLS 1-4, 1998, : 1459 - 1464
  • [9] DRAGLINE SELECTION USING INTELLIGENT COMPUTER TECHNIQUES
    DENBY, B
    SCHOFIELD, D
    [J]. TRANSACTIONS OF THE INSTITUTION OF MINING AND METALLURGY SECTION A-MINING INDUSTRY, 1992, 101 : A79 - A84
  • [10] A Computer Vision-Based Intelligent Fish Feeding System Using Deep Learning Techniques for Aquaculture
    Hu, Wu-Chih
    Chen, Liang-Bi
    Huang, Bo-Kai
    Lin, Hong-Ming
    [J]. IEEE SENSORS JOURNAL, 2022, 22 (07) : 7185 - 7194