Intelligent automation of invoice parsing using computer vision techniques

被引:8
|
作者
Chazhoor, Anisha [1 ]
Sarobin, Vergin Raja [1 ]
机构
[1] Vellore Inst Technol, Sch Comp Sci & Engn, Chennai, Tamil Nadu, India
关键词
Document parsing; Computer vision; Object detection; Transfer learning; Optical character recognition; Region-based convolutional neural network;
D O I
10.1007/s11042-022-12916-x
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Manual parsing of invoices is a tedious, arduous and error-prone task. Due to the academic and business importance of this problem, it has attracted the attention of machine learning enthusiasts. There are several complexities and challenges in the automated parsing of invoices. Some of them include a paucity of useful datasets, eclectic template formats, and poor performance of algorithms in real life scenarios. This problem can be solved by the automatic traversal of the invoices by object detection algorithms such as YOLO, SSD and R-CNN. These state-of-the-art algorithms will be trained to detect various fields or entities present in an invoice. In this paper, a dataset of 315 invoices has been generated using web testing tools. The dataset has been annotated for eight entities: billing address, shipping address, invoice date, invoice number, product name, price, quantity, and total amount. The text boxes detected by the models is converted to machine encoded text, using text extraction methods such as Optical Character Recognition (OCR). Hyperparameter tuning has been performed to improve model accuracy. The models have been evaluated on myriad metrics such as mean Average Precision (mAP), common objects in context (COCO) evaluation metrics and total loss during training and validation. The loss vs iteration graph has been visualized using Tensorboard. A front-end application encapsulates all the functions of the research paper and allows testing of various models.
引用
收藏
页码:29383 / 29403
页数:21
相关论文
共 50 条
  • [21] Predicting Eye Fixations Using Computer Vision Techniques
    Alevizaki, Ada
    Melanitis, Nikos
    Nikita, Konstantina
    [J]. 2019 IEEE 19TH INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOENGINEERING (BIBE), 2019, : 309 - 315
  • [22] Automation of a Wheelchair Mounted Robotic Arm using Computer Vision Interface
    Karuppiah, Priyanka
    Metalia, Hem
    George, Kiran
    [J]. 2018 IEEE INTERNATIONAL INSTRUMENTATION AND MEASUREMENT TECHNOLOGY CONFERENCE (I2MTC): DISCOVERING NEW HORIZONS IN INSTRUMENTATION AND MEASUREMENT, 2018, : 1922 - 1926
  • [23] Analysis of esthetic smiles by using computer vision techniques
    Wong, NKC
    Kassim, AA
    Foong, KWC
    [J]. AMERICAN JOURNAL OF ORTHODONTICS AND DENTOFACIAL ORTHOPEDICS, 2005, 128 (03) : 404 - 411
  • [24] Detecting Driver Drowsiness Using Computer Vision Techniques
    Vural, Esra
    Cetin, Muejdat
    Ercil, Aytuel
    Littlewort, Gwen
    Bartlett, Marian
    Movellan, Javier
    [J]. 2008 IEEE 16TH SIGNAL PROCESSING, COMMUNICATION AND APPLICATIONS CONFERENCE, VOLS 1 AND 2, 2008, : 549 - +
  • [25] Detecting road potholes using computer vision techniques
    Camilleri, Neil
    Gatt, Thomas
    [J]. 2020 IEEE 16TH INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTER COMMUNICATION AND PROCESSING (ICCP 2020), 2020, : 343 - 350
  • [26] Automated video segmentation using computer vision techniques
    Yoo, HW
    Jang, DS
    [J]. INTERNATIONAL JOURNAL OF INFORMATION TECHNOLOGY & DECISION MAKING, 2004, 3 (01) : 129 - 143
  • [27] Structural identification of bridges using computer vision techniques
    Dong, C. Z.
    Catbas, F. N.
    [J]. ADVANCES IN ENGINEERING MATERIALS, STRUCTURES AND SYSTEMS: INNOVATIONS, MECHANICS AND APPLICATIONS, 2019, : 2096 - 2100
  • [28] Fabric Texture Analysis Using Computer Vision Techniques
    Wang, Xin
    Georganas, Nicolas D.
    Petriu, Emil M.
    [J]. IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2011, 60 (01) : 44 - 56
  • [29] Automatic chickpea classification using computer vision techniques
    Sabzi, Sajad
    Manuel Garcia-Amicis, Victor
    Abbaspour-Gilandeh, Yousef
    Garcia-Mateos, Gines
    Miguel Molina-Martinez, Jose
    [J]. IX CONGRESO IBERICO DE AGROINGENIERIA - LIBROS DE ACTAS, 2018, : 1167 - 1176
  • [30] Classroom Engagement Evaluation using Computer Vision Techniques
    Duraisamy, Prakash
    Van Haneghan, James
    Blackwell, William
    Jackson, Stephen
    Murugesan, G.
    Tamilselvan, K. S.
    [J]. PATTERN RECOGNITION AND TRACKING XXX, 2019, 10995