IIRM: Intelligent Information Retrieval Model for Structured Documents by One-Shot Training Using Computer Vision

被引:0
|
作者
Abhijit Guha
Debabrata Samanta
SK Hafizul Islam
机构
[1] Christ (Deemed to be) University,Department of Computer Science
[2] First American India Private Limited,Department of Computer Science and Engineering
[3] Indian Institute of Information Technology Kalyani,undefined
关键词
Digital image processing; Information extraction; Best match region; One-shot training; Structured document; Template matching; Title insurance;
D O I
暂无
中图分类号
学科分类号
摘要
Various information retrieval algorithms have matured in recent years to facilitate data extraction from structured (with a predefined template) digital document images, primarily to manage and automate different organizations’ invoice and bill reimbursement processes. The algorithms are designated either rule-based or machine-learning-based. Both approaches have respective advantages and disadvantages. The rule-based algorithms struggle to generalize and need periodic adjustments, whereas machine learning-based supervised approaches need extensive data for training and substantial time and effort for manual annotation. The proposed system attempts to address both problems by providing a one-shot training approach using image processing, template matching, and optical character recognition. The model is extensible for any structured documents such as closing disclosure, bill, tax receipt, besides invoices. The model is validated against six different structured document types obtained from a reputed title insurance (TI) company. The comprehensive analysis of the experimental results confirms entity-wise extraction accuracy between 73.91 and 100% and straight through pass 81.81%, which is within business acceptable precision for a live environment. Out of total 32 tested entities, 17 outperformed all state-of-the-art techniques, where max accuracy has been 93%\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$93\%$$\end{document} with only invoices or sales receipts. The system has been set operational to assist the robotic process automation of the TI mentioned above based on the experimental results.
引用
收藏
页码:1285 / 1301
页数:16
相关论文
共 31 条
  • [21] 3D object retrieval based on histogram of local orientation using one-shot score support vector machine
    Vahid Mehrdad
    Hossein Ebrahimnezhad
    [J]. Frontiers of Computer Science, 2015, 9 : 990 - 1005
  • [22] Structured Intelligent Search Engine for Effective Information Retrieval using Query Clustering Technique and Semantic Web
    Prakasha, S.
    Shashidhar, H. R.
    Raju, G. T.
    [J]. 2014 INTERNATIONAL CONFERENCE ON CONTEMPORARY COMPUTING AND INFORMATICS (IC3I), 2014, : 688 - 695
  • [23] An Accurate Reconstruction Model Using Structured Light of 3-D Computer Vision
    Cui, Haihua
    Dai, Ning
    Liao, Wenhe
    Cheng, Xiaosheng
    [J]. 2008 7TH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION, VOLS 1-23, 2008, : 5100 - 5104
  • [24] One-Shot Federated Learning on Medical Data Using Knowledge Distillation with Image Synthesis and Client Model Adaptation
    Kang, Myeongkyun
    Chikontwe, Philip
    Kim, Soopil
    Jin, Kyong Hwan
    Adeli, Ehsan
    Pohl, Kilian M.
    Park, Sang Hyun
    [J]. MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2023, PT II, 2023, 14221 : 521 - 531
  • [25] Cross Language Information Retrieval Model For Discovering WSDL Documents Using Arabic Language Query
    Sultan, Torkey I.
    Khedr, Ayman E.
    Alsheref, Fahad Kamal
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2013, 4 (08) : 118 - 129
  • [26] Computer Vision Enabled Building Digital Twin Using Building Information Model
    Zhou, Xiaoping
    Sun, Kaiyue
    Wang, Jia
    Zhao, Jichao
    Feng, Chiyuan
    Yang, Yalong
    Zhou, Wei
    [J]. IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2023, 19 (03) : 2684 - 2692
  • [27] Model Based Controller Design and Its Fine Tuning for Mechanical Resonant System Using One-Shot Experimental Data
    Matsui, Yoshihiro
    Ayano, Hideki
    Nakano, Kazushi
    [J]. 2013 10TH INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING/ELECTRONICS, COMPUTER, TELECOMMUNICATIONS AND INFORMATION TECHNOLOGY (ECTI-CON), 2013,
  • [28] Using Vagueness Measures to Re-rank Documents Retrieved by a Fuzzy Set Information Retrieval Model
    Lynn, Stephen
    Ng, Yiu-Kai
    [J]. FIFTH INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY, VOL 5, PROCEEDINGS, 2008, : 39 - 43
  • [29] Computer Intelligent Assessment Method of Curriculum Information System Using AHP-FCE Model
    Haixiang, Xu
    Zheyu, Li
    Zhongzheng, Wang
    [J]. Haixiang, Xu (vj162774@163.com), 2021, Institute of Electrical and Electronics Engineers Inc. : 724 - 728
  • [30] CheXPrune: sparse chest X-ray report generation model using multi-attention and one-shot global pruning
    Kaur N.
    Mittal A.
    [J]. Journal of Ambient Intelligence and Humanized Computing, 2023, 14 (06) : 7485 - 7497