IIRM: Intelligent Information Retrieval Model for Structured Documents by One-Shot Training Using Computer Vision

被引:0
|
作者
Abhijit Guha
Debabrata Samanta
SK Hafizul Islam
机构
[1] Christ (Deemed to be) University,Department of Computer Science
[2] First American India Private Limited,Department of Computer Science and Engineering
[3] Indian Institute of Information Technology Kalyani,undefined
关键词
Digital image processing; Information extraction; Best match region; One-shot training; Structured document; Template matching; Title insurance;
D O I
暂无
中图分类号
学科分类号
摘要
Various information retrieval algorithms have matured in recent years to facilitate data extraction from structured (with a predefined template) digital document images, primarily to manage and automate different organizations’ invoice and bill reimbursement processes. The algorithms are designated either rule-based or machine-learning-based. Both approaches have respective advantages and disadvantages. The rule-based algorithms struggle to generalize and need periodic adjustments, whereas machine learning-based supervised approaches need extensive data for training and substantial time and effort for manual annotation. The proposed system attempts to address both problems by providing a one-shot training approach using image processing, template matching, and optical character recognition. The model is extensible for any structured documents such as closing disclosure, bill, tax receipt, besides invoices. The model is validated against six different structured document types obtained from a reputed title insurance (TI) company. The comprehensive analysis of the experimental results confirms entity-wise extraction accuracy between 73.91 and 100% and straight through pass 81.81%, which is within business acceptable precision for a live environment. Out of total 32 tested entities, 17 outperformed all state-of-the-art techniques, where max accuracy has been 93%\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$93\%$$\end{document} with only invoices or sales receipts. The system has been set operational to assist the robotic process automation of the TI mentioned above based on the experimental results.
引用
收藏
页码:1285 / 1301
页数:16
相关论文
共 31 条
  • [1] IIRM: Intelligent Information Retrieval Model for Structured Documents by One-Shot Training Using Computer Vision
    Guha, Abhijit
    Samanta, Debabrata
    Islam, S. K. Hafizul
    [J]. ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2023, 48 (02) : 1285 - 1301
  • [2] One-Shot Texture Retrieval Using Global Grouping Metric
    Zhu, Kai
    Cao, Yang
    Zhai, Wei
    Zha, Zheng-Jun
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 23 : 3726 - 3737
  • [3] A machine learning model for information retrieval with structured documents
    Piwowarski, B
    Gallinari, P
    [J]. MACHINE LEARNING AND DATA MINING IN PATTERN RECOGNITION, PROCEEDINGS, 2003, 2734 : 425 - 438
  • [4] Single Color One-Shot Scan Using Topology Information
    Kawasaki, Hiroshi
    Masuyama, Hitoshi
    Sagawa, Ryusuke
    Furukawa, Ryo
    [J]. COMPUTER VISION - ECCV 2012, PT III, 2012, 7585 : 486 - 495
  • [5] Artificial Intelligent Information Retrieval Using Assigning Context of Documents
    Liu Yong-Min
    Cheng Shu
    [J]. NSWCTC 2009: INTERNATIONAL CONFERENCE ON NETWORKS SECURITY, WIRELESS COMMUNICATIONS AND TRUSTED COMPUTING, VOL 2, PROCEEDINGS, 2009, : 592 - +
  • [6] Architecture for one-shot compressive imaging using computer-generated holograms
    Macfaden, Alexander J.
    Kindness, Stephen J.
    Wilkinson, Timothy D.
    [J]. APPLIED OPTICS, 2016, 55 (26) : 7399 - 7405
  • [7] Fast 3D Reconstruction using One-shot Spatial Structured Light
    Huang, Bingyao
    Tang, Ying
    [J]. 2014 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC), 2014, : 531 - 536
  • [8] One-shot phase retrieval method for interferometry using a hypercolumns convolutional neural network
    Zhao, Zhuo
    Li, Bing
    Lu, Jiasheng
    Kang, Xiaoqin
    Liu, Tongkun
    [J]. OPTICS EXPRESS, 2021, 29 (11) : 16406 - 16421
  • [9] ARticulate: One-Shot Interactions with Intelligent Assistants in Unfamiliar Smart Spaces Using Augmented Reality
    Clark, Meghan
    Newman, Mark W.
    Dutta, Prabal
    [J]. PROCEEDINGS OF THE ACM ON INTERACTIVE MOBILE WEARABLE AND UBIQUITOUS TECHNOLOGIES-IMWUT, 2022, 6 (01):
  • [10] Ranking structured documents using utility theory in the Bayesian Network retrieval model
    Crestani, F
    de Campos, LM
    Fernández-Luna, JM
    Huete, JF
    [J]. STRING PROCESSING AND INFORMATION RETRIEVAL, PROCEEDINGS, 2003, 2857 : 168 - 182