IIRM: Intelligent Information Retrieval Model for Structured Documents by One-Shot Training Using Computer Vision

被引：0

作者：

Abhijit Guha

Debabrata Samanta

SK Hafizul Islam

机构：

[1] Christ (Deemed to be) University,Department of Computer Science

[2] First American India Private Limited,Department of Computer Science and Engineering

[3] Indian Institute of Information Technology Kalyani,undefined

来源：

Arabian Journal for Science and Engineering | 2023年 / 48卷

关键词：

Digital image processing; Information extraction; Best match region; One-shot training; Structured document; Template matching; Title insurance;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

Various information retrieval algorithms have matured in recent years to facilitate data extraction from structured (with a predefined template) digital document images, primarily to manage and automate different organizations’ invoice and bill reimbursement processes. The algorithms are designated either rule-based or machine-learning-based. Both approaches have respective advantages and disadvantages. The rule-based algorithms struggle to generalize and need periodic adjustments, whereas machine learning-based supervised approaches need extensive data for training and substantial time and effort for manual annotation. The proposed system attempts to address both problems by providing a one-shot training approach using image processing, template matching, and optical character recognition. The model is extensible for any structured documents such as closing disclosure, bill, tax receipt, besides invoices. The model is validated against six different structured document types obtained from a reputed title insurance (TI) company. The comprehensive analysis of the experimental results confirms entity-wise extraction accuracy between 73.91 and 100% and straight through pass 81.81%, which is within business acceptable precision for a live environment. Out of total 32 tested entities, 17 outperformed all state-of-the-art techniques, where max accuracy has been 93%\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$93\%$$\end{document} with only invoices or sales receipts. The system has been set operational to assist the robotic process automation of the TI mentioned above based on the experimental results.

引用

页码：1285 / 1301

页数：16

共 31 条

[1] IIRM: Intelligent Information Retrieval Model for Structured Documents by One-Shot Training Using Computer Vision
Guha, Abhijit
Samanta, Debabrata
Islam, S. K. Hafizul
[J]. ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2023, 48 (02) : 1285 - 1301
[2] One-Shot Texture Retrieval Using Global Grouping Metric
Zhu, Kai
Cao, Yang
Zhai, Wei
Zha, Zheng-Jun
[J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 23 : 3726 - 3737
[3] A machine learning model for information retrieval with structured documents
Piwowarski, B
Gallinari, P
[J]. MACHINE LEARNING AND DATA MINING IN PATTERN RECOGNITION, PROCEEDINGS, 2003, 2734 : 425 - 438
[4] Single Color One-Shot Scan Using Topology Information
Kawasaki, Hiroshi
Masuyama, Hitoshi
Sagawa, Ryusuke
Furukawa, Ryo
[J]. COMPUTER VISION - ECCV 2012, PT III, 2012, 7585 : 486 - 495
[5] Artificial Intelligent Information Retrieval Using Assigning Context of Documents
Liu Yong-Min
Cheng Shu
[J]. NSWCTC 2009: INTERNATIONAL CONFERENCE ON NETWORKS SECURITY, WIRELESS COMMUNICATIONS AND TRUSTED COMPUTING, VOL 2, PROCEEDINGS, 2009, : 592 - +
[6] Architecture for one-shot compressive imaging using computer-generated holograms
Macfaden, Alexander J.
Kindness, Stephen J.
Wilkinson, Timothy D.
[J]. APPLIED OPTICS, 2016, 55 (26) : 7399 - 7405
[7] Fast 3D Reconstruction using One-shot Spatial Structured Light
Huang, Bingyao
Tang, Ying
[J]. 2014 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC), 2014, : 531 - 536
[8] One-shot phase retrieval method for interferometry using a hypercolumns convolutional neural network
Zhao, Zhuo
Li, Bing
Lu, Jiasheng
Kang, Xiaoqin
Liu, Tongkun
[J]. OPTICS EXPRESS, 2021, 29 (11) : 16406 - 16421
[9] ARticulate: One-Shot Interactions with Intelligent Assistants in Unfamiliar Smart Spaces Using Augmented Reality
Clark, Meghan
Newman, Mark W.
Dutta, Prabal
[J]. PROCEEDINGS OF THE ACM ON INTERACTIVE MOBILE WEARABLE AND UBIQUITOUS TECHNOLOGIES-IMWUT, 2022, 6 (01):
[10] Ranking structured documents using utility theory in the Bayesian Network retrieval model
Crestani, F
de Campos, LM
Fernández-Luna, JM
Huete, JF
[J]. STRING PROCESSING AND INFORMATION RETRIEVAL, PROCEEDINGS, 2003, 2857 : 168 - 182

← 1 2 3 4 →