IIRM: Intelligent Information Retrieval Model for Structured Documents by One-Shot Training Using Computer Vision

被引：0

作者：

Abhijit Guha

Debabrata Samanta

SK Hafizul Islam

机构：

[1] Christ (Deemed to be) University,Department of Computer Science

[2] First American India Private Limited,Department of Computer Science and Engineering

[3] Indian Institute of Information Technology Kalyani,undefined

来源：

Arabian Journal for Science and Engineering | 2023年 / 48卷

关键词：

Digital image processing; Information extraction; Best match region; One-shot training; Structured document; Template matching; Title insurance;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

Various information retrieval algorithms have matured in recent years to facilitate data extraction from structured (with a predefined template) digital document images, primarily to manage and automate different organizations’ invoice and bill reimbursement processes. The algorithms are designated either rule-based or machine-learning-based. Both approaches have respective advantages and disadvantages. The rule-based algorithms struggle to generalize and need periodic adjustments, whereas machine learning-based supervised approaches need extensive data for training and substantial time and effort for manual annotation. The proposed system attempts to address both problems by providing a one-shot training approach using image processing, template matching, and optical character recognition. The model is extensible for any structured documents such as closing disclosure, bill, tax receipt, besides invoices. The model is validated against six different structured document types obtained from a reputed title insurance (TI) company. The comprehensive analysis of the experimental results confirms entity-wise extraction accuracy between 73.91 and 100% and straight through pass 81.81%, which is within business acceptable precision for a live environment. Out of total 32 tested entities, 17 outperformed all state-of-the-art techniques, where max accuracy has been 93%\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$93\%$$\end{document} with only invoices or sales receipts. The system has been set operational to assist the robotic process automation of the TI mentioned above based on the experimental results.

引用

页码：1285 / 1301

页数：16

共 31 条

[21] 3D object retrieval based on histogram of local orientation using one-shot score support vector machine
Vahid Mehrdad
Hossein Ebrahimnezhad
[J]. Frontiers of Computer Science, 2015, 9 : 990 - 1005
[22] Structured Intelligent Search Engine for Effective Information Retrieval using Query Clustering Technique and Semantic Web
Prakasha, S.
Shashidhar, H. R.
Raju, G. T.
[J]. 2014 INTERNATIONAL CONFERENCE ON CONTEMPORARY COMPUTING AND INFORMATICS (IC3I), 2014, : 688 - 695
[23] An Accurate Reconstruction Model Using Structured Light of 3-D Computer Vision
Cui, Haihua
Dai, Ning
Liao, Wenhe
Cheng, Xiaosheng
[J]. 2008 7TH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION, VOLS 1-23, 2008, : 5100 - 5104
[24] One-Shot Federated Learning on Medical Data Using Knowledge Distillation with Image Synthesis and Client Model Adaptation
Kang, Myeongkyun
Chikontwe, Philip
Kim, Soopil
Jin, Kyong Hwan
Adeli, Ehsan
Pohl, Kilian M.
Park, Sang Hyun
[J]. MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2023, PT II, 2023, 14221 : 521 - 531
[25] Cross Language Information Retrieval Model For Discovering WSDL Documents Using Arabic Language Query
Sultan, Torkey I.
Khedr, Ayman E.
Alsheref, Fahad Kamal
[J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2013, 4 (08) : 118 - 129
[26] Computer Vision Enabled Building Digital Twin Using Building Information Model
Zhou, Xiaoping
Sun, Kaiyue
Wang, Jia
Zhao, Jichao
Feng, Chiyuan
Yang, Yalong
Zhou, Wei
[J]. IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2023, 19 (03) : 2684 - 2692
[27] Model Based Controller Design and Its Fine Tuning for Mechanical Resonant System Using One-Shot Experimental Data
Matsui, Yoshihiro
Ayano, Hideki
Nakano, Kazushi
[J]. 2013 10TH INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING/ELECTRONICS, COMPUTER, TELECOMMUNICATIONS AND INFORMATION TECHNOLOGY (ECTI-CON), 2013,
[28] Using Vagueness Measures to Re-rank Documents Retrieved by a Fuzzy Set Information Retrieval Model
Lynn, Stephen
Ng, Yiu-Kai
[J]. FIFTH INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY, VOL 5, PROCEEDINGS, 2008, : 39 - 43
[29] Computer Intelligent Assessment Method of Curriculum Information System Using AHP-FCE Model
Haixiang, Xu
Zheyu, Li
Zhongzheng, Wang
[J]. Haixiang, Xu (vj162774@163.com), 2021, Institute of Electrical and Electronics Engineers Inc. : 724 - 728
[30] CheXPrune: sparse chest X-ray report generation model using multi-attention and one-shot global pruning
Kaur N.
Mittal A.
[J]. Journal of Ambient Intelligence and Humanized Computing, 2023, 14 (06) : 7485 - 7497

← 1 2 3 4 →