Automatic Table Detection and Retention from Scanned Document Images via Analysis of Structural Information

被引:0
|
作者
Ranka, Varsha [1 ]
Patil, Shubham [1 ]
Patni, Shubham [1 ]
Raut, Tushar [1 ]
Mehrotra, Kapil [2 ]
Gupta, Manish Kumar [2 ]
机构
[1] PICT, Dept Comp Engn, Pune, Maharashtra, India
[2] Ctr Dev Adv Comp, Pune, Maharashtra, India
关键词
Optical Character Recognition; Table detection; Table Retention; Layout analysis; Document Analysis and Recognition;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The problem of automatic table detection has always been a great topic of debate in the field of Document Analysis and Recognition (DAR). Digital documents are efficient than their printed counterparts for storage, maintenance and republishing. Being a non-textual object of a document, tables prevent OCR system to digitize a document perfectly and distorts layout and structure of digitized documents. There is no available algorithm or method which solves this problem for all possible types of tables. This paper tackles the problem of table detection and retention by proposing a bi-modular approach based on structural information of tables. This structural information includes bounding lines, row/column separators and space between columns. Through analysis of these properties, our experiments on a dataset of above 600 images consisting of more than 829 tables have detected 90% of the table correctly.
引用
收藏
页码:244 / 249
页数:6
相关论文
共 50 条
  • [21] Learning to Detect Tables in Scanned Document Images using Line Information
    Kasar, T.
    Barlas, P.
    Adam, S.
    Chatelain, C.
    Paquet, T.
    2013 12TH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), 2013, : 1185 - 1189
  • [22] Nonparametric Illumination Correction for Scanned Document Images via Convex Hulls
    Meng, Gaofeng
    Xiang, Shiming
    Zheng, Nanning
    Pan, Chunhong
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2013, 35 (07) : 1730 - 1743
  • [23] Improving table detection for document images using boundary
    Yingli Liu
    Jianfeng Zheng
    Guangtao Zhang
    Tao Shen
    Complex & Intelligent Systems, 2024, 10 : 1703 - 1714
  • [24] Improving table detection for document images using boundary
    Liu, Yingli
    Zheng, Jianfeng
    Zhang, Guangtao
    Shen, Tao
    COMPLEX & INTELLIGENT SYSTEMS, 2024, 10 (02) : 1703 - 1714
  • [25] Rethinking Learnable Proposals for Graphical Object Detection in Scanned Document Images
    Sinha, Sankalp
    Hashmi, Khurram Azeem
    Pagani, Alain
    Liwicki, Marcus
    Stricker, Didier
    Afzal, Muhammad Zeshan
    APPLIED SCIENCES-BASEL, 2022, 12 (20):
  • [26] Path Searching Based Crease Detection for Large Scale Scanned Document Images
    Zhang J.
    Li Y.
    Li S.
    Sun B.
    Sun J.
    Sensing and Imaging, 2017, 18 (1):
  • [27] Towards automatic tree rings detection in images of scanned wood samples
    Fabijanska, Anna
    Danek, Malgorzata
    Barniak, Joanna
    Piorkowski, Adam
    COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2017, 140 : 279 - 289
  • [28] e-PCP: A robust skew detection method for scanned document images
    Dey, Prasenjit
    Noushath, S.
    PATTERN RECOGNITION, 2010, 43 (03) : 937 - 948
  • [29] Cascade Network with Deformable Composite Backbone for Formula Detection in Scanned Document Images
    Hashmi, Khurram Azeem
    Pagani, Alain
    Liwicki, Marcus
    Stricker, Didier
    Afzal, Muhammad Zeshan
    APPLIED SCIENCES-BASEL, 2021, 11 (16):
  • [30] Table Detection in Document Images using Foreground and Background Features
    Arif, Saman
    Shafait, Faisal
    2018 INTERNATIONAL CONFERENCE ON DIGITAL IMAGE COMPUTING: TECHNIQUES AND APPLICATIONS (DICTA), 2018, : 245 - 252