Automatic localization and extraction of tables from handheld mobile-camera captured handwritten document images

Cited by: 5
Authors
Amarnath, R. [1]
Sindhushree, G. S. [1]
Nagabhushan, P. [1, 2]
Javed, Mohammed [2]
Affiliations
[1] Univ Mysore, Dept Studies Comp Sci, Mysore 570006, Karnataka, India
[2] Indian Inst Informat Technol Allahabad, Dept Informat Technol, Allahabad, Uttar Pradesh, India
Keywords
Handwritten document images; mobile-cameras; block-wise mean-computed fuzzy based binarization; fast edge-feature extraction; localizing the table;
DOI
10.3233/JIFS-181242
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
A table is a compact, effective and structured way of representing information in any document. Automatically localizing tables in scanned handwritten document images and extracting their information is a critical and challenging task for applications such as Optical Character Recognition, handwriting analysis, and auto-evaluation systems. The task becomes more complex when the handwritten document images are acquired with handheld mobile cameras, because the captured images are naturally distorted by poor illumination, device vibration, camera angle, camera orientation, camera movement, and camera distance. In this research article, a novel technique is proposed for the automatic localization and segmentation of tables in handwritten document images captured with a handheld mobile camera. Generally, ruling lines are used for structuring tables, sketching figures, and scribing scientific equations. In the current research work, tables are detected and extracted based on edge features of the ruling lines, in three main stages. Firstly, a block-wise mean-computed fuzzy based binarization technique is proposed for analyzing the distortion in the acquired image, and subsequently the background surface that envelops the document area of the image is removed. Secondly, a horizontal and vertical granule (strip) based technique is proposed for fast edge-feature extraction from the ruling lines of the table in the binarized image. Finally, entropy quantifiers are employed for segmenting the table in the image. The performance of the proposed technique is evaluated and reported using the proposed composite handwritten benchmark dataset. A linear computational cost of O(h × w) is observed in the worst case.
Pages: 2527-2544
Number of pages: 18
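The abstract names three stages but gives no implementation details, so the following is only a rough sketch of the general idea, not the authors' method. It assumes a plain block-wise mean threshold in place of the fuzzy binarization, a simple row/column fill ratio in place of the strip-based edge features, and an ordinary Shannon entropy as a stand-in for the entropy quantifiers. All function names and the parameters `block`, `margin` and `min_fill` are hypothetical.

```python
import math


def blockwise_mean_binarize(gray, block=4, margin=10):
    """Mark a pixel as ink (1) when it is clearly darker than its block mean.

    Assumption: a crisp threshold (mean - margin) replaces the fuzzy rule
    mentioned in the abstract, whose details are not given there.
    """
    h, w = len(gray), len(gray[0])
    out = [[0] * w for _ in range(h)]
    for by in range(0, h, block):
        for bx in range(0, w, block):
            ys = range(by, min(by + block, h))
            xs = range(bx, min(bx + block, w))
            mean = sum(gray[y][x] for y in ys for x in xs) / (len(ys) * len(xs))
            for y in ys:
                for x in xs:
                    out[y][x] = 1 if gray[y][x] < mean - margin else 0
    return out


def ruling_lines(binary, min_fill=0.8):
    """Return (rows, cols) whose ink fill ratio suggests a ruling line."""
    h, w = len(binary), len(binary[0])
    rows = [y for y in range(h) if sum(binary[y]) / w >= min_fill]
    cols = [x for x in range(w)
            if sum(binary[y][x] for y in range(h)) / h >= min_fill]
    return rows, cols


def shannon_entropy(counts):
    """Shannon entropy (in bits) of a histogram or projection profile."""
    total = sum(counts)
    if total == 0:
        return 0.0
    probs = [c / total for c in counts if c > 0]
    return -sum(p * math.log2(p) for p in probs)


if __name__ == "__main__":
    # Synthetic 8 x 8 page: bright background, one dark row, one dark column.
    gray = [[200] * 8 for _ in range(8)]
    for x in range(8):
        gray[3][x] = 20
    for y in range(8):
        gray[y][5] = 20

    binary = blockwise_mean_binarize(gray)
    print(ruling_lines(binary))
    # Entropy of the row-wise ink profile of the binarized image.
    print(shannon_entropy([sum(row) for row in binary]))
```

Each function visits every pixel a constant number of times, which is consistent with the O(h × w) cost quoted in the abstract, although the paper's own stages may be organised quite differently. On the synthetic page, the sketch reports row 3 and column 5 as ruling-line candidates.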