Edge-based connected component approach for skew correction of complex document images

被引:0
|
作者
Kumar, J. [1 ]
Kasar, T. [1 ]
Ramakrishnan, A. G. [1 ]
机构
[1] Indian Inst Sci, Dept Elect Engn, Med Intelligence & Language Engn Lab, Bangalore 560012, Karnataka, India
关键词
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Skew correction of complex document images is a difficult task. We propose an edge-based connected component approach for robust skew correction of documents with complex layout and content. The algorithm essentially consists of two steps - an 'initialization' step to determine the image orientation from the centroids of the connected components and a 'search' step to find the actual skew of the image. During initialization, we choose two different sets of points regularly spaced across the the image, one from the left to right and the other from top to bottom. The image orientation is determined from the slope between the two succesive nearest neighbors of each of the points in the chosen set. The search step finds succesive nearest neighbors that satisfy the parameters obtained in the initialization step. The final skew is determined from the slopes obtained in the 'search' step. Unlike other connected component based methods, the proposed method does not require any binarization step that generally precedes connected component analysis. The method works well for scanned documents with complex layout of any skew with a precision of 0.5 degrees.
引用
收藏
页码:1233 / 1236
页数:4
相关论文
共 50 条
  • [1] Edge-based method for text detection from complex document images
    Pietikäinen, M
    Okun, O
    [J]. SIXTH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION, PROCEEDINGS, 2001, : 286 - 291
  • [2] Entropy based Skew correction of document images
    Arvind, K. R.
    Kumar, Jayant
    Ramakrishnan, A. G.
    [J]. PATTERN RECOGNITION AND MACHINE INTELLIGENCE, PROCEEDINGS, 2007, 4815 : 495 - 502
  • [3] Scanned Document Images Skew Correction Based on Shearlet Transform
    Zhang, Fan
    Zhang, Yifan
    Qu, Xingxing
    Liu, Bin
    Zhang, Ruoya
    [J]. MULTI-DISCIPLINARY TRENDS IN ARTIFICIAL INTELLIGENCE, MIWAI 2015, 2015, 9426 : 226 - 232
  • [4] A Combined Edge and Connected Component Based Approach for Kannada Text Detection in Images
    Siddiqua, Shahzia
    Naveena, C.
    Manvi, Sunil Kumar
    [J]. 2017 INTERNATIONAL CONFERENCE ON RECENT ADVANCES IN ELECTRONICS AND COMMUNICATION TECHNOLOGY (ICRAECT), 2017, : 121 - 125
  • [5] Multiscale edge-based text extraction from complex images
    Liu, Xiaoqing
    Samarabandu, Jagath
    [J]. 2006 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO - ICME 2006, VOLS 1-5, PROCEEDINGS, 2006, : 1721 - +
  • [6] A Correction Algorithm for Document Images Based on Edge Contour
    Ding, Jianhao
    Lin, Zhijie
    Yu, Lingyun
    [J]. PROCEEDINGS OF THE 2015 INTERNATIONAL CONFERENCE ON INDUSTRIAL TECHNOLOGY AND MANAGEMENT SCIENCE (ITMS 2015), 2015, 34 : 105 - 108
  • [7] Increasing resolution of digital images using edge-based approach
    Swierczynski, Z.
    Rokita, P.
    [J]. OPTO-ELECTRONICS REVIEW, 2008, 16 (01) : 76 - 84
  • [8] A hybrid edge-based segmentation approach for ultrasound medical images
    Gupta, Deep
    Anand, R. S.
    [J]. BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2017, 31 : 116 - 126
  • [9] Efficient skew estimation and correction algorithm for document images
    Kwag, HK
    Kim, SH
    Jeong, SH
    Lee, GS
    [J]. IMAGE AND VISION COMPUTING, 2002, 20 (01) : 25 - 35
  • [10] Skew Angle Estimation and Correction for Noisy Document Images
    Manomathi, M.
    Chitrakala, S.
    [J]. ADVANCES IN COMPUTING AND COMMUNICATIONS, PT III, 2011, 192 : 415 - 424