Efficient skew detection of printed document images based on novel combination of enhanced profiles

被引:11
|
作者
Papandreou, A. [1 ,2 ]
Gatos, B. [2 ]
Perantonis, S. J. [2 ]
Gerardis, I. [2 ]
机构
[1] Univ Athens, Dept Informat & Telecommun, Athens 15784, Greece
[2] Natl Ctr Sci Res Demokritos, Inst Informat & Telecommun, Athens 15310, Greece
关键词
Document skew correction; Projection profiles; Document image preprocessing;
D O I
10.1007/s10032-014-0228-5
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Document skew is often introduced during the capturing process of the document image processing pipeline and may seriously affect the performance of subsequent stages of segmentation and recognition. Skew detection is often accomplished with the use of horizontal projections, while recently, a new approach that is based on vertical projections has been introduced. In this paper, we use the technique of minimum bounding box area in order to combine a horizontal with a new reinforced vertical projection profile method. We are motivated by the fact that the horizontal and the novel vertical projection profiles are found to be complementary to each other. We claim that the proposed approach has more accurate performance compared with other state-of-the-art skew detection algorithms; it deals with all the drawbacks of the projection profile methods; it is more noise and warp resistant and gives accurate results for any kind of printed document image. For these reasons, it can be efficiently applied to historical machine printed or multicolumn documents, documents with figures and tables, while it is robust for any kind of script. Extended experimental results on two databases in different skew angle range, with representative printed documents of all kinds, as well as printed documents of two historical books, prove the efficiency of the proposed approach. There is also a comparison with commercial products in several cases where the contribution of the proposed algorithm is demonstrated at optical character recognition level. Moreover, an analysis of the accuracy performance of the main elements of the proposed technique is also performed.
引用
收藏
页码:433 / 454
页数:22
相关论文
共 50 条
  • [31] Voting-Based Document Image Skew Detection
    Boiangiu, Costin-Anton
    Dinu, Ovidiu-Alexandru
    Popescu, Cornel
    Constantin, Nicolae
    Petrescu, Catalin
    APPLIED SCIENCES-BASEL, 2020, 10 (07):
  • [32] A new boundary growing and Hough transform based approach for accurate skew detection in binary document images
    Shivakumara, P
    Kumar, GH
    Guru, DS
    Nagabhushan, P
    2005 INTERNATIONAL CONFERENCE ON INTELLIGENT SENSING AND INFORMATION PROCESSING, PROCEEDINGS, 2005, : 140 - 146
  • [33] Binarization of Degraded Document Images Based on Combination of Contrast Images
    Arruda, A. W. A.
    Mello, C. A. B.
    2014 14TH INTERNATIONAL CONFERENCE ON FRONTIERS IN HANDWRITING RECOGNITION (ICFHR), 2014, : 615 - 620
  • [34] e-PCP: A robust skew detection method for scanned document images
    Dey, Prasenjit
    Noushath, S.
    PATTERN RECOGNITION, 2010, 43 (03) : 937 - 948
  • [35] An Efficient Thresholding Algorithm for Degraded Document Images Based on Intelligent Block Detection
    Chang, Yi-Fan
    Pai, Yu-Ting
    Ruan, Shanq-Jang
    2008 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC), VOLS 1-6, 2008, : 667 - 672
  • [36] A novel boundary growing approach for accurate skew estimation of binary document images
    Shivakumara, P
    Kumar, GH
    PATTERN RECOGNITION LETTERS, 2006, 27 (07) : 791 - 801
  • [37] Skew Estimation Technique for Binary Document Images based on Thinning and Moments
    Aradhya, Manjunath V. N.
    Kumar, Hemantha G.
    Shivakumara, P.
    ENGINEERING LETTERS, 2007, 14 (01)
  • [38] Robust Skew Estimation of Handwritten and Printed Documents based on Grayvalue Images
    Kleber, Florian
    Diem, Markus
    Sablatnig, Robert
    2014 22ND INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2014, : 3020 - 3025
  • [39] Skew detection algorithm for form document based on elongate feature
    Xie, Feng-ying
    Jiang, Zhi-guo
    Wang, Lei
    ENERGY MINIMIZATION METHODS IN COMPUTER VISION AND PATTERN RECOGNITION, PROCEEDINGS, 2007, 4679 : 127 - +
  • [40] Skew Detection of Scanned Document Image Based on Shearlet Transform
    Zhang Xinhong
    Zhang Yifan
    Zhang Fan
    LASER & OPTOELECTRONICS PROGRESS, 2018, 55 (01)