Efficient skew detection of printed document images based on novel combination of enhanced profiles

被引:11
|
作者
Papandreou, A. [1 ,2 ]
Gatos, B. [2 ]
Perantonis, S. J. [2 ]
Gerardis, I. [2 ]
机构
[1] Univ Athens, Dept Informat & Telecommun, Athens 15784, Greece
[2] Natl Ctr Sci Res Demokritos, Inst Informat & Telecommun, Athens 15310, Greece
关键词
Document skew correction; Projection profiles; Document image preprocessing;
D O I
10.1007/s10032-014-0228-5
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Document skew is often introduced during the capturing process of the document image processing pipeline and may seriously affect the performance of subsequent stages of segmentation and recognition. Skew detection is often accomplished with the use of horizontal projections, while recently, a new approach that is based on vertical projections has been introduced. In this paper, we use the technique of minimum bounding box area in order to combine a horizontal with a new reinforced vertical projection profile method. We are motivated by the fact that the horizontal and the novel vertical projection profiles are found to be complementary to each other. We claim that the proposed approach has more accurate performance compared with other state-of-the-art skew detection algorithms; it deals with all the drawbacks of the projection profile methods; it is more noise and warp resistant and gives accurate results for any kind of printed document image. For these reasons, it can be efficiently applied to historical machine printed or multicolumn documents, documents with figures and tables, while it is robust for any kind of script. Extended experimental results on two databases in different skew angle range, with representative printed documents of all kinds, as well as printed documents of two historical books, prove the efficiency of the proposed approach. There is also a comparison with commercial products in several cases where the contribution of the proposed algorithm is demonstrated at optical character recognition level. Moreover, an analysis of the accuracy performance of the main elements of the proposed technique is also performed.
引用
收藏
页码:433 / 454
页数:22
相关论文
共 50 条
  • [21] A novel approach for Skew estimation of document images in OCR system
    Sarfraz, M
    Zidouri, A
    Shahab, SA
    COMPUTER GRAPHICS, IMAGING AND VISION: NEW TRENDS, 2005, : 175 - 180
  • [22] COMBINATION OF HMMS FOR THE REPRESENTATION OF PRINTED CHARACTERS IN NOISY DOCUMENT IMAGES
    ELMS, AJ
    ILLINGWORTH, J
    IMAGE AND VISION COMPUTING, 1995, 13 (05) : 385 - 392
  • [23] A novel technique for estimation of skew in binary text document images based on linear regression analysis
    P. Shivakumara
    G. Hemantha Kumar
    D. S. Guru
    P. Nagabhushan
    Sadhana, 2005, 30 : 69 - 85
  • [24] A novel technique for estimation of skew in binary text document images based on linear regression analysis
    Shivakumara, P
    Kumar, GH
    Guru, DS
    Nagabhushan, P
    SADHANA-ACADEMY PROCEEDINGS IN ENGINEERING SCIENCES, 2005, 30 (1): : 69 - 85
  • [25] Skew detection of document images using mathematical morphology and Hough transform
    Wu, Xin
    Zhang, Zhi-Wei
    Nanjing Li Gong Daxue Xuebao/Journal of Nanjing University of Science and Technology, 2009, 33 (02): : 178 - 182
  • [26] AUTOMATED PAGE ORIENTATION AND SKEW ANGLE DETECTION FOR BINARY DOCUMENT IMAGES
    LE, DS
    THOMA, GR
    WECHSLER, H
    PATTERN RECOGNITION, 1994, 27 (10) : 1325 - 1344
  • [27] Scanned Document Images Skew Correction Based on Shearlet Transform
    Zhang, Fan
    Zhang, Yifan
    Qu, Xingxing
    Liu, Bin
    Zhang, Ruoya
    MULTI-DISCIPLINARY TRENDS IN ARTIFICIAL INTELLIGENCE, MIWAI 2015, 2015, 9426 : 226 - 232
  • [28] An efficient skew estimation technique for binary document images based on boundary growing and linear regression analysis
    Shivakumara, P
    Kumar, GH
    Guru, DS
    Nagabhushan, P
    NEURAL INFORMATION PROCESSING, 2004, 3316 : 659 - 665
  • [29] Document Skew Detection Based on Hough Space Derivatives
    Stahlberg, Felix
    Vogel, Stephan
    2015 13TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), 2015, : 366 - 370
  • [30] Algorithm of Document Skew Detection Based on Character Vertices
    Ju, Zhiyong
    Gu, Guoqing
    2009 THIRD INTERNATIONAL SYMPOSIUM ON INTELLIGENT INFORMATION TECHNOLOGY APPLICATION, VOL 2, PROCEEDINGS, 2009, : 23 - +