Efficient skew detection of printed document images based on novel combination of enhanced profiles

被引:11
|
作者
Papandreou, A. [1 ,2 ]
Gatos, B. [2 ]
Perantonis, S. J. [2 ]
Gerardis, I. [2 ]
机构
[1] Univ Athens, Dept Informat & Telecommun, Athens 15784, Greece
[2] Natl Ctr Sci Res Demokritos, Inst Informat & Telecommun, Athens 15310, Greece
关键词
Document skew correction; Projection profiles; Document image preprocessing;
D O I
10.1007/s10032-014-0228-5
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Document skew is often introduced during the capturing process of the document image processing pipeline and may seriously affect the performance of subsequent stages of segmentation and recognition. Skew detection is often accomplished with the use of horizontal projections, while recently, a new approach that is based on vertical projections has been introduced. In this paper, we use the technique of minimum bounding box area in order to combine a horizontal with a new reinforced vertical projection profile method. We are motivated by the fact that the horizontal and the novel vertical projection profiles are found to be complementary to each other. We claim that the proposed approach has more accurate performance compared with other state-of-the-art skew detection algorithms; it deals with all the drawbacks of the projection profile methods; it is more noise and warp resistant and gives accurate results for any kind of printed document image. For these reasons, it can be efficiently applied to historical machine printed or multicolumn documents, documents with figures and tables, while it is robust for any kind of script. Extended experimental results on two databases in different skew angle range, with representative printed documents of all kinds, as well as printed documents of two historical books, prove the efficiency of the proposed approach. There is also a comparison with commercial products in several cases where the contribution of the proposed algorithm is demonstrated at optical character recognition level. Moreover, an analysis of the accuracy performance of the main elements of the proposed technique is also performed.
引用
收藏
页码:433 / 454
页数:22
相关论文
共 50 条
  • [41] An Efficient Document Skew Detection Method Using Probability Model and Q Test
    Huang, Kai
    Chen, Zixuan
    Yu, Min
    Yan, Xiaolang
    Yin, Aiguo
    ELECTRONICS, 2020, 9 (01)
  • [42] Computer Assisted Printed Character recognition in document based images
    Aghav, Sushila
    Paygude, S. S.
    INTERNATIONAL CONFERENCE ON MODELLING OPTIMIZATION AND COMPUTING, 2012, 38 : 3222 - 3227
  • [43] Profile based information retrieval from printed document images
    Abirami, S.
    Manjula, D.
    COMPUTER GRAPHICS, IMAGING AND VISUALISATION: NEW ADVANCES, 2007, : 268 - +
  • [44] Document images retrieval based on multiple features combination
    Meng, Gaofeng
    Zheng, Nanning
    Song, Yonghong
    Zhang, Yuanlin
    ICDAR 2007: NINTH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION, VOLS I AND II, PROCEEDINGS, 2007, : 143 - 147
  • [45] A NOVEL FORM DETECTION AND REMOVAL SCHEME FOR DOCUMENT IMAGES
    Kuo, Tien-Ying
    Lo, Yi-Chung
    2010 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, 2010, : 2141 - 2144
  • [46] A nearest-neighbor chain based approach to skew estimation in document images
    Lu, Y
    Tan, CL
    PATTERN RECOGNITION LETTERS, 2003, 24 (14) : 2315 - 2323
  • [47] Matching word images for content-based retrieval from printed document images
    Million Meshesha
    C. V. Jawahar
    International Journal of Document Analysis and Recognition (IJDAR), 2008, 11 : 29 - 38
  • [48] Matching word images for content-based retrieval from printed document images
    Meshesha, Million
    Jawahar, C. V.
    INTERNATIONAL JOURNAL ON DOCUMENT ANALYSIS AND RECOGNITION, 2008, 11 (01) : 29 - 38
  • [49] Document Image Skew Detection and Correction Method Based on Extreme Points
    Wagdy, Marian
    Faye, Ibrahima
    DayangRohaya
    2014 INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION SCIENCES (ICCOINS), 2014,
  • [50] MLR-NET: An Arbitrary Skew Angle Detection Algorithm for Complex Layout Document Images
    Wang, Peisen
    Wang, Bo
    Nie, Xixi
    Gu, Chunyi
    Li, Kaijiang
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2024, PT VII, 2025, 15037 : 246 - 260