Efficient skew detection and correction in scanned document images through clustering of probabilistic Hough transforms

Cited by: 17
Authors
Ahmad, Riaz [1 ]
Naz, Saeeda [2 ]
Razzak, Imran [3 ]
Affiliations
[1] Shaheed Benazir Bhutto Univ, Sheringal, Pakistan
[2] Govt Girls Postgrad Coll, Comp Sci Dept, Abbottabad, Pakistan
[3] Deakin Univ, Sch Informat Technol, Geelong, Vic, Australia
Keywords
Skew; Scanned document; Clustering; Probabilistic Hough transform; Angle estimation
DOI
10.1016/j.patrec.2021.09.014
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Document scanning is still one of the most widely used document digitization steps; however, skew in scanned documents is inevitable. If this skew is not corrected, extracting regions of interest (RoI) and further processing of those regions, such as detection and classification, become difficult. Skew detection and correction have been shown to significantly improve the accuracy of Optical Character Recognition (OCR) systems. This paper introduces a novel, robust, and straightforward skew detection method for scanned documents. It first detects lines with the Probabilistic Hough Transform (PHT) and then clusters those lines by parallelism. The cluster with the largest number of parallel lines represents the expected skewed lines. The proposed method is tested on real scanned images from the Document Image Skew Estimation Contest (DISEC'13), Pashto, and Tobacco800 datasets. It performs well in terms of both accuracy and efficiency, and it is robust to noise. Furthermore, we show that it also works on Arabic and Latin scripts. (c) 2021 Elsevier B.V. All rights reserved.
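The second step described in the abstract, grouping detected lines by parallelism and reading the skew from the largest group, can be illustrated with a minimal sketch. This is not the authors' implementation: the function names `segment_angle` and `estimate_skew`, the greedy one-dimensional clustering, and the 1-degree tolerance `tol_deg` are all assumptions made for illustration. The input is assumed to be a list of line segments given as endpoint tuples `(x1, y1, x2, y2)`, for example as produced by a probabilistic Hough transform such as OpenCV's `cv2.HoughLinesP`.

```python
import math


def segment_angle(x1, y1, x2, y2):
    """Orientation of a segment in degrees, folded into [-90, 90).

    A segment and its reverse are parallel, so angles are folded by 180 degrees.
    In image coordinates the y axis points down, so the sign convention
    is flipped relative to the usual mathematical one.
    """
    angle = math.degrees(math.atan2(y2 - y1, x2 - x1))
    return (angle + 90.0) % 180.0 - 90.0


def estimate_skew(segments, tol_deg=1.0):
    """Return the mean angle of the largest cluster of near-parallel segments.

    Hypothetical clustering rule (an assumption, not taken from the paper):
    sort the angles and start a new cluster whenever the gap to the previous
    angle exceeds tol_deg. Wrap-around near +/-90 degrees is ignored.
    """
    angles = sorted(segment_angle(*s) for s in segments)
    if not angles:
        raise ValueError("no line segments given")

    clusters = [[angles[0]]]
    for a in angles[1:]:
        if a - clusters[-1][-1] <= tol_deg:
            clusters[-1].append(a)   # close enough: same parallel group
        else:
            clusters.append([a])     # gap too large: start a new group

    largest = max(clusters, key=len)
    return sum(largest) / len(largest)


if __name__ == "__main__":
    # Three text-line-like segments tilted by about 2 degrees, plus two outliers.
    segments = [
        (0, 0, 1000, 35),
        (0, 50, 1000, 85),
        (1000, 135, 0, 100),   # reversed direction, same orientation
        (0, 0, 10, 10),        # outlier at 45 degrees
        (0, 0, 10, -5),        # outlier at about -27 degrees
    ]
    print(f"estimated skew: {estimate_skew(segments):.2f} degrees")
```

Under these assumptions, the outlier segments fall into their own single-element clusters and do not affect the estimate; the image would then be rotated by the negative of the estimated angle to correct the skew.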
Pages: 93-99
Number of pages: 7