A novel boundary growing approach for accurate skew estimation of binary document images

被引:14
|
作者
Shivakumara, P
Kumar, GH
机构
[1] Natl Univ Singapore, Dept Comp Sci, Sch Comp, Singapore 117543, Singapore
[2] Univ Mysore, Dept Studies Comp Sci, Mysore 570006, Karnataka, India
关键词
document analysis; optical character recognition; boundary growing; linear regression analysis; skew detection;
D O I
10.1016/j.patrec.2005.11.004
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Skew angle estimation is an important component of optical character recognition (OCR) systems and document analysis systems (DAS). In this paper, a novel and an efficient method to estimate the skew angle of a scanned document image is proposed. The proposed method has two stages. In first stage, using boundary-growing approach, text lines containing characters of the scanned document image are extracted. From each text line, coordinates of the positions of the characters are obtained. In second stage, the obtained coordinates are fed to linear regression analysis (LRA) for the purpose of computation of skew angle. Several experiments have been conducted on various types of documents such as documents containing different language texts, documents with different fonts and documents with noise to reveal the robustness of the proposed method. A comparative study with the well-known methods is presented to show that the proposed method is superior in terms of accuracy and computational efficiency. (c) 2005 Elsevier B.V. All rights reserved.
引用
收藏
页码:791 / 801
页数:11
相关论文
共 50 条
  • [1] A new boundary growing and Hough transform based approach for accurate skew detection in binary document images
    Shivakumara, P
    Kumar, GH
    Guru, DS
    Nagabhushan, P
    [J]. 2005 INTERNATIONAL CONFERENCE ON INTELLIGENT SENSING AND INFORMATION PROCESSING, PROCEEDINGS, 2005, : 140 - 146
  • [2] An efficient skew estimation technique for binary document images based on boundary growing and linear regression analysis
    Shivakumara, P
    Kumar, GH
    Guru, DS
    Nagabhushan, P
    [J]. NEURAL INFORMATION PROCESSING, 2004, 3316 : 659 - 665
  • [3] A novel approach for Skew estimation of document images in OCR system
    Sarfraz, M
    Zidouri, A
    Shahab, SA
    [J]. COMPUTER GRAPHICS, IMAGING AND VISION: NEW TRENDS, 2005, : 175 - 180
  • [4] A novel technique for estimation of skew in binary text document images based on linear regression analysis
    P. Shivakumara
    G. Hemantha Kumar
    D. S. Guru
    P. Nagabhushan
    [J]. Sadhana, 2005, 30 : 69 - 85
  • [5] A novel technique for estimation of skew in binary text document images based on linear regression analysis
    Shivakumara, P
    Kumar, GH
    Guru, DS
    Nagabhushan, P
    [J]. SADHANA-ACADEMY PROCEEDINGS IN ENGINEERING SCIENCES, 2005, 30 (1): : 69 - 85
  • [6] Skew Estimation Technique for Binary Document Images based on Thinning and Moments
    Aradhya, Manjunath V. N.
    Kumar, Hemantha G.
    Shivakumara, P.
    [J]. ENGINEERING LETTERS, 2007, 14 (01)
  • [7] Improved nearest neighbor based approach to accurate document skew estimation
    Lu, Y
    Tan, CL
    [J]. SEVENTH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION, VOLS I AND II, PROCEEDINGS, 2003, : 503 - 507
  • [8] Accurate method of document skew estimation by PCA
    Okun, OG
    [J]. SCIA '97 - PROCEEDINGS OF THE 10TH SCANDINAVIAN CONFERENCE ON IMAGE ANALYSIS, VOLS 1 AND 2, 1997, : 751 - 758
  • [9] Skew Estimation of Document Images Using Bagging
    Meng, Gaofeng
    Pan, Chunhong
    Zheng, Nanning
    Sun, Chen
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2010, 19 (07) : 1837 - 1846
  • [10] A nearest-neighbor chain based approach to skew estimation in document images
    Lu, Y
    Tan, CL
    [J]. PATTERN RECOGNITION LETTERS, 2003, 24 (14) : 2315 - 2323