SKEW CORRECTION AND LINE EXTRACTION IN BINARIZED PRINTED TEXT IMAGES

被引:0
|
作者
Li, Wei [1 ]
Breier, Matthias [1 ]
Merhof, Dorit [1 ]
机构
[1] Rhein Westfal TH Aachen, Inst Imaging & Comp Vis, D-52056 Aachen, Germany
关键词
skew correction; text line extraction; text analysis; printed text image; binary image processing; MEAN SHIFT; DOCUMENT; ROBUST;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Skew correction and text line extraction are essential steps for optical character recognition ( OCR) applications. For this purpose, numerous approaches were developed, which conduct the analysis primarily in document images. However, they often suffer from limited detection range and application-specific parameter tuning. Inspired by the intrinsic properties of printed text, a novel subregion-based approach is proposed in this paper, which is applicable for generic printed text images and no parameter tuning is required. Guided by the spacing between text lines, the detection of a skew angle between +/-90 degrees is feasible. As verified by the experimental results, the proposed approach is robust to diverse skew directions and significantly improves the state-of-the-art OCR performance.
引用
收藏
页码:472 / 476
页数:5
相关论文
共 50 条
  • [1] Skew Correction and Text Line Extraction of Arabic Historical Documents
    Zoizon, Abdelhay
    Zarghili, Ars Alane
    Chaker, Ilham
    [J]. ARABIC LANGUAGE PROCESSING: FROM THEORY TO PRACTICE, ICALP 2019, 2019, 1108 : 181 - 193
  • [2] Kannada Text Line Extraction Based on Energy Minimization and Skew Correction
    Dixit, Sunanda
    Narayan, Suresh Hosahalli
    Belur, Mahesh
    [J]. SOUVENIR OF THE 2014 IEEE INTERNATIONAL ADVANCE COMPUTING CONFERENCE (IACC), 2014, : 62 - 67
  • [3] Text Line Extraction in Document Images
    Wang, Liuan
    Fan, Wei
    Sun, Jun
    Naoi, Satshi
    Tanaka, Hiroshi
    [J]. 2015 13TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), 2015, : 191 - 195
  • [4] Noise Removal from Binarized Text Images
    Le, Huy Phat
    Lee, GueeSang
    [J]. 2010 2ND INTERNATIONAL CONFERENCE ON COMPUTER AND AUTOMATION ENGINEERING (ICCAE 2010), VOL 3, 2010, : 586 - 589
  • [5] Text line extraction for historical document images
    Saabni, Raid
    Asi, Abedelkadir
    El-Sana, Jihad
    [J]. PATTERN RECOGNITION LETTERS, 2014, 35 : 23 - 33
  • [6] FAST TEXT LINE EXTRACTION IN DOCUMENT IMAGES
    Ha, Seong Jong
    Jin, Bora
    Cho, Nam Ik
    [J]. 2012 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP 2012), 2012, : 797 - 800
  • [7] Skew Angle Detection and Correction in Text Images Using RGB Gradient
    Rocha, Bruno
    Vieira, Gabriel
    Pedrini, Helio
    Fonseca, Afonso
    Fernandes, Deborah
    de Lima, Junio Cesar
    Ferreira, Julio Cesar
    Soares, Fabrizzio
    [J]. IMAGE ANALYSIS AND PROCESSING, ICIAP 2022, PT II, 2022, 13232 : 249 - 262
  • [8] On Skew estimation and correction of text
    Sarfraz, M.
    Mahmoud, S. A.
    Rasheed, Z.
    [J]. COMPUTER GRAPHICS, IMAGING AND VISUALISATION: NEW ADVANCES, 2007, : 308 - +
  • [9] Robust text detection from binarized document images
    Okun, O
    Yan, Y
    Pietikäinen, M
    [J]. 16TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL III, PROCEEDINGS, 2002, : 61 - 64
  • [10] Text Line Based Correction of Distorted Document Images
    Luo, Sanding
    Fang, Xiaomin
    Zhao, Cong
    Luo, Yisha
    [J]. TRUSTCOM 2011: 2011 INTERNATIONAL JOINT CONFERENCE OF IEEE TRUSTCOM-11/IEEE ICESS-11/FCST-11, 2011, : 1494 - 1499