Kannada Text Line Extraction Based on Energy Minimization and Skew Correction

被引:0
|
作者
Dixit, Sunanda [1 ]
Narayan, Suresh Hosahalli [1 ]
Belur, Mahesh [1 ]
机构
[1] Dayananda Sagar Coll Engn, Informat Sci & Engn Dept, Bangalore, Karnataka, India
关键词
Document analysis; skew angle; skew detection and correction; cost function; energy minimization; baseline skew and fluctuations; skating window approach;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
There are many governmental, cultural, commercial and educational organizations that manage large number of manuscript textual information. Kannada being one of the official languages of South India, such organizations include Kannada handwritten documents. Text line segmentation in such documents remains an open document analysis problem. Detection and correction of skew angle of the segmented text lines become another important step in document analysis. Most of the segmentation algorithms, for skewed text lines, present in the literature today are sensitive to the degree of skew, direction of skew, and spacing between adjacent lines. In this paper, proposed method for the text line extraction and skew correction of the extracted text lines uses a new cost function, which considers the spacing between text lines and the skew of each text line is used. Precisely, the problem is formulated as an energy minimization problem so that the minimization of the cost function yields a set of text lines. Further it is required to efficiently correct baseline skew and fluctuations of these text lines. This proposed method also uses an efficient algorithm for baseline correction. It consists of normalizing the lower baseline to a horizontal line using a skating window approaches, thus, avoiding the segmentation of text lines into subparts. This approach copes with baselines which arc skewed, fluctuating, or both. It differs from machine learning approaches which need manual pixel assignments to baselines. Experimental results show that this baseline correction approach highly improves performance.
引用
收藏
页码:62 / 67
页数:6
相关论文
共 50 条
  • [1] Skew Correction and Text Line Extraction of Arabic Historical Documents
    Zoizon, Abdelhay
    Zarghili, Ars Alane
    Chaker, Ilham
    [J]. ARABIC LANGUAGE PROCESSING: FROM THEORY TO PRACTICE, ICALP 2019, 2019, 1108 : 181 - 193
  • [2] SKEW CORRECTION AND LINE EXTRACTION IN BINARIZED PRINTED TEXT IMAGES
    Li, Wei
    Breier, Matthias
    Merhof, Dorit
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2015, : 472 - 476
  • [3] Text-Line Extraction in Handwritten Chinese Documents Based on an Energy Minimization Framework
    Koo, Hyung Il
    Cho, Nam Ik
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2012, 21 (03) : 1169 - 1175
  • [4] A Skew Detection and Correction Technique for Arabic Script Text-line based on Subwords Bounding
    AL-Shatnawi, Atallah M.
    [J]. 2014 IEEE INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND COMPUTING RESEARCH (IEEE ICCIC), 2014, : 1297 - 1301
  • [5] On Skew estimation and correction of text
    Sarfraz, M.
    Mahmoud, S. A.
    Rasheed, Z.
    [J]. COMPUTER GRAPHICS, IMAGING AND VISUALISATION: NEW ADVANCES, 2007, : 308 - +
  • [6] Multi-level Skew Correction Approach for Hand Written Kannada Documents
    Vinod, H. C.
    Niranjan, S. K.
    [J]. PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY & SYSTEMS (ICITS 2018), 2018, 721 : 376 - 386
  • [7] Morphology-based text line extraction
    Wu, Jui-Chen
    Hsieh, Jun-Wei
    Chen, Yung-Sheng
    [J]. MACHINE VISION AND APPLICATIONS, 2008, 19 (03) : 195 - 207
  • [8] Morphology-based text line extraction
    Jui-Chen Wu
    Jun-Wei Hsieh
    Yung-Sheng Chen
    [J]. Machine Vision and Applications, 2008, 19 : 195 - 207
  • [9] A Robust Segmentation Technique for Line, Word and Character Extraction from Kannada Text in Low Resolution Display Board Images
    Angadi, S. A.
    Kodabagi, M. M.
    [J]. 2014 FIFTH INTERNATIONAL CONFERENCE ON SIGNAL AND IMAGE PROCESSING (ICSIP 2014), 2014, : 42 - 49
  • [10] A Robust Segmentation Technique for Line, Word and Character Extraction from Kannada Text in Low Resolution Display Board Images
    Angadi, S. A.
    Kodabagi, M. M.
    [J]. INTERNATIONAL JOURNAL OF IMAGE AND GRAPHICS, 2014, 14 (1-2)