A Thresholding Approach for Text Extraction in Handwritten Historical Documents using Adaptive Morphology

Cited by: 1
Authors
Roy, Bishakha [1]
Chatterjee, Rohit Kamal [1]
Affiliation
[1] BIT, Dept Comp Sci & Engn, Mesra Kolkata Campus, Kolkata, India
Keywords
historical handwritten document; structurally adaptive operator; segmentation; morphology; Gaussian surface; adaptive thresholding; image binarization
DOI
10.1109/EAIT.2014.65
Chinese Library Classification (CLC) number
TP39 [Computer applications]
Discipline classification code
081203; 0835
Abstract
The aim of preserving historical handwritten documents is to restore degraded text that carries information. A global threshold, however, generally fails to restore the text adequately, so adaptive (local) thresholding is required to preserve the text in these documents. In the recent past, many standard adaptive thresholding methods have been proposed for binarizing images of handwritten text documents. We propose a new adaptive thresholding method using locally adaptive mathematical morphology. Formulating an adaptive structuring element is challenging and has been addressed recently by some researchers. As an initial step, our method binarizes the image by applying a global threshold. The residual background image below the threshold, which contains low-intensity text mixed with noise, is then processed further. A new approach for constructing a spatially variant operator corresponding to local variances is proposed. A Gaussian surface is selected as an adaptive gray-scale structuring element for the mathematical morphological operations (opening and closing); its parameters, base and height, depend on the local variance. The proposed method successfully denoises various kinds of degraded documents, enhancing textures against a clear background. Experimental results on real historical handwritten documents and artificial images show that our method outperforms several existing methods both visually and on some evaluation metrics.
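The abstract does not give the formulas that link local variance to the structuring element, so the following is only a minimal sketch of the general idea, not the authors' implementation. It builds a Gaussian gray-scale structuring element per pixel and applies a spatially variant opening (erosion followed by its adjoint dilation). The function names, the 3x3 variance window, and the mapping from variance to base and height (`max_radius`, `gain`, `busy_std`) are all hypothetical choices made for illustration.

```python
import math

def local_variance(img, y, x, r=1):
    """Variance of the (2r+1)x(2r+1) window around (y, x), clipped at borders."""
    h, w = len(img), len(img[0])
    vals = [img[j][i]
            for j in range(max(0, y - r), min(h, y + r + 1))
            for i in range(max(0, x - r), min(w, x + r + 1))]
    mean = sum(vals) / len(vals)
    return sum((v - mean) ** 2 for v in vals) / len(vals)

def gaussian_se(variance, max_radius=2, gain=0.5, busy_std=10.0):
    """Gaussian surface as a non-flat structuring element.

    ASSUMED mapping (not from the paper): high-variance pixels get a
    narrow base, low-variance pixels a wide one; the peak height grows
    with the local standard deviation.
    """
    std = math.sqrt(variance)
    radius = 1 if std > busy_std else max_radius   # "base"
    height = gain * std                            # "height"
    sigma = radius / 2.0
    return {(dy, dx): height * math.exp(-(dy * dy + dx * dx) / (2.0 * sigma * sigma))
            for dy in range(-radius, radius + 1)
            for dx in range(-radius, radius + 1)}

def adaptive_opening(img):
    """Spatially variant gray-scale opening of a 2D list of floats."""
    h, w = len(img), len(img[0])
    # One structuring element per pixel, derived from the input image.
    ses = [[gaussian_se(local_variance(img, y, x)) for x in range(w)]
           for y in range(h)]
    # Erosion: each pixel gathers min(f - g) over its own element.
    eroded = [[min(img[y + dy][x + dx] - g
                   for (dy, dx), g in ses[y][x].items()
                   if 0 <= y + dy < h and 0 <= x + dx < w)
               for x in range(w)] for y in range(h)]
    # Adjoint dilation: each pixel scatters (eroded + g) over its own element,
    # which keeps the opening from exceeding the input.
    opened = [[-math.inf] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            for (dy, dx), g in ses[y][x].items():
                yy, xx = y + dy, x + dx
                if 0 <= yy < h and 0 <= xx < w:
                    opened[yy][xx] = max(opened[yy][xx], eroded[y][x] + g)
    return opened
```

In this sketch the dilation is written as a scatter rather than a gather so that it is the adjoint of the erosion when the element changes from pixel to pixel. A matching closing could be obtained by duality, negating the image before and after `adaptive_opening`. The global threshold step and the final binarization described in the abstract are not shown.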
Pages: 198-203
Number of pages: 6
Related papers
50 in total
  • [41] Text Line Segmentation for Handwritten Documents Using Constrained Seam Carving
    Zhang, Xi
    Tan, Chew Lim
    2014 14TH INTERNATIONAL CONFERENCE ON FRONTIERS IN HANDWRITING RECOGNITION (ICFHR), 2014, : 98 - 103
  • [42] A segmentation based adaptive approach for cursive handwritten text recognition
    Verma, Brijesh
    Lee, Hong
    2007 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-6, 2007, : 2212 - 2216
  • [43] Text Extraction from Historical Document Images by the Combination of Several Thresholding Techniques
    Sari, Toufik
    Kefali, Abderrahmane
    Bahi, Halima
    ADVANCES IN MULTIMEDIA, 2014, 2014 (2014)
  • [44] Segmentation of Arabic Handwritten Documents into Text Lines using Watershed Transform
    Souhar, A.
    Boulid, Y.
    Ameur, ElB.
    Ouagague, Mly. M.
    INTERNATIONAL JOURNAL OF INTERACTIVE MULTIMEDIA AND ARTIFICIAL INTELLIGENCE, 2017, 4 (06): : 96 - 102
  • [45] Text Line Segmentation of Multilingual Handwritten Documents Using Fourier Approximation
    Chavan, Vishal
    Mehrotra, Kapil
    2017 FOURTH INTERNATIONAL CONFERENCE ON IMAGE INFORMATION PROCESSING (ICIIP), 2017, : 250 - 255
  • [46] Separator and Content based Approach for Table Extraction in Handwritten Chemistry Documents
    Ghanmi, Nabil
    Belaid, Abdel
    2015 13TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), 2015, : 296 - 300
  • [47] Localization of handwritten text in documents using moment invariants and Delaunay triangulation
    Ramakrishnan, Kandan
    Arvind, K. R.
    Ramakrishnan, A. G.
    ICCIMA 2007: INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND MULTIMEDIA APPLICATIONS, VOL III, PROCEEDINGS, 2007, : 408 - 414
  • [48] Handwritten Arabic Documents Segmentation into Text Lines using Seam Carving
    Daldali, M.
    Souhar, A.
    INTERNATIONAL JOURNAL OF INTERACTIVE MULTIMEDIA AND ARTIFICIAL INTELLIGENCE, 2019, 5 (05): : 89 - 96
  • [49] Entropy-Based Approach for Enabling Text Line Segmentation in Handwritten Documents
    Sindhushree, G. S.
    Amarnath, R.
    Nagabhushan, P.
    DATA ANALYTICS AND LEARNING, 2019, 43 : 169 - 184
  • [50] Information Extraction and Text Mining of Ancient Vattezhuthu Characters in Historical Documents Using Image Zoning
    Vellingiriraj, E. K.
    Balamurugan, M.
    Balasubramanie, P.
    PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP), 2016, : 37 - 40