A Thresholding Approach for Text Extraction in Handwritten Historical Documents using Adaptive Morphology

被引:1
|
作者
Roy, Bishakha [1 ]
Chatterjee, Rohit Kamal [1 ]
机构
[1] BIT, Dept Comp Sci & Engn, Mesra Kolkata Campus, Kolkata, India
关键词
historical handwritten document; structurally adaptive operator; segmentation; morphology; Gaussian surface; adaptive thresholding; IMAGE BINARIZATION;
D O I
10.1109/EAIT.2014.65
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
The aim of preserving historical handwritten documents is to restore the degraded text containing information. But generally global threshold fails to restore the text adequately. Adaptive (local) thresholding is required for preserving the text in these documents. In recent past many standard adaptive thresholding methods have been proposed for binarization of handwritten text document images. We propose a new adaptive thresholding method using locally adaptive mathematical morphology. Formulation of an adaptive structural element is a challenging work and addressed recently by some researchers. Our method at initial step binarizes the image applying global threshold. The residual background image below threshold containing low intensity texts mixed with noise is further processed. A new approach for constructing spatially variant operator corresponding to local variances is proposed. Gaussian surface is selected as an adaptive gray-scale structuring element for mathematical morphological operations (opening and closing), whose parameters base and height depends on local variance. The proposed method successfully denoises various kinds of degraded documents enhancing textures with clear background. Experimental result on real historical handwritten document and artificial images show that our method outperforms several other existing methods both visually and using some evaluation metrics.
引用
收藏
页码:198 / 203
页数:6
相关论文
共 50 条
  • [31] A robust approach to text line grouping in online handwritten Japanese documents
    Zhou, Xiang-Dong
    Wang, Da-Han
    Liu, Cheng-Lin
    PATTERN RECOGNITION, 2009, 42 (09) : 2077 - 2088
  • [32] An Approach of Strike-through Text Identification from Handwritten Documents
    Adak, Chandranath
    Chaudhuri, Bidyut B.
    2014 14TH INTERNATIONAL CONFERENCE ON FRONTIERS IN HANDWRITING RECOGNITION (ICFHR), 2014, : 643 - 648
  • [33] Using Scale-Space Anisotropic Smoothing for Text Line Extraction in Historical Documents
    Cohen, Rafi
    Dinstein, Itshak
    El-Sana, Jihad
    Kedem, Klara
    IMAGE ANALYSIS AND RECOGNITION, ICIAR 2014, PT I, 2014, 8814 : 349 - 358
  • [34] Image thresholding of historical documents using entropy and ROC curves
    Mello, CAB
    Costa, AHM
    PROGRESS IN PATTERN RECOGNITION, IMAGE ANALYSIS AND APPLICATIONS, PROCEEDINGS, 2005, 3773 : 905 - 916
  • [35] Skew Correction and Text Line Extraction of Arabic Historical Documents
    Zoizon, Abdelhay
    Zarghili, Ars Alane
    Chaker, Ilham
    ARABIC LANGUAGE PROCESSING: FROM THEORY TO PRACTICE, ICALP 2019, 2019, 1108 : 181 - 193
  • [36] Performance Analysis of Handwritten Text Augmentation on Style-Based Dating of Historical Documents
    Koopmans L.
    Dhali M.A.
    Schomaker L.
    SN Computer Science, 5 (4)
  • [37] Date Field Extraction from Handwritten Documents Using HMMs
    Mandal, Ranju
    Roy, Partha Pratim
    Pal, Umapada
    Blumenstein, Michael
    2015 13TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), 2015, : 866 - 870
  • [38] Recognition-based Approach of Numeral Extraction in Handwritten Chemistry Documents using Contextual Knowledge
    Ghanmi, Nabil
    Belaid, Abdel
    PROCEEDINGS OF 12TH IAPR WORKSHOP ON DOCUMENT ANALYSIS SYSTEMS, (DAS 2016), 2016, : 251 - 256
  • [39] Text line detection in unconstrained handwritten documents using a block-based hough transform approach
    Louloudis, G.
    Gatos, B.
    Halatsis, C.
    ICDAR 2007: NINTH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION, VOLS I AND II, PROCEEDINGS, 2007, : 599 - +
  • [40] Text-Line Extraction in Handwritten Chinese Documents Based on an Energy Minimization Framework
    Koo, Hyung Il
    Cho, Nam Ik
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2012, 21 (03) : 1169 - 1175