A Thresholding Approach for Text Extraction in Handwritten Historical Documents using Adaptive Morphology

被引:1
|
作者
Roy, Bishakha [1 ]
Chatterjee, Rohit Kamal [1 ]
机构
[1] BIT, Dept Comp Sci & Engn, Mesra Kolkata Campus, Kolkata, India
关键词
historical handwritten document; structurally adaptive operator; segmentation; morphology; Gaussian surface; adaptive thresholding; IMAGE BINARIZATION;
D O I
10.1109/EAIT.2014.65
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
The aim of preserving historical handwritten documents is to restore the degraded text containing information. But generally global threshold fails to restore the text adequately. Adaptive (local) thresholding is required for preserving the text in these documents. In recent past many standard adaptive thresholding methods have been proposed for binarization of handwritten text document images. We propose a new adaptive thresholding method using locally adaptive mathematical morphology. Formulation of an adaptive structural element is a challenging work and addressed recently by some researchers. Our method at initial step binarizes the image applying global threshold. The residual background image below threshold containing low intensity texts mixed with noise is further processed. A new approach for constructing spatially variant operator corresponding to local variances is proposed. Gaussian surface is selected as an adaptive gray-scale structuring element for mathematical morphological operations (opening and closing), whose parameters base and height depends on local variance. The proposed method successfully denoises various kinds of degraded documents enhancing textures with clear background. Experimental result on real historical handwritten document and artificial images show that our method outperforms several other existing methods both visually and using some evaluation metrics.
引用
收藏
页码:198 / 203
页数:6
相关论文
共 50 条
  • [1] Text Line Extraction in Handwritten Historical Documents
    Capobianco, Samuele
    Marinai, Simone
    [J]. DIGITAL LIBRARIES AND ARCHIVES, IRCDL 2017, 2017, 733 : 68 - 79
  • [2] A Local Thresholding Algorithm for Images of Handwritten Historical Documents
    Neves, Renata F. P.
    Mello, Carlos A. B.
    [J]. 2011 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2011, : 2934 - 2939
  • [3] Segmentation of Historical Handwritten Documents into Text Zones and Text Lines
    Gatos, Basilis
    Louloudis, Georgios
    Stamatopoulos, Nikolaos
    [J]. 2014 14TH INTERNATIONAL CONFERENCE ON FRONTIERS IN HANDWRITING RECOGNITION (ICFHR), 2014, : 464 - 469
  • [4] A set of benchmarks for Handwritten Text Recognition on historical documents
    Andreu Sanchez, Joan
    Romero, Veronica
    Toselli, Alejandro H.
    Villegas, Mauricio
    Vidal, Enrique
    [J]. PATTERN RECOGNITION, 2019, 94 : 122 - 134
  • [5] Text Line Segmentation in Images of Handwritten Historical Documents
    Sanchez, A.
    Suarez, P. D.
    Melloz, C. A. B.
    Oliveira, A. L. I.
    Alves, V. M. O.
    [J]. 2008 FIRST INTERNATIONAL WORKSHOPS ON IMAGE PROCESSING THEORY, TOOLS AND APPLICATIONS (IPTA), 2008, : 232 - +
  • [6] Preserving Text Content from Historical Handwritten Documents
    Chakraborty, Arpita
    Blumenstein, Michael
    [J]. PROCEEDINGS OF 12TH IAPR WORKSHOP ON DOCUMENT ANALYSIS SYSTEMS, (DAS 2016), 2016, : 329 - 334
  • [7] A general approach for multi-oriented text line extraction of handwritten documents
    Nazih Ouwayed
    Abdel Belaïd
    [J]. International Journal on Document Analysis and Recognition (IJDAR), 2012, 15 : 297 - 314
  • [8] Lanna Handwritten Character Recognition on Historical Documents Using Feature Extraction
    Khankasikam, Krisda
    [J]. INFORMATION TECHNOLOGY APPLICATIONS IN INDUSTRY, PTS 1-4, 2013, 263-266 : 2553 - 2560
  • [9] A general approach for multi-oriented text line extraction of handwritten documents
    Ouwayed, Nazih
    Belaid, Abdel
    [J]. INTERNATIONAL JOURNAL ON DOCUMENT ANALYSIS AND RECOGNITION, 2012, 15 (04) : 297 - 314
  • [10] Text line segmentation and binarization of handwritten historical documents using the fast and adaptive bidimensional empirical mode decomposition
    Dyla, M. H. Mohamed
    Morain-Nicolier, F.
    [J]. OPTIK, 2019, 188 : 52 - 63