A Thresholding Approach for Text Extraction in Handwritten Historical Documents using Adaptive Morphology

被引:1
|
作者
Roy, Bishakha [1 ]
Chatterjee, Rohit Kamal [1 ]
机构
[1] BIT, Dept Comp Sci & Engn, Mesra Kolkata Campus, Kolkata, India
关键词
historical handwritten document; structurally adaptive operator; segmentation; morphology; Gaussian surface; adaptive thresholding; IMAGE BINARIZATION;
D O I
10.1109/EAIT.2014.65
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
The aim of preserving historical handwritten documents is to restore the degraded text containing information. But generally global threshold fails to restore the text adequately. Adaptive (local) thresholding is required for preserving the text in these documents. In recent past many standard adaptive thresholding methods have been proposed for binarization of handwritten text document images. We propose a new adaptive thresholding method using locally adaptive mathematical morphology. Formulation of an adaptive structural element is a challenging work and addressed recently by some researchers. Our method at initial step binarizes the image applying global threshold. The residual background image below threshold containing low intensity texts mixed with noise is further processed. A new approach for constructing spatially variant operator corresponding to local variances is proposed. Gaussian surface is selected as an adaptive gray-scale structuring element for mathematical morphological operations (opening and closing), whose parameters base and height depends on local variance. The proposed method successfully denoises various kinds of degraded documents enhancing textures with clear background. Experimental result on real historical handwritten document and artificial images show that our method outperforms several other existing methods both visually and using some evaluation metrics.
引用
收藏
页码:198 / 203
页数:6
相关论文
共 50 条
  • [21] Separating lines of text in free-form handwritten historical documents
    Kennard, Douglas J.
    Barrett, William A.
    SECOND INTERNATIONAL CONFERENCE ON DOCUMENT IMAGE ANALYSIS FOR LIBRARIES, PROCEEDINGS, 2006, : 12 - +
  • [22] Learning-Free Text Line Segmentation for Historical Handwritten Documents
    Barakat, Berat Kurar
    Cohen, Rafi
    Droby, Ahmad
    Rabaev, Irina
    El-Sana, Jihad
    APPLIED SCIENCES-BASEL, 2020, 10 (22): : 1 - 19
  • [23] Text line extraction from multi-skewed handwritten documents
    Basu, S.
    Chaudhuri, C.
    Kundu, M.
    Nasipuri, M.
    Basu, D. K.
    PATTERN RECOGNITION, 2007, 40 (06) : 1825 - 1839
  • [24] A Simple and Novel Adaptive Binarization Approach for Handwritten Documents
    Panwar, Subhash
    Nain, Neeta
    2013 INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS AND SIGNAL PROCESSING (ISSP), 2013, : 106 - 110
  • [25] Text Line Extraction from Multi-skewed Handwritten Documents
    Jiang Yong
    Chen Xiaojing
    PROCEEDINGS OF THE 27TH CHINESE CONTROL CONFERENCE, VOL 4, 2008, : 412 - +
  • [26] Text Line Extraction in Historical Documents Using Mask R-CNN
    Droby, Ahmad
    Barakat, Berat Kurar
    Alaasam, Reem
    Madi, Boraq
    Rabaev, Irina
    El-Sana, Jihad
    SIGNALS, 2022, 3 (03): : 535 - 549
  • [27] Boosting Offline Handwritten Text Recognition in Historical Documents With Few Labeled Lines
    Aradillas, Jose Carlos
    Murillo-Fuentes, Juan Jose
    Olmos, Pablo M.
    IEEE ACCESS, 2021, 9 : 76674 - 76688
  • [28] An Interactive Approach with Off-line and On-line Handwritten Text Recognition Combination for Transcribing Historical Documents
    Granell, Emilio
    Romero, Veronica
    Martinez-Hinarejos, Carlos D.
    PROCEEDINGS OF 12TH IAPR WORKSHOP ON DOCUMENT ANALYSIS SYSTEMS, (DAS 2016), 2016, : 269 - 274
  • [29] Multi-Oriented Text Line Extraction from Handwritten Arabic Documents
    Ouwayed, Nazih
    Belaid, Abdel
    PROCEEDINGS OF THE 8TH IAPR INTERNATIONAL WORKSHOP ON DOCUMENT ANALYSIS SYSTEMS, 2008, : 339 - 346
  • [30] Language-Independent Text-Line Extraction Algorithm for Handwritten Documents
    Ryu, Jewoong
    Koo, Hyung Il
    Cho, Nam Ik
    IEEE SIGNAL PROCESSING LETTERS, 2014, 21 (09) : 1115 - 1119