Histogram Peak Ratio-Based Binarization for Historical Document Image

被引:0
|
作者
Mahastama, Aditya W. [1 ]
Krisnawati, Lucia D. [1 ]
机构
[1] Duta Wacana Christian Univ, Informat Technol Dept, Yogyakarta, Indonesia
关键词
binarization; historical documents; image processing; histogram; background-foreground segmentation;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The emergence of large scale digitization projects transforming printed heritage into digitally available resources in Europe and the United States has led to the Digital Renaissance era. The aim of these projects is to preserve the printed cultural heritage and to integrate their intellectual content into the modern information. To achieve this goal, the digitizing process, i.e. transforming a scanned book into an electronic text, becomes necessary. The first step of digitizing process is the preprocessing which involves the segmentation of the foreground, i.e. the text, from the rest of the document. With the goal of digitizing the manuscripts written in Javanese characters, this study proposes a novel approach of foreground segmentation which is intended to serve dual functions, namely to acquire the text characters and also to improve the quality of the document images from their degradation caused by nature or the age. Our method is based on the computation of histogram peak ratio to determine the threshold value of segmentation. Being experimented on Javanese manuscripts in good and degraded conditions, the performance of our method proves to be excellent as its segmentation success rate achieves 100% for manuscripts in good condition. Its performance in segmenting degraded manuscripts caused by holes, sellotape, and bleed-trough effect could be claimed more than satisfying as its success rate achieves 80%.
引用
收藏
页码:93 / 98
页数:6
相关论文
共 50 条
  • [1] Histogram peak ratio-based binarization for historical document image
    Information Technology Dept., Duta Wacana Christian University, Yogyakarta, Indonesia
    Proc. Int. Conf. Smart Cities, Autom. Intell. Comput. Syst., ICON-SONICS, 1600, (93-98):
  • [2] Historical document image binarization
    Mello, Carlos A. B.
    Oliveira, Adriano L. I.
    Sanchez, Angel
    VISAPP 2008: PROCEEDINGS OF THE THIRD INTERNATIONAL CONFERENCE ON COMPUTER VISION THEORY AND APPLICATIONS, VOL 1, 2008, : 108 - 113
  • [3] Historical Document Image Binarization: A Review
    Tensmeyer C.
    Martinez T.
    SN Computer Science, 2020, 1 (3)
  • [4] Bin Ratio-Based Histogram Distances and Their Application to Image Classification
    Hu, Weiming
    Xie, Nianhua
    Hu, Ruiguang
    Ling, Haibin
    Chen, Qiang
    Yan, Shuicheng
    Maybank, Stephen
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2014, 36 (12) : 2338 - 2352
  • [5] Historical Document Image Binarization Based on Edge Contrast Information
    Li, Zhenjiang
    Wang, Weilan
    Cai, Zhengqi
    ADVANCES IN COMPUTER VISION, CVC, VOL 1, 2020, 943 : 614 - 628
  • [6] Restoration Based Contourlet Transform for Historical Document Image Binarization
    Zemouri, ET-Tahir
    Chibani, Youcef
    Brik, Youcef
    2014 INTERNATIONAL CONFERENCE ON MULTIMEDIA COMPUTING AND SYSTEMS (ICMCS), 2014, : 309 - 313
  • [7] Performance Evaluation Methodology for Historical Document Image Binarization
    Ntirogiannis, Konstantinos
    Gatos, Basilis
    Pratikakis, Ioannis
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2013, 22 (02) : 595 - 609
  • [8] Document Image Binarization Based on NFCM
    Tong Li-Jing
    Chen Kan
    Zhang Yan
    Fu Xiao-Ling
    Duan Jian-Yong
    PROCEEDINGS OF THE 2009 2ND INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING, VOLS 1-9, 2009, : 1769 - 1773
  • [9] Histogram equalization smoothing for determining threshold accuracy on ancient document image binarization
    Dwipayana, Mahendar
    Arnia, Fitri
    Musliyana, Zuhar
    1ST INTERNATIONAL CONFERENCE ON GREEN AND SUSTAINABLE COMPUTING (ICOGES) 2017, 2018, 1019
  • [10] iDocChip - A Configurable Hardware Architecture for Historical Document Image Processing: Percentile Based Binarization
    Rybalkin, Vladimir
    Bukhari, Syed Saqib
    Ghaffar, Muhammad Mohsin
    Ghafoor, Aqib
    Wehn, Norbert
    Dengel, Andreas
    PROCEEDINGS OF THE ACM SYMPOSIUM ON DOCUMENT ENGINEERING (DOCENG 2018), 2018,