Histogram Peak Ratio-Based Binarization for Historical Document Image

被引:0
|
作者
Mahastama, Aditya W. [1 ]
Krisnawati, Lucia D. [1 ]
机构
[1] Duta Wacana Christian Univ, Informat Technol Dept, Yogyakarta, Indonesia
来源
PROCEEDINGS OF 2017 INTERNATIONAL CONFERENCE ON SMART CITIES, AUTOMATION & INTELLIGENT COMPUTING SYSTEMS (ICON-SONICS 2017) | 2017年
关键词
binarization; historical documents; image processing; histogram; background-foreground segmentation;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The emergence of large scale digitization projects transforming printed heritage into digitally available resources in Europe and the United States has led to the Digital Renaissance era. The aim of these projects is to preserve the printed cultural heritage and to integrate their intellectual content into the modern information. To achieve this goal, the digitizing process, i.e. transforming a scanned book into an electronic text, becomes necessary. The first step of digitizing process is the preprocessing which involves the segmentation of the foreground, i.e. the text, from the rest of the document. With the goal of digitizing the manuscripts written in Javanese characters, this study proposes a novel approach of foreground segmentation which is intended to serve dual functions, namely to acquire the text characters and also to improve the quality of the document images from their degradation caused by nature or the age. Our method is based on the computation of histogram peak ratio to determine the threshold value of segmentation. Being experimented on Javanese manuscripts in good and degraded conditions, the performance of our method proves to be excellent as its segmentation success rate achieves 100% for manuscripts in good condition. Its performance in segmenting degraded manuscripts caused by holes, sellotape, and bleed-trough effect could be claimed more than satisfying as its success rate achieves 80%.
引用
收藏
页码:93 / 98
页数:6
相关论文
共 50 条
  • [31] MSIO: MultiSpectral Document Image BinarizatIOn
    Diem, Markus
    Hollaus, Fabian
    Sablatnig, Robert
    PROCEEDINGS OF 12TH IAPR WORKSHOP ON DOCUMENT ANALYSIS SYSTEMS, (DAS 2016), 2016, : 84 - 89
  • [32] A MULTISCALE OPERATOR FOR DOCUMENT IMAGE BINARIZATION
    Dorini, Leyza Baldo
    Leite, Neucimar Jeronimo
    VISAPP 2009: PROCEEDINGS OF THE FOURTH INTERNATIONAL CONFERENCE ON COMPUTER VISION THEORY AND APPLICATIONS, VOL 1, 2009, : 34 - 39
  • [33] RATIO-BASED SIMILARITY CRITERIA FOR POLARIMETRIC SAR IMAGE
    Aghababaei, H.
    Ferraioli, G.
    Pascazio, V
    2020 IEEE LATIN AMERICAN GRSS & ISPRS REMOTE SENSING CONFERENCE (LAGIRS), 2020, : 318 - 321
  • [34] A Hybrid Approach for Document Image Binarization
    Sakila, A.
    Vijayarani, S.
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON INVENTIVE COMPUTING AND INFORMATICS (ICICI 2017), 2017, : 645 - 650
  • [35] Adaptive degraded document image binarization
    Gatos, B
    Pratikakis, I
    Perantonis, SJ
    PATTERN RECOGNITION, 2006, 39 (03) : 317 - 327
  • [36] Augment Document Image Binarization by Learning
    Zhu, Yuanping
    19TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOLS 1-6, 2008, : 1905 - 1908
  • [37] Combination of Document Image Binarization Techniques
    Su, Bolan
    Lu, Shijian
    Tan, Chew Lim
    11TH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR 2011), 2011, : 22 - 26
  • [38] Discrete CRF based combination framework for document image binarization
    Hebert, David
    Nicolas, Stephane
    Paquet, Thierry
    2013 12TH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), 2013, : 1165 - 1169
  • [39] A Survey on Document Image Binarization Techniques
    Lokhande, Supriya Sunil
    Dawande, N. A.
    1ST INTERNATIONAL CONFERENCE ON COMPUTING COMMUNICATION CONTROL AND AUTOMATION ICCUBEA 2015, 2015, : 742 - 746
  • [40] Improved binarization algorithm for document image
    Chen, Dan
    Zhang, Feng
    He, Guiming
    Jisuanji Gongcheng/Computer Engineering, 2003, 29 (13):