Histogram Peak Ratio-Based Binarization for Historical Document Image

被引:0
|
作者
Mahastama, Aditya W. [1 ]
Krisnawati, Lucia D. [1 ]
机构
[1] Duta Wacana Christian Univ, Informat Technol Dept, Yogyakarta, Indonesia
来源
PROCEEDINGS OF 2017 INTERNATIONAL CONFERENCE ON SMART CITIES, AUTOMATION & INTELLIGENT COMPUTING SYSTEMS (ICON-SONICS 2017) | 2017年
关键词
binarization; historical documents; image processing; histogram; background-foreground segmentation;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The emergence of large scale digitization projects transforming printed heritage into digitally available resources in Europe and the United States has led to the Digital Renaissance era. The aim of these projects is to preserve the printed cultural heritage and to integrate their intellectual content into the modern information. To achieve this goal, the digitizing process, i.e. transforming a scanned book into an electronic text, becomes necessary. The first step of digitizing process is the preprocessing which involves the segmentation of the foreground, i.e. the text, from the rest of the document. With the goal of digitizing the manuscripts written in Javanese characters, this study proposes a novel approach of foreground segmentation which is intended to serve dual functions, namely to acquire the text characters and also to improve the quality of the document images from their degradation caused by nature or the age. Our method is based on the computation of histogram peak ratio to determine the threshold value of segmentation. Being experimented on Javanese manuscripts in good and degraded conditions, the performance of our method proves to be excellent as its segmentation success rate achieves 100% for manuscripts in good condition. Its performance in segmenting degraded manuscripts caused by holes, sellotape, and bleed-trough effect could be claimed more than satisfying as its success rate achieves 80%.
引用
收藏
页码:93 / 98
页数:6
相关论文
共 50 条
  • [21] DP-LinkNet: A convolutional network for historical document image binarization
    Xiong, Wei
    Jia, Xiuhong
    Yang, Dichun
    Ai, Meihui
    Li, Lirong
    Wang, Song
    KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2021, 15 (05) : 1778 - 1797
  • [22] Historical document image binarization via style augmentation and atrous convolutions
    Hanif Rasyidi
    Salman Khan
    Neural Computing and Applications, 2021, 33 : 7339 - 7352
  • [23] Document Image Binarization Process
    Prodan, Marcel
    Boiangiu, Costin-Anton
    BRAIN-BROAD RESEARCH IN ARTIFICIAL INTELLIGENCE AND NEUROSCIENCE, 2023, 14 (02): : 93 - 114
  • [24] Histogram-based global thresholding method for image binarization
    Elen A.
    Dönmez E.
    Optik, 2024, 306
  • [25] Adaptive document image binarization
    Sauvola, J
    Pietikäinen, M
    PATTERN RECOGNITION, 2000, 33 (02) : 225 - 236
  • [26] LBP-Based Degraded Document Image Binarization
    Sehad, Abdenour
    Chibani, Youcef
    Hedjam, Rachid
    Cheriet, Mohamed
    5TH INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, THEORY, TOOLS AND APPLICATIONS 2015, 2015, : 213 - 217
  • [27] Adaptive binarization of historical document images
    Kavallieratou, Ergina
    Stathis, Stamatatos
    18TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 3, PROCEEDINGS, 2006, : 742 - +
  • [28] Binarization of degraded document image based on contrast enhancement
    Lu Di
    Huang Xin
    Liu Changyuan
    Lin Xue
    Zhang Huayu
    Yan Jun
    PROCEEDINGS OF THE 35TH CHINESE CONTROL CONFERENCE 2016, 2016, : 4894 - 4899
  • [29] The Elastic Ratio: Introducing Curvature Into Ratio-Based Image Segmentation
    Schoenemann, Thomas
    Masnou, Simon
    Cremers, Daniel
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2011, 20 (09) : 2565 - 2581
  • [30] Binarization of Color Historical Document Images Using Local Image Equalization and XDoG
    Roe, Edward
    Mello, Carlos A. B.
    2013 12TH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), 2013, : 205 - 209