Histogram Peak Ratio-Based Binarization for Historical Document Image

被引:0
|
作者
Mahastama, Aditya W. [1 ]
Krisnawati, Lucia D. [1 ]
机构
[1] Duta Wacana Christian Univ, Informat Technol Dept, Yogyakarta, Indonesia
来源
PROCEEDINGS OF 2017 INTERNATIONAL CONFERENCE ON SMART CITIES, AUTOMATION & INTELLIGENT COMPUTING SYSTEMS (ICON-SONICS 2017) | 2017年
关键词
binarization; historical documents; image processing; histogram; background-foreground segmentation;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The emergence of large scale digitization projects transforming printed heritage into digitally available resources in Europe and the United States has led to the Digital Renaissance era. The aim of these projects is to preserve the printed cultural heritage and to integrate their intellectual content into the modern information. To achieve this goal, the digitizing process, i.e. transforming a scanned book into an electronic text, becomes necessary. The first step of digitizing process is the preprocessing which involves the segmentation of the foreground, i.e. the text, from the rest of the document. With the goal of digitizing the manuscripts written in Javanese characters, this study proposes a novel approach of foreground segmentation which is intended to serve dual functions, namely to acquire the text characters and also to improve the quality of the document images from their degradation caused by nature or the age. Our method is based on the computation of histogram peak ratio to determine the threshold value of segmentation. Being experimented on Javanese manuscripts in good and degraded conditions, the performance of our method proves to be excellent as its segmentation success rate achieves 100% for manuscripts in good condition. Its performance in segmenting degraded manuscripts caused by holes, sellotape, and bleed-trough effect could be claimed more than satisfying as its success rate achieves 80%.
引用
收藏
页码:93 / 98
页数:6
相关论文
共 50 条
  • [41] Fast binarization algorithm for document image
    Shanghai Jiaotong Univ, Shanghai, China
    Hongwai Yu Haomibo Xuebao, 5 (344-350):
  • [42] Continual Learning for Document Image Binarization
    Garrido-Munoz, Carlos
    Sanchez-Hernandez, Adrian
    Castellanos, Francisco J.
    Calvo-Zaragoza, Jorge
    2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022, : 1443 - 1449
  • [43] Ancient degraded document image binarization based on texture features
    Sehad, Abdenour
    Chibani, Youcef
    Cheriet, Mohamed
    Yaddaden, Yacine
    2013 8TH INTERNATIONAL SYMPOSIUM ON IMAGE AND SIGNAL PROCESSING AND ANALYSIS (ISPA), 2013, : 189 - +
  • [44] A Novel Approach for Document Image Binarization
    Vishnupriya, S.
    Saranya, P.
    Elangovan, E.
    ICACCS 2015 PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON ADVANCED COMPUTING & COMMUNICATION SYSTEMS, 2015,
  • [45] Cluster-based Sample Selection for Document Image Binarization
    Krantz, Amandus
    Westphal, Florian
    2019 INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION WORKSHOPS (ICDARW), VOL 5, 2019, : 47 - 52
  • [46] Contrast Based Color Plane Selection for Binarization of Historical Document Images
    Paramasivam, M. E.
    Sabeenian, R. S.
    EMERGING TRENDS IN ELECTRICAL, COMMUNICATIONS AND INFORMATION TECHNOLOGIES, 2017, 394 : 249 - 255
  • [47] Efficient Binarization of Historical and Degraded Document Images
    Gatos, B.
    Pratikakis, I.
    Perantonis, S. J.
    PROCEEDINGS OF THE 8TH IAPR INTERNATIONAL WORKSHOP ON DOCUMENT ANALYSIS SYSTEMS, 2008, : 447 - 454
  • [48] Innovative Binarization Solutions for Historical Document Clarity
    Kulkarni, Radhika, V
    Mude, Vedant
    Nagrale, Rutuj
    Nirgude, Aarya
    Nirmal, Tejashri
    2024 4TH INTERNATIONAL CONFERENCE ON PERVASIVE COMPUTING AND SOCIAL NETWORKING, ICPCSN 2024, 2024, : 210 - 217
  • [49] Weighted Ratio-based Adaptive Lossless Image Coding
    Kabani, Abdul Wahab
    El-Sakka, Mahmoud R.
    2014 IEEE 27TH CANADIAN CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING (CCECE), 2014,
  • [50] A Dilated MultiRes Visual Attention U-Net for historical document image binarization
    Detsikas, Nikolaos
    Mitianoudis, Nikolaos
    Papamarkos, Nikolaos
    SIGNAL PROCESSING-IMAGE COMMUNICATION, 2024, 122