Degraded Document Image Binarization using Novel Background Estimation Technique

被引:0
|
作者
Jindal, Harshit [1 ]
Kumar, Manoj [1 ]
Tomar, Akhil [1 ]
Malik, Ayush [1 ]
机构
[1] Delhi Technol Univ, Dept Comp Sci Engn, New Delhi, India
关键词
Document Image Processing; Degraded Document Image Binarization; Thresholding; Background estimation; Noise Removal; Otsu Thresholding; Bilateral Filtering;
D O I
10.1109/I2CT51068.2021.9418084
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Over the past few decades, the use of scanned historical document images has increased dramatically, especially with the emergence of online libraries and standard benchmark datasets like DIBCO. The historical documents are usually in very-poor conditions containing noises like large ink stains, bleed-through, liquid spills, uneven-background, spots, faded-ink, weak/thin text that makes the task of binarization very difficult. In this paper, we propose an effective degraded document image binarization algorithm that performs accurate text segmentation. Our method first estimates the background utilizing information from neighboring pixels and filter smoothening. The next step is background subtraction that helps in the compensation of background distortions. The document is segmented using Otsu thresholding, and then we process the image to remove the remaining noise and maximize text content using labelled connected components. Our method outperforms several existing and widely used binarization algorithms on F-measure, PSNR, DRD, and pseudo F-measure when evaluated on H-DIBCO 2016 and H-DIBCO 2018 datasets and can very effectively detect faint characters from a document image.
引用
收藏
页数:8
相关论文
共 50 条
  • [41] Gabor filter-based texture for ancient degraded document image binarization
    Abdenour Sehad
    Youcef Chibani
    Rachid Hedjam
    Mohamed Cheriet
    Pattern Analysis and Applications, 2019, 22 : 1 - 22
  • [42] Reclamation of Information from Degraded and Damaged Document Images by Image Binarization Method
    Vishnudharan, B.
    Anusudha, K.
    2017 INTERNATIONAL CONFERENCE ON INNOVATIONS IN INFORMATION, EMBEDDED AND COMMUNICATION SYSTEMS (ICIIECS), 2017,
  • [43] Modified Sauvola binarization for degraded document images
    Kaur, Amandeep
    Rani, Usha
    Josan, Gurpreet Singh
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2020, 92
  • [44] Efficient Binarization of Historical and Degraded Document Images
    Gatos, B.
    Pratikakis, I.
    Perantonis, S. J.
    PROCEEDINGS OF THE 8TH IAPR INTERNATIONAL WORKSHOP ON DOCUMENT ANALYSIS SYSTEMS, 2008, : 447 - 454
  • [45] A Robust Multi Stage Technique for Image Binarization of Degraded Historical Documents
    Boudraa, Omar
    Hidouci, Walid Khaled
    Michelucci, Dominique
    2017 5TH INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING - BOUMERDES (ICEE-B), 2017,
  • [46] Binarization Techniques for Degraded Document Images - A Review
    Jyotsna
    Chauhan, Shivani
    Sharma, Ekta
    Doegar, Amit
    2016 5TH INTERNATIONAL CONFERENCE ON RELIABILITY, INFOCOM TECHNOLOGIES AND OPTIMIZATION (TRENDS AND FUTURE DIRECTIONS) (ICRITO), 2016, : 163 - 166
  • [47] Parallel nonparametric binarization for degraded document images
    Chen, Xin
    Lin, Liang
    Gao, Yuefang
    NEUROCOMPUTING, 2016, 189 : 43 - 52
  • [48] Automatic Enhancement and Binarization of Degraded Document Images
    Parker, Jon
    Frieder, Ophir
    Frieder, Gideon
    2013 12TH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), 2013, : 210 - 214
  • [49] A new binarization method for degraded document images
    Rani U.
    Kaur A.
    Josan G.
    International Journal of Information Technology, 2023, 15 (2) : 1035 - 1053
  • [50] A Novel Approach for Document Image Binarization Using Bit-Plane Slicing
    Karthika, M.
    James, Ajay
    8TH INTERNATIONAL CONFERENCE INTERDISCIPLINARITY IN ENGINEERING, INTER-ENG 2014, 2015, 19 : 758 - 765