Document image improvement for OCR as a classification problem

被引:5
|
作者
Summers, K [1 ]
机构
[1] Vredenburg, Lanham, MD 20912 USA
来源
关键词
OCR; optical character recognition; image enhancement; machine learning;
D O I
10.1117/12.476023
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In support of the goal of automatically selecting methods of enhancing an image to improve the accuracy of OCR on that image, we consider the problem of determining whether to apply each of a set of methods as a supervised classification problem for machine learning. We characterize each image according to a combination of two sets of measures: a set that are intended to reflect the degree of particular types of noise present in documents in a single font of Roman or similar script and a more general set based on connected component statistics. We consider several potential methods of image improvement, each of which constitutes its own 2-class classification problem, according to whether transforming the image with this method improves the accuracy of OCR. In our experiments, the results varied for the different image transformation methods, but the system made the correct choice in 77% of the cases in which the decision affected the OCR score (in the range [0,1]) by at least .01, and it made the correct choice 64% of the time overall.
引用
下载
收藏
页码:73 / 83
页数:11
相关论文
共 50 条
  • [1] A New Benchmark and OCR-Free Method for Document Image Topic Classification
    Wang, Zhen
    Zhu, Peide
    Yu, Fuyang
    Okumura, Manabu
    MULTIMEDIA MODELING, MMM 2024, PT V, 2024, 14565 : 15 - 27
  • [2] Document image summarization without OCR
    Bloomberg, DS
    Chen, FR
    INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, PROCEEDINGS - VOL II, 1996, : 229 - 232
  • [3] Hybred: An OCR document representation for classification tasks
    Laroum, Sami
    Béchet, Nicolas
    Hamza, Hatem
    Roche, Mathieu
    International Journal of Computer Science Issues, 2011, 8 (3 3-2): : 1 - 8
  • [4] OCR oriented binarization method of document image
    Yang, You
    CISP 2008: FIRST INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING, VOL 4, PROCEEDINGS, 2008, : 622 - 625
  • [5] A Survey on OCR for Over-Lapping and Broken Character's in Document Image Problem with Overlapping and Broken Character's in Document Image
    Gaur, Abhishek Kumar
    Bharangar, Devendra Singh
    Trivedi, Munesh Chand
    2014 6TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND COMMUNICATION NETWORKS, 2014, : 138 - 141
  • [6] Fast document image comparison in multilingual corpus without OCR
    Yuping Lin
    Yingyu Li
    Yonghong Song
    Fang Wang
    Multimedia Systems, 2017, 23 : 315 - 324
  • [7] Fast document image comparison in multilingual corpus without OCR
    Lin, Yuping
    Li, Yingyu
    Song, Yonghong
    Wang, Fang
    MULTIMEDIA SYSTEMS, 2017, 23 (03) : 315 - 324
  • [8] Fast document image comparison in multilingual corpus without OCR
    Lin, Yuping
    Li, Yingyu
    Song, Yonghong
    Wang, Fang
    Multimedia Systems, 2017, 23 (03): : 315 - 324
  • [9] A super resolution framework for low resolution document image OCR
    Ma, Di
    Agam, Gady
    DOCUMENT RECOGNITION AND RETRIEVAL XX, 2013, 8658
  • [10] Robust Camera Captured Image Mosaicking for Document Digitization and OCR Processing
    Tiwari, Lokender
    Kumar, Bhupendra
    Patnaik, Tushar
    2014 INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY (ICIT), 2014, : 100 - 105