Document image improvement for OCR as a classification problem

被引:5
|
作者
Summers, K [1 ]
机构
[1] Vredenburg, Lanham, MD 20912 USA
来源
关键词
OCR; optical character recognition; image enhancement; machine learning;
D O I
10.1117/12.476023
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In support of the goal of automatically selecting methods of enhancing an image to improve the accuracy of OCR on that image, we consider the problem of determining whether to apply each of a set of methods as a supervised classification problem for machine learning. We characterize each image according to a combination of two sets of measures: a set that are intended to reflect the degree of particular types of noise present in documents in a single font of Roman or similar script and a more general set based on connected component statistics. We consider several potential methods of image improvement, each of which constitutes its own 2-class classification problem, according to whether transforming the image with this method improves the accuracy of OCR. In our experiments, the results varied for the different image transformation methods, but the system made the correct choice in 77% of the cases in which the decision affected the OCR score (in the range [0,1]) by at least .01, and it made the correct choice 64% of the time overall.
引用
收藏
页码:73 / 83
页数:11
相关论文
共 50 条
  • [21] Document classification as a theoretical problem of documentology
    Shvetsova-Vodka, Galina N.
    NAUCHNYE I TEKHNICHESKIE BIBLIOTEKI-SCIENTIFIC AND TECHNICAL LIBRARIES, 2022, (09): : 147 - 168
  • [22] Document style census for OCR
    Nagy, G
    Sarkar, P
    FIRST INTERNATIONAL WORKSHOP ON DOCUMENT IMAGE ANALYSIS FOR LIBRARIES, PROCEEDINGS, 2004, : 134 - 147
  • [23] A proposed method for the improvement in biometric facial image recognition using document-based classification
    Senthilkumar, Rajarathinam
    Gnanamurthy, Ramasamy Kannan
    JOURNAL OF SUPERCOMPUTING, 2020, 76 (06): : 4476 - 4494
  • [24] A proposed method for the improvement in biometric facial image recognition using document-based classification
    Rajarathinam Senthilkumar
    Ramasamy Kannan Gnanamurthy
    The Journal of Supercomputing, 2020, 76 : 4476 - 4494
  • [25] Building Super-Resolution Image Generator for OCR Accuracy Improvement
    Peng, Xujun
    Wang, Chao
    DOCUMENT ANALYSIS SYSTEMS, 2020, 12116 : 145 - 160
  • [26] A Review of Document Image Enhancement Based on Document Degradation Problem
    Zhou, Yanxi
    Zuo, Shikai
    Yang, Zhengxian
    He, Jinlong
    Shi, Jianwen
    Zhang, Rui
    APPLIED SCIENCES-BASEL, 2023, 13 (13):
  • [27] Image Processing based Degraded Camera Captured Document Enhancement for Improved OCR Accuracy
    Sharma, Pooja
    Sharma, Shanu
    2016 6TH INTERNATIONAL CONFERENCE - CLOUD SYSTEM AND BIG DATA ENGINEERING (CONFLUENCE), 2016, : 441 - 444
  • [28] PROBLEM OF IMAGE QUALITY IMPROVEMENT
    PYTIEV, IP
    DOKLADY AKADEMII NAUK SSSR, 1979, 245 (02): : 315 - 319
  • [29] Footnote-Based Document Image Classification
    Zhalehpour, Sara
    Piper, Andrew
    Wellmon, Chad
    Cheriet, Mohamed
    IMAGE ANALYSIS AND RECOGNITION, ICIAR 2017, 2017, 10317 : 634 - 642
  • [30] Structural similarity for document image classification and retrieval
    Kumar, Jayant
    Ye, Peng
    Doermann, David
    PATTERN RECOGNITION LETTERS, 2014, 43 : 119 - 126