Optimal affine image normalization approach for optical character recognition

被引:3
|
作者
Konovalenko, I. A. [1 ,2 ]
Kokhan, V. V. [1 ,2 ]
Nikolaev, D. P. [1 ,2 ]
机构
[1] Inst Informat Transmiss Problems RAS, Bolshoy Karetny Per 19,Bld 1, Moscow 127051, Russia
[2] Smart Engines, Pr T 60 Letiya Oktyabrya 9, Moscow 117312, Russia
基金
俄罗斯基础研究基金会;
关键词
optical character recognition; image registration; image normalization; coordinate discrepancy; projective transformation; affine transformation; approximation; optimization; symbolic computation; INVARIANT; ALGORITHMS;
D O I
10.18287/2412-6179-CO-759
中图分类号
O43 [光学];
学科分类号
070207 ; 0803 ;
摘要
Optical character recognition (OCR) in images captured from arbitrary angles requires preliminary normalization, i.e. a geometric transformation resulting in an image as if it was captured at an angle suitable for OCR. In most cases, a surface containing characters can be considered flat, and a pinhole model can be adopted for a camera. Thus, in theory, the normalization should be projective. Usually, the camera optical axis is approximately perpendicular to the document surface, so the projective normalization can be replaced with an affine one without a significant loss of accuracy. An affine image transformation is performed significantly faster than a projective normalization, which is important for OCR on mobile devices. In this work, we propose a fast approach for image normalization. It utilizes an affine normalization instead of a projective one if there is no significant loss of accuracy. The approach is based on a proposed criterion for the normalization accuracy: root mean square (RMS) coordinate discrepancies over the region of interest (ROI). The problem of optimal affine normalization according to this criterion is considered. We have established that this unconstrained optimization is quadratic and can be reduced to a problem of fractional quadratic functions integration over the ROI. The latter was solved analytically in the case of OCR where the ROI consists of rectangles. The proposed approach is generalized for various cases when instead of the affine transform its special cases are used: scaling, translation, shearing, and their superposition, allowing the image normalization procedure to be further accelerated.
引用
收藏
页码:90 / 100
页数:11
相关论文
共 50 条
  • [1] Handwritten Japanese character recognition using adaptive normalization by global affine transformation
    Wakahara, T
    Kimura, Y
    Sano, M
    SIXTH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION, PROCEEDINGS, 2001, : 424 - 428
  • [2] A Proposed Approach in Applying Optical Character Recognition for Thermal Image Processing
    Ti, Chan Wai
    Swee, Sim Kok
    Ping, Tso Chih
    PROCEEDINGS OF THE 2010 34TH IEEE/CPMT INTERNATIONAL ELECTRONICS MANUFACTURING TECHNOLOGY CONFERENCE (IEMT 2010), 2011,
  • [3] Generalized affine invariant image normalization
    Shen, DG
    Ip, HHS
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1997, 19 (05) : 431 - 440
  • [4] Invariant Image Recognition under Projective Deformations: An Image Normalization Approach
    Wei, Xue
    Son Lam Phung
    Bouzerdoum, Abdesselam
    Bermak, Amine
    2015 VISUAL COMMUNICATIONS AND IMAGE PROCESSING (VCIP), 2015,
  • [5] Normalization ensemble for handwritten character recognition
    Liu, CL
    Marukawa, K
    NINTH INTERNATIONAL WORKSHOP ON FRONTIERS IN HANDWRITING RECOGNITION, PROCEEDINGS, 2004, : 69 - 74
  • [6] CHARACTER-RECOGNITION AND OPTICAL CHARACTERISTICS OF IMAGE SCANNERS
    SZIRANYI, T
    BOROCZKI, A
    KOVACS, T
    APPLICATIONS OF DIGITAL IMAGE PROCESSING XII, 1989, 1153 : 485 - 493
  • [7] Application of optical character recognition in thermal image processing
    Chan, W. T.
    Sim, K. S.
    Tso, C. P.
    INFRARED PHYSICS & TECHNOLOGY, 2011, 54 (04) : 353 - 366
  • [8] Validation of image defect models for optical character recognition
    Li, YH
    Lopresti, D
    Nagy, G
    Tomkins, A
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1996, 18 (02) : 99 - 108
  • [9] Optical Character Recognition Guided Image Super Resolution
    Hildebrandt, Philipp
    Schulze, Maximilian
    Cohen, Sarel
    Doskoc, Vanja
    Saabni, Raid
    Friedrich, Tobias
    PROCEEDINGS OF THE 2022 ACM SYMPOSIUM ON DOCUMENT ENGINEERING, DOCENG 2022, 2022,
  • [10] Optical Character Recognition for Quranic Image Similarity Matching
    Alotaibi, Faiz
    Abdullah, Muhamad Taufik
    Abdullah, Rusli Bin Hj
    Rahmat, Rahmita Wirza Binti O. K.
    Hashem, Ibrahim Abaker Targio
    Sangaiah, Arun Kumar
    IEEE ACCESS, 2018, 6 : 554 - 562