GPGPU-based High Throughput Image Pre-processing Towards Large-Scale Optical Character Recognition

Authors
Gener, Serhan [1]
Dattilo, Parker [1]
Gajaria, Dhruv [1]
Fusco, Alexander [1]
Akoglu, Ali [1]
Affiliation
[1] Univ Arizona, Dept Elect & Comp Engn, Tucson, AZ 85721 USA
Source
2022 IEEE/ACS 19th International Conference on Computer Systems and Applications (AICCSA), 2022
Funding
U.S. National Science Foundation
Keywords
Optical Character Recognition (OCR); Tesseract; Leptonica; Image Processing; CUDA; GPU;
DOI
10.1109/AICCSA56895.2022.10017481
Chinese Library Classification number
TP18 [Artificial Intelligence Theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Studies have shown that pre-processing digital images with operations such as scaling, rotation, and blurring allows optical character recognition (OCR) to focus on the key features in the image and improves recognition accuracy. We leverage the open-source Tesseract OCR engine and show, on a dataset of 560 phone screen images, that its accuracy can be improved through a pre-processing flow that includes thresholding, rotation, rescaling, erosion, dilation, and noise removal steps. However, the serial CPU-based implementation of this flow introduces a latency of 48.32 ms per image on average. Although this latency is small for a single image, it becomes a barrier when processing millions of images with OCR. To address this, we parallelize the entire pre-processing flow on the Nvidia P100 GPU, implement streaming-based execution, and reduce the latency to 0.846 ms. This streaming-enabled implementation makes it possible to set up a GPU-based OCR engine for large-scale workloads.
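To put the two latency figures in perspective, the short Python sketch below works through the arithmetic. It is only an illustration: the two latencies come from the abstract, while the batch size of one million images and the assumption that total time scales linearly with the per-image latency are ours, not the authors'.

```python
# Latencies reported in the abstract (milliseconds per image).
CPU_MS_PER_IMAGE = 48.32   # serial CPU-based pre-processing flow
GPU_MS_PER_IMAGE = 0.846   # streaming implementation on the Nvidia P100

# Hypothetical batch size, chosen only for illustration.
NUM_IMAGES = 1_000_000


def total_seconds(ms_per_image: float, num_images: int) -> float:
    """Total time in seconds, assuming time scales linearly with image count."""
    return ms_per_image * num_images / 1000.0


def images_per_second(ms_per_image: float) -> float:
    """Sustained throughput implied by a fixed per-image latency."""
    return 1000.0 / ms_per_image


speedup = CPU_MS_PER_IMAGE / GPU_MS_PER_IMAGE
cpu_hours = total_seconds(CPU_MS_PER_IMAGE, NUM_IMAGES) / 3600.0
gpu_minutes = total_seconds(GPU_MS_PER_IMAGE, NUM_IMAGES) / 60.0

print(f"speedup:        {speedup:.1f}x")
print(f"CPU throughput: {images_per_second(CPU_MS_PER_IMAGE):.1f} images/s")
print(f"GPU throughput: {images_per_second(GPU_MS_PER_IMAGE):.1f} images/s")
print(f"CPU time for {NUM_IMAGES:,} images: {cpu_hours:.2f} hours")
print(f"GPU time for {NUM_IMAGES:,} images: {gpu_minutes:.1f} minutes")
```

Under these assumptions the reported latencies correspond to a speedup of roughly 57x, which turns about 13.4 hours of pre-processing for one million images into about 14 minutes.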
Pages: 7