GPGPU-based High Throughput Image Pre-processing Towards Large-Scale Optical Character Recognition

被引:0
|
作者
Gener, Serhan [1 ]
Dattilo, Parker [1 ]
Gajaria, Dhruv [1 ]
Fusco, Alexander [1 ]
Akoglu, Ali [1 ]
机构
[1] Univ Arizona, Dept Elect & Comp Engn, Tucson, AZ 85721 USA
来源
2022 IEEE/ACS 19TH INTERNATIONAL CONFERENCE ON COMPUTER SYSTEMS AND APPLICATIONS (AICCSA) | 2022年
基金
美国国家科学基金会;
关键词
Optical Character Recognition (OCR); Tesseract; Leptonica; Image Processing; CUDA; GPU;
D O I
10.1109/AICCSA56895.2022.10017481
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Studies have shown that pre-processing digital images through scaling, rotation and blurring type of operations allow optical character recognition (OCR) to focus on the key features in the image and result in improving recognition accuracy. We leverage the open-source Tesseract OCR and show that its accuracy can be improved through a pre-processing flow that includes thresholding, rotation, rescaling, erosion, dilation, and noise removal steps based on a dataset that is formed of 560 phone screen images. However, the serial CPU-based implementation of this flow introduces a latency of 48.32 ms per image on average. Even though time scale is low in the context of a single image, this latency poses as a barrier when processing millions of images with OCR. To address this, we parallelize the entire pre-processing flow on the Nvidia P100 GPU, implement a streaming based execution, and reduce the latency to 0.846 ms. This streaming-enabled implementation enables setting up a GPU based OCR engine to process large scale workloads.
引用
收藏
页数:7
相关论文
共 50 条
  • [41] Deep learning based data augmentation for large-scale mineral image recognition and classification
    Liu, Yang
    Wang, Xueyi
    Zhang, Zelin
    Deng, Fang
    MINERALS ENGINEERING, 2023, 204
  • [42] Location-based large-scale landmark image recognition scheme for mobile devices
    Kim, Daehoon
    Hwang, Eenjun
    Rho, Seungmin
    2012 THIRD FTRA INTERNATIONAL CONFERENCE ON MOBILE, UBIQUITOUS, AND INTELLIGENT COMPUTING (MUSIC), 2012, : 47 - 52
  • [43] A gossip-based reliable multicast for large-scale high-throughput applications
    Sun, QX
    Sturman, DC
    DSN 2000: INTERNATIONAL CONFERENCE ON DEPENDABLE SYSTEMS AND NETWORKS, PROCEEDINGS, 2000, : 347 - 358
  • [44] Compact Representation of High-Dimensional Feature Vectors for Large-Scale Image Recognition and Retrieval
    Zhang, Yu
    Wu, Jianxin
    Cai, Jianfei
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2016, 25 (05) : 2407 - 2419
  • [45] Performance Evaluation of Image-Based Location Recognition Approaches based on large-scale UAV Imagery
    Hesse, Nikolas
    Bodensteiner, Christoph
    Arens, Michael
    ELECTRO-OPTICAL REMOTE SENSING, PHOTONIC TECHNOLOGIES, AND APPLICATIONS VIII; AND MILITARY APPLICATIONS IN HYPERSPECTRAL IMAGING AND HIGH SPATIAL RESOLUTION SENSING II, 2014, 9250
  • [46] Towards Cloud-based Distributed Scaleable Processing over Large-scale Temporal Graphs
    Steinbauer, Matthias
    Kotsis, Gabriele
    2014 IEEE 23RD INTERNATIONAL WETICE CONFERENCE (WETICE), 2014, : 143 - 148
  • [47] Optimal Feature Selection based on Image Pre-processing using Accelerated Binary Particle Swarm Optimization for Enhanced Face Recognition
    Aneesh, M. U.
    Masand, Abhishek A. K.
    Manikantan, K.
    INTERNATIONAL CONFERENCE ON COMMUNICATION TECHNOLOGY AND SYSTEM DESIGN 2011, 2012, 30 : 750 - 758
  • [48] DWT-based Face Recognition using Fast Walsh Hadamard Transform and Chiral Image Superimposition as pre-processing techniques
    Niveditha, G., V
    Sharmila, B. P.
    Manikantan, K.
    Ramachandran, S.
    2015 2ND INTERNATIONAL CONFERENCE ON ELECTRONICS AND COMMUNICATION SYSTEMS (ICECS), 2015, : 7 - 14
  • [49] Classifying for a Mixture of Object Images and Character Patterns by Using CNN Pre-trained for Large-scale Object Image Dataset
    Shima, Yoshihiro
    Nakashima, Yumi
    Yasuda, Michio
    PROCEEDINGS OF THE 2018 13TH IEEE CONFERENCE ON INDUSTRIAL ELECTRONICS AND APPLICATIONS (ICIEA 2018), 2018, : 2360 - 2365
  • [50] Automatic target recognition scheme for a high-resolution and large-scale synthetic aperture radar image
    Tu, Song
    Su, Yi
    Wang, Wei
    Xiong, Boli
    Li, Yu
    JOURNAL OF APPLIED REMOTE SENSING, 2015, 9