Don't Miss the Fine Print! An Enhanced Framework to Extract Text from Low Resolution Images

被引:1
|
作者
Dugar, Pranay [1 ]
Vikram, Aditya [1 ,2 ,3 ]
Chatterjee, Anirban [1 ]
Banerjee, Kunal [1 ]
Agneeswaran, Vijay [1 ]
机构
[1] Walmart Global Tech, Bangalore, Karnataka, India
[2] Flipkart, Bangalore, Karnataka, India
[3] Indian Inst Sci, Bangalore, Karnataka, India
关键词
Scene Text Recognition; Super-resolution; Text Extraction; Convolution Neural Network; NETWORK;
D O I
10.5220/0010971100003124
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Scene Text Recognition (STR) enables processing and understanding of the text in the wild. However, roadblocks like natural degradation, blur, and uneven lighting in the captured images result in poor accuracy during detection and recognition. Previous approaches have introduced Super-Resolution (SR) as a processing step between detection and recognition; however, post enhancement, there is a significant drop in the quality of the reconstructed text in the image. This drop is especially significant in the healthcare domain because any loss in accuracy can be detrimental. This paper will quantitatively show the drop in quality of the text in an image from the existing SR techniques across multiple optimization-based and GAN-based models. We propose a new loss function for training and an improved deep neural network architecture to address these shortcomings and recover text with sharp boundaries in the SR images. We also show that the Peak Signal-to-Noise Ratio (PSNR) and the Structural Similarity Index Measure (SSIM) scores are not effective metrics for identifying the quality of the text in an SR image. Extensive experiments show that our model achieves better accuracy and visual improvements against state-of-the-art methods in terms of text recognition accuracy. We plan to add our module on SR in the near future to our already deployed solution for text extraction from product images for our company.
引用
收藏
页码:664 / 671
页数:8
相关论文
共 21 条
  • [1] Text Region Extraction from Low Resolution Natural Scene Images using Texture Features
    Angadi, S. A.
    Kodabagi, M. M.
    [J]. 2010 IEEE 2ND INTERNATIONAL ADVANCE COMPUTING CONFERENCE, 2010, : 121 - 128
  • [2] Machine learning based framework for fine-grained word segmentation and enhanced text normalization for low resourced language
    Nazir, Shahzad
    Asif, Muhammad
    Rehman, Mariam
    Ahmad, Shahbaz
    [J]. PEERJ COMPUTER SCIENCE, 2024, 10
  • [3] HIGH RESOLUTION T1 ESTIMATION FROM MULTIPLE LOW RESOLUTION MAGNETIC RESONANCE IMAGES
    Van Steenkiste, Gwendolyn
    Poot, Dirk H. J.
    Jeurissen, Ben
    den Dekker, Arnold J.
    Sijbers, Jan
    [J]. 2015 IEEE 12th International Symposium on Biomedical Imaging (ISBI), 2015, : 1036 - 1039
  • [4] A framework for cortical laminar composition analysis using low-resolution T1 MRI images
    Ittai Shamir
    Omri Tomer
    Zvi Baratz
    Galia Tsarfaty
    Maya Faraggi
    Assaf Horowitz
    Yaniv Assaf
    [J]. Brain Structure and Function, 2019, 224 : 1457 - 1467
  • [5] A framework for cortical laminar composition analysis using low-resolution T1 MRI images
    Shamir, Ittai
    Tomer, Omri
    Baratz, Zvi
    Tsarfaty, Galia
    Faraggi, Maya
    Horowitz, Assaf
    Assaf, Yaniv
    [J]. BRAIN STRUCTURE & FUNCTION, 2019, 224 (04): : 1457 - 1467
  • [6] Use of salient features for the design of a multistage framework to extract roads from high-resolution multispectral satellite images
    Das, Sukhendu
    Mirnalinee, T.T.
    Varghese, Koshy
    [J]. IEEE Transactions on Geoscience and Remote Sensing, 2011, 49 (10 PART 2): : 3906 - 3931
  • [7] Use of Salient Features for the Design of a Multistage Framework to Extract Roads From High-Resolution Multispectral Satellite Images
    Das, Sukhendu
    Mirnalinee, T. T.
    Varghese, Koshy
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2011, 49 (10): : 3906 - 3931
  • [8] Historic Low Wall Detection via Topographic Parameter Images Derived from Fine-Resolution DEM
    Chu, Hone-Jay
    Huang, Min-Lang
    Tain, Yu-Ching
    Yang, Mon-Shieh
    Hoefle, Bernhard
    [J]. ISPRS INTERNATIONAL JOURNAL OF GEO-INFORMATION, 2017, 6 (11)
  • [9] A Robust Segmentation Technique for Line, Word and Character Extraction from Kannada Text in Low Resolution Display Board Images
    Angadi, S. A.
    Kodabagi, M. M.
    [J]. INTERNATIONAL JOURNAL OF IMAGE AND GRAPHICS, 2014, 14 (1-2)
  • [10] A Robust Segmentation Technique for Line, Word and Character Extraction from Kannada Text in Low Resolution Display Board Images
    Angadi, S. A.
    Kodabagi, M. M.
    [J]. 2014 FIFTH INTERNATIONAL CONFERENCE ON SIGNAL AND IMAGE PROCESSING (ICSIP 2014), 2014, : 42 - 49