A light-weight natural scene text detection and recognition system

被引:3
|
作者
Ghosh, Jyoti [1 ]
Talukdar, Anjan Kumar [1 ]
Sarma, Kandarpa Kumar [1 ]
机构
[1] Gauhati Univ, Dept Elect & Commun Engn, Gauhati 781014, Assam, India
关键词
Scene text detection; Scene text recognition; Deep learning; Light-weight; MobileNetV2;
D O I
10.1007/s11042-023-15696-0
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Scene text recognition is an application of Computer Vision that analyses the scene image and recognizes the text present on it. This task has many applications and will gain more importance if it can be used in handheld devices. The problem with existing methods is that if the model has a huge number of parameters and complex architectures, then the model will have a huge file size which will be problematic to deploy the application on mobile devices. Therefore, the aim of this paper is to propose a light-weight model that is a model with less number of parameters, small file size and less complexity that can be used in platforms with limited resources while achieving a comparable accuracy with those of the heavy weight models. The proposed models rely on deep learning to handle most of the steps automatically, consume less time and give precise results after facing many challenges. The proposed scene text recognition model is in the form of a Convolutional-Recurrent Neural network where the Convolution network extracts the features from the cropped images of scene text and the Recurrent network processes the sequential data of varying length present in the cropped images. After training, the scene text recognition model generates a weight file of 12 MB with 1 M parameters. To reduce number of parameters, weight of files and to show trade-off between efficiency and accuracy, MobileNetV2 is used in place of Convolution network that generates weight file of 6 MB with 0.5 M parameters. The performance on ICDAR 2013, IIIT 5K and Total-Text datasets shows that the proposed work performs well in detecting and recognizing texts from natural scene images.
引用
收藏
页码:6651 / 6683
页数:33
相关论文
共 50 条
  • [31] Text Detection and Recognition for Natural Scene Images Using Deep Convolutional Neural Networks
    Wu, Xianyu
    Luo, Chao
    Zhang, Qian
    Zhou, Jiliu
    Yang, Hao
    Li, Yulian
    [J]. CMC-COMPUTERS MATERIALS & CONTINUA, 2019, 61 (01): : 289 - 300
  • [32] Traffic signs detection and recognition systems by light-weight multi-stage network
    Hou, Mingzheng
    Zhang, Xin
    Chen, Yang
    Dong, Penglin
    Feng, Ziliang
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (12) : 16155 - 16169
  • [33] End to End Text Recognition from Natural Scene
    Francis, Leena Mary
    Visalatchi, K. C.
    Sreenath, N.
    [J]. PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON INFORMATICS AND ANALYTICS (ICIA' 16), 2016,
  • [34] Robust Text Detection in Natural Scene Images
    Yin, Xu-Cheng
    Yin, Xuwang
    Huang, Kaizhu
    Hao, Hong-Wei
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2014, 36 (05) : 970 - 983
  • [35] Fast and Light-Weight Answer Text Retrieval in Dialogue Systems
    Wan, Hui
    Patel, Siva Sankalp
    Murdock, J. William
    Potdar, Saloni
    Joshi, Sachindra
    [J]. 2022 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, NAACL-HLT 2022, 2022, : 334 - 343
  • [36] Optical Character Recognition for Scene Text Detection, Mining and Recognition
    Nathiya, N.
    Pradeepa, K.
    [J]. 2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND COMPUTING RESEARCH (ICCIC), 2013, : 662 - 665
  • [37] Text Detection in Natural Scene Image: A Survey
    Wang, Shupeng
    Fu, Chenglin
    Li, Qi
    [J]. MACHINE LEARNING AND INTELLIGENT COMMUNICATIONS, 2017, 183 : 257 - 264
  • [38] Scene Text Detection in Natural Images: A Review
    Cao, Dongping
    Zhong, Yong
    Wang, Lishun
    He, Yilong
    Dang, Jiachen
    [J]. SYMMETRY-BASEL, 2020, 12 (12): : 1 - 26
  • [39] A Review: Text Detection in Natural Scene Image
    Sun, Yue
    Dawut, Abdusalam
    Hamdulla, Askar
    [J]. 2018 3RD INTERNATIONAL CONFERENCE ON SMART CITY AND SYSTEMS ENGINEERING (ICSCSE), 2018, : 826 - 829
  • [40] Text detection and restoration in natural scene images
    Ye, Qixiang
    Hao, Jianbin
    Huang, Jun
    Yu, Hua
    [J]. JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2007, 18 (06) : 504 - 513