A light-weight natural scene text detection and recognition system

被引:3
|
作者
Ghosh, Jyoti [1 ]
Talukdar, Anjan Kumar [1 ]
Sarma, Kandarpa Kumar [1 ]
机构
[1] Gauhati Univ, Dept Elect & Commun Engn, Gauhati 781014, Assam, India
关键词
Scene text detection; Scene text recognition; Deep learning; Light-weight; MobileNetV2;
D O I
10.1007/s11042-023-15696-0
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Scene text recognition is an application of Computer Vision that analyses the scene image and recognizes the text present on it. This task has many applications and will gain more importance if it can be used in handheld devices. The problem with existing methods is that if the model has a huge number of parameters and complex architectures, then the model will have a huge file size which will be problematic to deploy the application on mobile devices. Therefore, the aim of this paper is to propose a light-weight model that is a model with less number of parameters, small file size and less complexity that can be used in platforms with limited resources while achieving a comparable accuracy with those of the heavy weight models. The proposed models rely on deep learning to handle most of the steps automatically, consume less time and give precise results after facing many challenges. The proposed scene text recognition model is in the form of a Convolutional-Recurrent Neural network where the Convolution network extracts the features from the cropped images of scene text and the Recurrent network processes the sequential data of varying length present in the cropped images. After training, the scene text recognition model generates a weight file of 12 MB with 1 M parameters. To reduce number of parameters, weight of files and to show trade-off between efficiency and accuracy, MobileNetV2 is used in place of Convolution network that generates weight file of 6 MB with 0.5 M parameters. The performance on ICDAR 2013, IIIT 5K and Total-Text datasets shows that the proposed work performs well in detecting and recognizing texts from natural scene images.
引用
收藏
页码:6651 / 6683
页数:33
相关论文
共 50 条
  • [1] A light-weight natural scene text detection and recognition system
    Jyoti Ghosh
    Anjan Kumar Talukdar
    Kandarpa Kumar Sarma
    [J]. Multimedia Tools and Applications, 2024, 83 : 6651 - 6683
  • [2] Text Detection and Recognition in Natural Scene Images
    Pise, Amruta
    Ruikar, S. D.
    [J]. 2014 INTERNATIONAL CONFERENCE ON COMMUNICATIONS AND SIGNAL PROCESSING (ICCSP), 2014,
  • [3] Text Detection and Recognition in Natural Scene Images
    Huang, Xiaoming
    Shen, Tao
    Wang, Run
    Gao, Chenqiang
    [J]. PROCEEDINGS OF 2015 INTERNATIONAL CONFERENCE ON ESTIMATION, DETECTION AND INFORMATION FUSION ICEDIF 2015, 2015, : 44 - 49
  • [4] Research on the Text Detection and Recognition in Natural Scene Images
    Wei Zi-han
    Du Xiao-ping
    Cao Lei
    [J]. ELEVENTH INTERNATIONAL CONFERENCE ON GRAPHICS AND IMAGE PROCESSING (ICGIP 2019), 2020, 11373
  • [5] Text detection and recognition in natural scene with edge analysis
    Yu, Chong
    Song, Yonghong
    Meng, Quan
    Zhang, Yuanlin
    Liu, Yang
    [J]. IET COMPUTER VISION, 2015, 9 (04) : 603 - 613
  • [6] A Database for Urdu Text Detection and Recognition in Natural Scene Images
    Chandio, Asghar Ali
    Leghari, Mehwish
    Memon, Mukhtiar Ahmed
    Leghari, Mehjabeen
    Jalbani, Akhtar Hussain
    [J]. MEHRAN UNIVERSITY RESEARCH JOURNAL OF ENGINEERING AND TECHNOLOGY, 2020, 39 (01) : 47 - 54
  • [7] A Light-Weight Practical Framework for Feces Detection and Trait Recognition
    Leng, Lu
    Yang, Ziyuan
    Kim, Cheonshik
    Zhang, Yue
    [J]. SENSORS, 2020, 20 (09)
  • [8] Text Flow: A Unified Text Detection System in Natural Scene Images
    Tian, Shangxuan
    Pan, Yifeng
    Huang, Chang
    Lu, Shijian
    Yu, Kai
    Tan, Chew Lim
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 4651 - 4659
  • [9] A Light-Weight Text Summarization System for Fast Access to Medical Evidence
    Sarker, Abeed
    Yang, Yuan-Chi
    Al-Garadi, Mohammed Ali
    Abbas, Aamir
    [J]. FRONTIERS IN DIGITAL HEALTH, 2020, 2
  • [10] Chinese Text Detection and Recognition in Natural Scene Using HOG and SVM
    Yu, Boran
    Wan, Hongjie
    [J]. 2016 6TH INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY FOR MANUFACTURING SYSTEMS (ITMS 2016), 2016, : 148 - 152