Scene Text Extraction using Convolutional Neural Network with Amended MSER

Cited by: 0
Authors
Yegnaraman, Aparna [1]
Valli, S. [1]
Affiliation
[1] Anna Univ, Dept Comp Sci & Engn, Coll Engn, Chennai 600025, Tamil Nadu, India
Source
Keywords
Convolution layer; Deep learning framework; Focal loss; Maximally stable extremal regions; YOLOv2; Localization
DOI
Not available
Chinese Library Classification
T [Industrial Technology]
Discipline classification code
08
Abstract
Text conveys relevant and specific information to users precisely. This work introduces an approach for extracting text from natural scene images that combines an amended Maximally Stable Extremal Regions detector (a-MSER) with a deep learning framework, the You Only Look Once (YOLOv2) network. The proposed system, a-MSER with Scene Text Extraction using Modified YOLOv2 Network (STEMYN), performs well when evaluated on three publicly available datasets. a-MSER, a variation of MSER, identifies regions of interest by exploiting the intensity changes between text and background. The main drawback of the original YOLOv2, its poor detection rate for small objects, is addressed by employing a 1 x 1 layer with the resolution increased from 13 x 13 to 26 x 26. Focal loss is applied to improve on the existing cross-entropy classification loss of YOLOv2. The repeated convolution layer in the steep layer of the original YOLOv2 is removed to reduce network complexity, since it does not improve system performance. Experimental results demonstrate that the proposed method is effective at identifying text in natural scene images.
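The abstract names focal loss as the replacement for the cross-entropy classification loss but gives no formula or settings. The sketch below shows the commonly used binary form, FL(p_t) = -alpha_t (1 - p_t)^gamma log(p_t), next to plain cross entropy. The function names and the values gamma = 2.0 and alpha = 0.25 are illustrative assumptions, not settings taken from the paper.

```python
import math

EPS = 1e-7  # guards against log(0); an assumption of this sketch


def cross_entropy(p: float, y: int) -> float:
    """Binary cross entropy for predicted text probability p and label y (0 or 1)."""
    p_t = p if y == 1 else 1.0 - p
    p_t = min(max(p_t, EPS), 1.0)
    return -math.log(p_t)


def focal_loss(p: float, y: int, gamma: float = 2.0, alpha: float = 0.25) -> float:
    """Binary focal loss: cross entropy scaled by alpha_t * (1 - p_t) ** gamma.

    gamma and alpha are assumed defaults, not values reported in the abstract.
    """
    p_t = p if y == 1 else 1.0 - p
    p_t = min(max(p_t, EPS), 1.0)
    alpha_t = alpha if y == 1 else 1.0 - alpha
    # The modulating factor shrinks the loss of well-classified examples.
    return -alpha_t * (1.0 - p_t) ** gamma * math.log(p_t)


if __name__ == "__main__":
    for p in (0.9, 0.5, 0.1):  # easy, ambiguous, and hard positive examples
        print(p, cross_entropy(p, 1), focal_loss(p, 1))
```

With gamma = 0 and alpha = 1 the positive-class focal loss reduces to cross entropy. Larger gamma values shift the training signal toward hard examples, which is the usual motivation for using focal loss when easy background regions far outnumber text regions.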
Pages: 817-827
Page count: 11