Scene Text Extraction using Convolutional Neural Network with Amended MSER

Cited by: 0
Authors
Yegnaraman, Aparna [1]
Valli, S. [1]
Affiliations
[1] Anna Univ, Dept Comp Sci & Engn, Coll Engn, Chennai 600025, Tamil Nadu, India
Source
Keywords
Convolution layer; Deep learning framework; Focal loss; Maximally stable extremal regions; YOLOv2; LOCALIZATION;
DOI
Not available
Chinese Library Classification (CLC) number
T [Industrial Technology];
Discipline classification code
08;
Abstract
Text content communicates relevant and specific information to users precisely. This work introduces an approach for extracting text from natural scene images that combines an amended Maximally Stable Extremal Regions (a-MSER) method with a deep learning framework, the You Only Look Once (YOLOv2) network. The proposed system, a-MSER with Scene Text Extraction using Modified YOLOv2 Network (STEMYN), performs well when evaluated on three publicly available datasets. The a-MSER method, a variant of MSER, identifies the regions of interest and accounts effectively for intensity changes between text and background. The main drawback of the original YOLOv2, its poor detection rate for small objects, is overcome by employing a 1 x 1 layer with the image size increased from 13 x 13 to 26 x 26. Focal loss replaces the cross-entropy classification loss of YOLOv2. The repeated convolution layer in the steep layer of the original YOLOv2 is removed to reduce network complexity, since it does not improve system performance. Experimental results demonstrate that the proposed method is effective at identifying text in natural scene images.
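The abstract states that focal loss is used in place of the cross-entropy classification loss. The sketch below illustrates the general idea for a single binary prediction (text versus background). It is not the paper's implementation: the function names are hypothetical, and the values alpha = 0.25 and gamma = 2.0 are the commonly cited defaults for focal loss, assumed here because this record does not give the authors' settings.

```python
import math

EPS = 1e-12  # guards against log(0); an implementation detail assumed here


def cross_entropy(p: float, y: int) -> float:
    """Binary cross entropy for one prediction p = P(text) and label y in {0, 1}."""
    p_t = p if y == 1 else 1.0 - p          # probability assigned to the true class
    return -math.log(max(p_t, EPS))


def focal_loss(p: float, y: int, alpha: float = 0.25, gamma: float = 2.0) -> float:
    """Focal loss: cross entropy scaled by alpha_t * (1 - p_t) ** gamma.

    alpha and gamma are assumed defaults, not values taken from the paper.
    """
    p_t = p if y == 1 else 1.0 - p
    alpha_t = alpha if y == 1 else 1.0 - alpha
    # The modulating factor shrinks the loss of well-classified (easy) examples.
    return -alpha_t * (1.0 - p_t) ** gamma * math.log(max(p_t, EPS))


if __name__ == "__main__":
    for p in (0.9, 0.5, 0.1):               # easy, ambiguous, hard positive examples
        print(p, cross_entropy(p, 1), focal_loss(p, 1))
```

With gamma = 0 and alpha = 1, the focal loss of a positive example reduces to plain cross entropy. Larger gamma values put relatively more weight on hard examples, which is the usual motivation for applying it to detectors that see many easy background regions.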
Pages: 817 - 827
Page count: 11
Related papers
50 items in total
  • [31] Thai Text Detection and Classification Using Convolutional Neural Network
    Malakar, Susanta
    Chiracharit, Werapon
    2020 59TH ANNUAL CONFERENCE OF THE SOCIETY OF INSTRUMENT AND CONTROL ENGINEERS OF JAPAN (SICE), 2020, : 99 - 102
  • [32] Scene text image super-resolution using multi-scale convolutional neural network with skip connections
    Walha, Rim
    Aouini, Amal
    APPLIED INTELLIGENCE, 2024, : 5931 - 5943
  • [33] A Convolutional Recurrent Neural-Network-Based Machine Learning for Scene Text Recognition Application
    Liu, Yiyi
    Wang, Yuxin
    Shi, Hongjian
    SYMMETRY-BASEL, 2023, 15 (04):
  • [34] Text Detection and Recognition for Natural Scene Images Using Deep Convolutional Neural Networks
    Wu, Xianyu
    Luo, Chao
    Zhang, Qian
    Zhou, Jiliu
    Yang, Hao
    Li, Yulian
    CMC-COMPUTERS MATERIALS & CONTINUA, 2019, 61 (01): : 289 - 300
  • [35] Natural scene text localization and detection using MSER and its variants: a comprehensive survey
    Dutta, Kalpita
    Sarkhel, Ritesh
    Kundu, Mahantapas
    Nasipuri, Mita
    Das, Nibaran
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (18) : 55773 - 55810
  • [36] Natural scene text localization and detection using MSER and its variants: a comprehensive survey
    Kalpita Dutta
    Ritesh Sarkhel
    Mahantapas Kundu
    Mita Nasipuri
    Nibaran Das
    Multimedia Tools and Applications, 2024, 83 : 55773 - 55810
  • [37] Scene Text Script Identification with Convolutional Recurrent Neural Networks
    Mei, Jieru
    Dai, Luo
    Shi, Baoguang
    Bai, Xiang
    2016 23RD INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2016, : 4053 - 4058
  • [38] Scene Text Detection Images With Pyramid Image and MSER Enhanced
    Turki, Houssem
    Ben Halima, Mohamed
    Alimi, Adel M.
    2015 15TH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS DESIGN AND APPLICATIONS (ISDA), 2015, : 301 - 306
  • [39] Handwritten text recognition and information extraction from ancient manuscripts using deep convolutional and recurrent neural network
    El Bahi, Hassan
    Soft Computing, 2024, 28 (20) : 12249 - 12268
  • [40] Automatic melody extraction algorithm using a convolutional neural network
    Lee, Jongseol
    Jang, Dalwon
    Yoon, Kyoungro
    KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2017, 11 (12): : 6038 - 6053