Scene Text Extraction using Convolutional Neural Network with Amended MSER

被引:0
|
作者
Yegnaraman, Aparna [1 ]
Valli, S. [1 ]
机构
[1] Anna Univ, Dept Comp Sci & Engn, Coll Engn, Chennai 600025, Tamil Nadu, India
来源
关键词
Convolution layer; Deep learning framework; Focal loss; Maximally stable extremal regions; YOLOv2; LOCALIZATION;
D O I
暂无
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Content in the text format helps to communicate the relevant and specific information to users meticulously. A beneficial approach for extracting text from natural scene images is introduced which employs amended Maximally Stable Extremal Region (a-MSER) together with deep learning framework, You Only Look Once YOLOv2 network. The proposed system, a-MSER with Scene Text Extraction using Modified YOLOv2 Network (STEMYN), performs remarkably well by evaluating three publicly available datasets. The method a-MSER is used to identify the region of interest based on the variation of MSER. This algorithm considers intensity changes between text and background very effectively. The drawback of original YOLOv2, the poor detection rate for small-sized objects, is overcome by employing 1 x 1 layer with image size enhanced from 13 x 13 to 26 x 26. Focal loss is applied to improve upon the existing cross entropy classification loss of YOLOv2. The repeated convolution layer in the steep layer of the original YOLOv2 is removed to reduce the network complexity as it does not improve the system performance. Experimental results demonstrate that the proposed method is productive in identifying text from natural scene images.
引用
收藏
页码:817 / 827
页数:11
相关论文
共 50 条
  • [1] Text Region Extraction From Scene Images Using AGF and MSER
    Soni, Rituraj
    Kumar, Bijendra
    Chand, Satish
    INTERNATIONAL JOURNAL OF IMAGE AND GRAPHICS, 2020, 20 (02)
  • [2] Robust Scene Text Detection with Convolution Neural Network Induced MSER Trees
    Huang, Weilin
    Qiao, Yu
    Tang, Xiaoou
    COMPUTER VISION - ECCV 2014, PT IV, 2014, 8692 : 497 - 511
  • [3] Scene text recognition using residual convolutional recurrent neural network
    Lei, Zhengchao
    Zhao, Sanyuan
    Song, Hongmei
    Shen, Jianbing
    MACHINE VISION AND APPLICATIONS, 2018, 29 (05) : 861 - 871
  • [4] Scene text recognition using residual convolutional recurrent neural network
    Zhengchao Lei
    Sanyuan Zhao
    Hongmei Song
    Jianbing Shen
    Machine Vision and Applications, 2018, 29 : 861 - 871
  • [5] Text-Attentional Convolutional Neural Network for Scene Text Detection
    He, Tong
    Huang, Weilin
    Qiao, Yu
    Yao, Jian
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2016, 25 (06) : 2529 - 2541
  • [6] Text Detection in Low Resolution Scene Images Using Convolutional Neural Network
    Risnumawan, Anhar
    Sulistijono, Indra Adji
    Abawajy, Jemal
    RECENT ADVANCES ON SOFT COMPUTING AND DATA MINING, 2017, 549 : 366 - 375
  • [7] Thai Text Localization in Natural Scene Images using Convolutional Neural Network
    Kobchaisawat, Thananop
    Chalidabhongse, Thanarat H.
    2014 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2014,
  • [8] Scene text detection using enhanced Extremal region and convolutional neural network
    Fatemeh Naiemi
    Vahid Ghods
    Hassan Khalesi
    Multimedia Tools and Applications, 2020, 79 : 27137 - 27159
  • [9] Aksara Jawa Text Detection in Scene Images using Convolutional Neural Network
    Afakh, Muhammad Labiyb
    Risnumawan, Anhar
    Anggraeni, Martianda Erste
    Tamara, Mohamad Nasyir
    Ningrum, Endah Suryawati
    2017 INTERNATIONAL ELECTRONICS SYMPOSIUM ON KNOWLEDGE CREATION AND INTELLIGENT COMPUTING (IES-KCIC), 2017, : 77 - 82
  • [10] Scene text detection using enhanced Extremal region and convolutional neural network
    Naiemi, Fatemeh
    Ghods, Vahid
    Khalesi, Hassan
    MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (37-38) : 27137 - 27159