Scene Text Extraction using Convolutional Neural Network with Amended MSER

Cited by: 0
Authors
Yegnaraman, Aparna [1]
Valli, S. [1]
Affiliations
[1] Anna Univ, Dept Comp Sci & Engn, Coll Engn, Chennai 600025, Tamil Nadu, India
Keywords
Convolution layer; Deep learning framework; Focal loss; Maximally stable extremal regions; YOLOv2; Localization
DOI
Not available
Chinese Library Classification (CLC) number
T [Industrial Technology]
Discipline classification code
08
Abstract
Textual content conveys relevant and specific information to users precisely. This work introduces an approach for extracting text from natural scene images that combines amended Maximally Stable Extremal Regions (a-MSER) with a deep learning framework, the You Only Look Once (YOLOv2) network. The proposed system, a-MSER with Scene Text Extraction using Modified YOLOv2 Network (STEMYN), performs remarkably well when evaluated on three publicly available datasets. The a-MSER method, a variant of MSER, identifies the regions of interest and accounts effectively for intensity changes between text and background. The drawback of the original YOLOv2, its poor detection rate for small objects, is overcome by employing a 1 x 1 layer with the image size increased from 13 x 13 to 26 x 26. Focal loss is applied to improve upon the existing cross-entropy classification loss of YOLOv2. The repeated convolution layer in the steep layer of the original YOLOv2 is removed to reduce network complexity, as it does not improve system performance. Experimental results demonstrate that the proposed method is effective in identifying text in natural scene images.
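The abstract does not give the loss formulation, so the following is only a minimal sketch of how a focal loss generally relates to the cross-entropy loss it replaces. It assumes the common binary form FL(p_t) = -alpha_t (1 - p_t)^gamma log(p_t); the function names and the values gamma = 2.0 and alpha = 0.25 are illustrative assumptions, not settings reported for STEMYN.

```python
import math


def cross_entropy(p, y):
    """Binary cross entropy for one prediction p = P(text) and label y in {0, 1}."""
    p_t = p if y == 1 else 1.0 - p  # probability assigned to the true class
    return -math.log(p_t)


def focal_loss(p, y, gamma=2.0, alpha=0.25):
    """Focal loss: cross entropy scaled by alpha_t * (1 - p_t) ** gamma.

    gamma and alpha are hypothetical defaults, not values from the paper.
    """
    p_t = p if y == 1 else 1.0 - p
    alpha_t = alpha if y == 1 else 1.0 - alpha
    return -alpha_t * (1.0 - p_t) ** gamma * math.log(p_t)


if __name__ == "__main__":
    # A well-classified text region (p_t = 0.9) versus a hard one (p_t = 0.1).
    for p in (0.9, 0.1):
        print(p, cross_entropy(p, 1), focal_loss(p, 1))
```

With gamma = 0 and alpha = 1 the positive-class focal loss reduces to plain cross entropy. Larger gamma shrinks the contribution of well-classified examples, which is the usual motivation for using it when background regions heavily outnumber text regions.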
Pages: 817-827
Number of pages: 11