Multi-Scale Scene Text Detection Based on Convolutional Neural Network

Cited by: 0
Authors
Lu, Yan-Feng [1 ]
Zhang, Ai-Xuan [2 ]
Li, Yi [3 ]
Yu, Qian-Hui [4 ]
Qiao, Hong [1 ]
Affiliations
[1] Chinese Acad Sci, Inst Automat, State Key Lab Management & Control Complex Syst, Beijing, Peoples R China
[2] China Acad Aerosp, Standardizat & Prod Assurance, Beijing, Peoples R China
[3] Nanchang Univ, Sch Informat Engn, Nanchang, Jiangxi, Peoples R China
[4] Harbin Engn Univ, Coll Informat & Commun Engn, Harbin, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
deep learning; natural scene; text detection; convolutional neural network; pyramid network;
DOI
10.1109/cac48633.2019.8996635
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812 ;
Abstract
Faster R-CNN performs well on general object detection. However, given the variability of text and interference from external factors, it falls short of ideal detection results in natural scene text detection. Moreover, deep-learning-based text detection algorithms need large data sets to train the network, so their performance is likely to be limited in special scenarios where large numbers of samples cannot be obtained. Accurately detecting text in natural scenes from small data sets is therefore a challenging problem. To address it, a multi-scale text feature extraction network with a feature pyramid, built on Faster R-CNN, is proposed; it can accurately and comprehensively represent complex and changeable text features in natural scenes even when only small data sets are available. Experimental results show that the proposed MSTD method is highly competitive with existing related architectures.
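The abstract does not give the network's details, so the following is only a minimal sketch of the general feature-pyramid idea it refers to (merging coarse and fine feature maps top-down), not the authors' MSTD architecture. Everything here is an assumption made for illustration: the names `upsample2x`, `add_maps` and `top_down_pyramid` are hypothetical, feature maps are single-channel nested lists, and the lateral and smoothing convolutions of a real feature pyramid are left out.

```python
from typing import List

Grid = List[List[float]]  # single-channel feature map (illustrative simplification)


def upsample2x(fmap: Grid) -> Grid:
    """Nearest-neighbour 2x upsampling: each cell becomes a 2x2 block."""
    out: Grid = []
    for row in fmap:
        wide = [v for v in row for _ in range(2)]
        out.append(wide)
        out.append(list(wide))
    return out


def add_maps(a: Grid, b: Grid) -> Grid:
    """Element-wise sum of two equally sized maps."""
    if len(a) != len(b) or any(len(ra) != len(rb) for ra, rb in zip(a, b)):
        raise ValueError("feature maps must have the same shape")
    return [[x + y for x, y in zip(ra, rb)] for ra, rb in zip(a, b)]


def top_down_pyramid(bottom_up: List[Grid]) -> List[Grid]:
    """Merge backbone maps ordered fine -> coarse into pyramid levels.

    Each level is the backbone map plus the upsampled level above it.
    The 1x1 lateral and 3x3 smoothing convolutions of a real feature
    pyramid are omitted in this sketch.
    """
    levels = [bottom_up[-1]]  # the coarsest map is used as is
    for fmap in reversed(bottom_up[:-1]):
        levels.append(add_maps(fmap, upsample2x(levels[-1])))
    return levels[::-1]  # back to fine -> coarse order


# Toy backbone outputs at three scales: 4x4, 2x2 and 1x1.
c2 = [[1.0] * 4 for _ in range(4)]
c3 = [[10.0] * 2 for _ in range(2)]
c4 = [[100.0]]
p2, p3, p4 = top_down_pyramid([c2, c3, c4])
```

In this toy setup every fine-resolution cell of `p2` also carries the values propagated down from the coarser maps, which is the property that lets a detector see both small and large text at each level.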
Pages: 583-587
Number of pages: 5