Multi-Scale Scene Text Detection Based on Convolutional Neural Network

被引:0
|
作者
Lu, Yan-Feng [1 ]
Zhang, Ai-Xuan [2 ]
Li, Yi [3 ]
Yu, Qian-Hui [4 ]
Qiao, Hong [1 ]
机构
[1] Chinese Acad Sci, Inst Automat, State Key Lab Management & Control Complex Syst, Beijing, Peoples R China
[2] China Acad Aerosp, Standardizat & Prod Assurance, Beijing, Peoples R China
[3] Nanchang Univ, Sch Informat Engn, Nanchang, Jiangxi, Peoples R China
[4] Harbin Engn Univ, Coll Informat & Commun Engn, Harbin, Peoples R China
基金
中国国家自然科学基金;
关键词
deep learning; natural scene; teal detection; convolutional neural network; pyramid network;
D O I
10.1109/cac48633.2019.8996635
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Faster R-CNN has advantages in object detection task. But in face of the variability of text and interference of the external factors, it cannot achieve perfect detection results in natural scene text detection. Moreover, the text detection algorithms based on deep learning need to use large data sets to train the network, while in some special scenarios where a mass of samples cannot be obtained, the performance of these algorithms is likely to be limited. How to accurately detect text in natural scene based on small data sets is a challenging issue. To address this issue, a multi-scale text feature extraction network with feature pyramid based on Faster R-CNN is proposed, which can accurately and comprehensively express complex and changeable text features in natural scenes even in the small data cases. Experiment results show that the proposed MSTD method is very competitive with existing related architectures.
引用
收藏
页码:583 / 587
页数:5
相关论文
共 50 条
  • [1] Multi-scale face detection based on convolutional neural network
    Luo, Mingzhu
    Xiao, Yewei
    Zhou, Yan
    [J]. 2018 CHINESE AUTOMATION CONGRESS (CAC), 2018, : 1752 - 1757
  • [2] Multi-scale Convolutional Neural Network for Remote Sensing Scene Classification
    Alhichri, Haikel
    Alajlan, Naif
    Bazi, Yakoub
    Rabczuk, Timon
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON ELECTRO/INFORMATION TECHNOLOGY (EIT), 2018, : 113 - 117
  • [3] Deep Multi-scale Convolutional Neural Network for Dynamic Scene Deblurring
    Nah, Seungjun
    Kim, Tae Hyun
    Lee, Kyoung Mu
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 257 - 265
  • [4] Realtime multi-scale scene text detection with scale-based region proposal network
    He, Wenhao
    Zhang, Xu-Yao
    Yin, Fei
    Luo, Zhenbo
    Ogier, Jean-Marc
    Liu, Cheng-Lin
    [J]. PATTERN RECOGNITION, 2020, 98
  • [5] A Network Intrusion Detection Method Based on Deep Multi-scale Convolutional Neural Network
    Wang, Xiaowei
    Yin, Shoulin
    Li, Hang
    Wang, Jiachi
    Teng, Lin
    [J]. INTERNATIONAL JOURNAL OF WIRELESS INFORMATION NETWORKS, 2020, 27 (04) : 503 - 517
  • [6] A Network Intrusion Detection Method Based on Deep Multi-scale Convolutional Neural Network
    Xiaowei Wang
    Shoulin Yin
    Hang Li
    Jiachi Wang
    Lin Teng
    [J]. International Journal of Wireless Information Networks, 2020, 27 : 503 - 517
  • [7] Scene text image super-resolution using multi-scale convolutional neural network with skip connections
    Walha, Rim
    Aouini, Amal
    [J]. APPLIED INTELLIGENCE, 2024, : 5931 - 5943
  • [8] An efficient fire detection algorithm based on multi-scale convolutional neural network
    Cheng, Yanying
    Chen, Ke
    Bai, Hui
    Mou, Chunjie
    Zhang, Yuchun
    Yang, Kai
    Gao, Yunji
    Liu, Yu
    [J]. FIRE AND MATERIALS, 2022, 46 (07) : 981 - 992
  • [9] AN IMPROVED MULTI-SCALE FIRE DETECTION METHOD BASED ON CONVOLUTIONAL NEURAL NETWORK
    Huang Hongyu
    Kuang Ping
    Li Fan
    Shi Huaxin
    [J]. 2020 17TH INTERNATIONAL COMPUTER CONFERENCE ON WAVELET ACTIVE MEDIA TECHNOLOGY AND INFORMATION PROCESSING (ICCWAMTIP), 2020, : 109 - 112
  • [10] A Multi-Scale Fusion Convolutional Neural Network for Face Detection
    Chen, Qiaosong
    Meng, Xiaomin
    Li, Wen
    Fu, Xingyu
    Deng, Xin
    Wang, Jin
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2017, : 1013 - 1018