Video Text Detection with Text Edges and Convolutional Neural Network

Authors
Hu, Ping [1]
Wang, Weiqiang [1]
Lu, Ke [1]
Affiliations
[1] Univ Chinese Acad Sci, Beijing, Peoples R China
Keywords
SCENE IMAGES; REPRESENTATION
DOI
Not available
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Text and captions in videos provide useful information for content analysis and understanding. In this paper, we present a coarse-to-fine approach to detecting video text. In the coarse phase, we propose an efficient method that detects multi-scale candidate text regions with high recall. The candidate text regions are then segmented and passed to the fine phase, where a convolutional neural network (CNN) generates a confidence map for each candidate text region. Finally, the candidate text regions are further refined and partitioned into text lines by projection analysis. The CNN classifier in the fine phase enables feature sharing and robustly identifies text regions, while the coarse phase sharply reduces the number of windows that the CNN needs to scan. This combination gives the proposed method both efficiency and robustness in detecting video text, as verified by experimental results on two public test datasets and a dataset we created.
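The abstract does not give details of the projection analysis step, so the following is only a minimal sketch of the general technique, not the authors' implementation. It assumes a refined candidate region is available as a binary mask (1 for text pixels) and splits it into text lines using a horizontal projection profile. The function name `split_text_lines` and the thresholds `min_fill` and `min_height` are hypothetical and chosen for illustration.

```python
from typing import List, Tuple


def split_text_lines(
    mask: List[List[int]],
    min_fill: float = 0.2,   # assumed threshold: fraction of text pixels per row
    min_height: int = 2,     # assumed minimum height of a text line, in rows
) -> List[Tuple[int, int]]:
    """Return (top, bottom) row ranges of text lines; bottom is exclusive."""
    if not mask or not mask[0]:
        return []
    width = len(mask[0])
    # Horizontal projection profile: fraction of text pixels in each row.
    profile = [sum(row) / width for row in mask]

    lines: List[Tuple[int, int]] = []
    start = None
    for y, value in enumerate(profile):
        if value >= min_fill and start is None:
            start = y                      # a run of text rows begins
        elif value < min_fill and start is not None:
            if y - start >= min_height:    # keep runs that are tall enough
                lines.append((start, y))
            start = None
    # Close a run that reaches the bottom edge of the region.
    if start is not None and len(mask) - start >= min_height:
        lines.append((start, len(mask)))
    return lines


# Toy region containing two text lines separated by an empty row.
mask = [
    [0, 0, 0, 0, 0, 0],
    [1, 1, 1, 0, 1, 1],
    [1, 1, 1, 1, 1, 1],
    [0, 0, 0, 0, 0, 0],
    [0, 1, 1, 1, 1, 0],
    [1, 1, 1, 1, 1, 1],
    [0, 0, 0, 0, 0, 0],
]
print(split_text_lines(mask))  # [(1, 3), (4, 6)]
```

In a pipeline like the one described, the mask would presumably be derived by thresholding the CNN confidence map; how that threshold is chosen is not stated in the abstract.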
Pages: 675-679
Number of pages: 5