Video Text Detection with Text Edges and Convolutional Neural Network

被引:0
|
作者
Hu, Ping [1 ]
Wang, Weiqiang [1 ]
Lu, Ke [1 ]
机构
[1] Univ Chinese Acad Sci, Beijing, Peoples R China
关键词
SCENE IMAGES; REPRESENTATION;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Text and captions in videos provide useful information for content analysis and understanding. In this paper, we present an approach to detecting video text in a coarse-to-fine strategy. In the coarse phase we propose an efficient method to detect multi-scale candidate text regions with high recall. Then the candidate text regions are segmented and sent to the fine phase where a convolutional neural network(CNN) is applied to generate a confidence map for each candidate text region. Finally, the candidate text regions are further refined and partitioned into text lines by projection analysis. The CNN classifier in the fine phase enables feature sharing and robustly identifies text regions. The coarse phase sharply reduce the number of windows needed to be scanned by the CNN. The combination endows the proposed method with both efficiency and robustness when detecting video text. It was verified by experiment results on two publicly testing datasets and a dataset created by us.
引用
收藏
页码:675 / 679
页数:5
相关论文
共 50 条
  • [1] Text-Attentional Convolutional Neural Network for Scene Text Detection
    He, Tong
    Huang, Weilin
    Qiao, Yu
    Yao, Jian
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2016, 25 (06) : 2529 - 2541
  • [2] VIDEO TEXT DETECTION WITH FULLY CONVOLUTIONAL NETWORK AND TRACKING
    Wang, Yang
    Wang, Lan
    Su, Feng
    Shi, Jiahao
    2019 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2019, : 1738 - 1743
  • [3] Thai Text Detection and Classification Using Convolutional Neural Network
    Malakar, Susanta
    Chiracharit, Werapon
    2020 59TH ANNUAL CONFERENCE OF THE SOCIETY OF INSTRUMENT AND CONTROL ENGINEERS OF JAPAN (SICE), 2020, : 99 - 102
  • [4] Text detection with convolutional neural networks
    Delakis, Manolis
    Garcia, Christophe
    VISAPP 2008: PROCEEDINGS OF THE THIRD INTERNATIONAL CONFERENCE ON COMPUTER VISION THEORY AND APPLICATIONS, VOL 2, 2008, : 290 - 294
  • [5] The Recognition of Chinese Caption Text in News Video Using Convolutional Neural Network
    Zhong, Dixiu
    Shi, Ping
    Pan, Da
    Sha, Yuan
    PROCEEDINGS OF 2016 IEEE ADVANCED INFORMATION MANAGEMENT, COMMUNICATES, ELECTRONIC AND AUTOMATION CONTROL CONFERENCE (IMCEC 2016), 2016, : 658 - 662
  • [6] A Novel Scene Text Detection Algorithm Based On Convolutional Neural Network
    Ren, Xiaohang
    Chen, Kai
    Yang, Xiaokang
    Zhou, Yi
    He, Jianhua
    Sun, Jun
    2016 30TH ANNIVERSARY OF VISUAL COMMUNICATION AND IMAGE PROCESSING (VCIP), 2016,
  • [7] A Deep Convolutional Deblurring and Detection Neural Network for Localizing Text in Videos
    Wang, Yang
    Qian, Ye
    Shi, Jiahao
    Su, Feng
    MULTIMEDIA MODELING (MMM 2020), PT II, 2020, 11962 : 112 - 124
  • [8] Detection of medical text semantic similarity based on convolutional neural network
    Tao Zheng
    Yimei Gao
    Fei Wang
    Chenhao Fan
    Xingzhi Fu
    Mei Li
    Ya Zhang
    Shaodian Zhang
    Handong Ma
    BMC Medical Informatics and Decision Making, 19
  • [9] Live detection of text in the natural environment using Convolutional Neural Network
    Francis, Leena Mary
    Sreenath, N.
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2019, 98 : 444 - 455
  • [10] Detection of medical text semantic similarity based on convolutional neural network
    Zheng, Tao
    Gao, Yimei
    Wang, Fei
    Fan, Chenhao
    Fu, Xingzhi
    Li, Mei
    Zhang, Ya
    Zhang, Shaodian
    Ma, Handong
    BMC MEDICAL INFORMATICS AND DECISION MAKING, 2019, 19 (01)