Video Text Detection with Text Edges and Convolutional Neural Network

被引:0
|
作者
Hu, Ping [1 ]
Wang, Weiqiang [1 ]
Lu, Ke [1 ]
机构
[1] Univ Chinese Acad Sci, Beijing, Peoples R China
关键词
SCENE IMAGES; REPRESENTATION;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Text and captions in videos provide useful information for content analysis and understanding. In this paper, we present an approach to detecting video text in a coarse-to-fine strategy. In the coarse phase we propose an efficient method to detect multi-scale candidate text regions with high recall. Then the candidate text regions are segmented and sent to the fine phase where a convolutional neural network(CNN) is applied to generate a confidence map for each candidate text region. Finally, the candidate text regions are further refined and partitioned into text lines by projection analysis. The CNN classifier in the fine phase enables feature sharing and robustly identifies text regions. The coarse phase sharply reduce the number of windows needed to be scanned by the CNN. The combination endows the proposed method with both efficiency and robustness when detecting video text. It was verified by experiment results on two publicly testing datasets and a dataset created by us.
引用
收藏
页码:675 / 679
页数:5
相关论文
共 50 条
  • [41] Semantic Clustering and Convolutional Neural Network for Short Text Categorization
    Wang, Peng
    Xu, Jiaming
    Xu, Bo
    Liu, Cheng-Lin
    Zhang, Heng
    Wang, Fangyuan
    Hao, Hongwei
    PROCEEDINGS OF THE 53RD ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL) AND THE 7TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (IJCNLP), VOL 2, 2015, : 352 - 357
  • [42] Text Baseline Recognition Using a Recurrent Convolutional Neural Network
    Woedlinger, Matthias
    Sablatnig, Robert
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 4673 - 4679
  • [43] Deep Convolutional Neural Network for Recognizing the Images of Text Documents
    Golovko, Vladimir
    Kroshchanka, Aliaksandr
    Mikhno, Egor
    Komar, Myroslav
    Sachenko, Anatoliy
    Bezobrazov, Sergei
    Shylinska, Inna
    MOMLET&DS-2019: MODERN MACHINE LEARNING TECHNOLOGIES AND DATA SCIENCE, 2019, 2386 : 297 - 306
  • [44] News Text Classification Based on an Improved Convolutional Neural Network
    Tao, Wenjing
    Chang, Dan
    TEHNICKI VJESNIK-TECHNICAL GAZETTE, 2019, 26 (05): : 1400 - 1409
  • [45] Short text sentiment analysis based on convolutional neural network
    Li, Weisen
    Li, Zhiqing
    Fang, Xupeng
    2018 14TH INTERNATIONAL CONFERENCE ON WIRELESS AND MOBILE COMPUTING, NETWORKING AND COMMUNICATIONS (WIMOB 2018), 2018, : 291 - 295
  • [46] Impact of convolutional neural network and FastText embedding on text classification
    Muhammad Umer
    Zainab Imtiaz
    Muhammad Ahmad
    Michele Nappi
    Carlo Medaglia
    Gyu Sang Choi
    Arif Mehmood
    Multimedia Tools and Applications, 2023, 82 : 5569 - 5585
  • [47] Impact of convolutional neural network and FastText embedding on text classification
    Umer, Muhammad
    Imtiaz, Zainab
    Ahmad, Muhammad
    Nappi, Michele
    Medaglia, Carlo
    Choi, Gyu Sang
    Mehmood, Arif
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (04) : 5569 - 5585
  • [48] Application of an Improved Convolutional Neural Network Algorithm in Text Classification
    Peng, Jing
    Huo, Shuquan
    JOURNAL OF WEB ENGINEERING, 2024, 23 (03): : 315 - 340
  • [49] MapReduce-Based Convolutional Neural Network for Text Categorization
    Ferjani, Eman
    Hidri, Adel
    Sassi Hidri, Minyar
    Frihida, Ali
    COMPUTATIONAL COLLECTIVE INTELLIGENCE, PT II, 2019, 11684 : 155 - 166
  • [50] TextConvoNet: a convolutional neural network based architecture for text classification
    Sanskar Soni
    Satyendra Singh Chouhan
    Santosh Singh Rathore
    Applied Intelligence, 2023, 53 : 14249 - 14268