The Recognition of Chinese Caption Text in News Video Using Convolutional Neural Network

被引:0
|
作者
Zhong, Dixiu [1 ]
Shi, Ping [1 ]
Pan, Da [1 ]
Sha, Yuan [1 ]
机构
[1] Commun Univ China, Sch Informat Engn, Beijing, Peoples R China
关键词
News video; Chinese caption text recognition; CNN;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
News video caption, which carries main contents of related news story, plays an important role in content-based video analysis and retrieval system. In this paper, the convolutional neural network (CNN) is used to the recognition of chinese caption text in news video. First, the color and edge feature are used for caption location. Then, the segmentation combined Otsu and K-means clustering algorithm is applied to the caption images before they are sent to CNN. It is worth mentioning that we present a method for generating and labeling training images automatically, which avoids the complex and time consuming data collection. Finally, two CNN models trained on different dataset are evaluated in our experiment. By using the baseline model, the recognition accuracy can achieve 93.3% in top-1 and 98.58% in top-5 on chinese caption texts collected from news video. We also show an improvement to 95% in top-1 accuracy by averaging the two CNN models. Experimental results suggest that CNN is competent to the challenging task of chinese character recognition.
引用
收藏
页码:658 / 662
页数:5
相关论文
共 50 条
  • [1] Caption Detection and Text Recognition in News Video
    Yang, Zhe
    Shi, Ping
    [J]. 2012 5TH INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING (CISP), 2012, : 188 - 191
  • [2] Ligature Recognition in Urdu Caption Text using Deep Convolutional Neural Networks
    Hayat, Umar
    Aatif, Muhammad
    Zeeshan, Osama
    Siddiqi, Imran
    [J]. 2018 14TH INTERNATIONAL CONFERENCE ON EMERGING TECHNOLOGIES (ICET), 2018,
  • [3] A Combined-Convolutional Neural Network for Chinese News Text Classification
    Zhang, Yu
    Liu, Kai-Feng
    Zhang, Quan-Xin
    Wang, Yan-Ge
    Gao, Kai-Long
    [J]. Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2021, 49 (06): : 1059 - 1067
  • [4] Video-Based Chinese Sign Language Recognition Using Convolutional Neural Network
    Yang, Su
    Zhu, Qing
    [J]. 2017 IEEE 9TH INTERNATIONAL CONFERENCE ON COMMUNICATION SOFTWARE AND NETWORKS (ICCSN), 2017, : 929 - 934
  • [5] Recognition of Chinese food using convolutional neural network
    Teng, Jianing
    Zhang, Dong
    Lee, Dah-Jye
    Chou, Yao
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2019, 78 (09) : 11155 - 11172
  • [6] Recognition of Chinese food using convolutional neural network
    Jianing Teng
    Dong Zhang
    Dah-Jye Lee
    Yao Chou
    [J]. Multimedia Tools and Applications, 2019, 78 : 11155 - 11172
  • [7] Improving handwritten Chinese text recognition using neural network language models and convolutional neural network shape models
    Wu, Yi-Chao
    Yin, Fei
    Liu, Cheng-Lin
    [J]. PATTERN RECOGNITION, 2017, 65 : 251 - 264
  • [8] Text Baseline Recognition Using a Recurrent Convolutional Neural Network
    Woedlinger, Matthias
    Sablatnig, Robert
    [J]. 2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 4673 - 4679
  • [9] Video Text Detection with Text Edges and Convolutional Neural Network
    Hu, Ping
    Wang, Weiqiang
    Lu, Ke
    [J]. PROCEEDINGS 3RD IAPR ASIAN CONFERENCE ON PATTERN RECOGNITION ACPR 2015, 2015, : 675 - 679
  • [10] Chinese License Plate Recognition Using a Convolutional Neural Network
    Zhao, Zhihong
    Yang, Shaopu
    Ma, Xinna
    [J]. PACIIA: 2008 PACIFIC-ASIA WORKSHOP ON COMPUTATIONAL INTELLIGENCE AND INDUSTRIAL APPLICATION, VOLS 1-3, PROCEEDINGS, 2008, : 25 - 28