The Recognition of Chinese Caption Text in News Video Using Convolutional Neural Network

Cited by: 0
Authors
Zhong, Dixiu [1]
Shi, Ping [1]
Pan, Da [1]
Sha, Yuan [1]
Affiliations
[1] Communication University of China, School of Information Engineering, Beijing, People's Republic of China
Keywords
News video; Chinese caption text recognition; CNN
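DOI
Not available
Chinese Library Classification number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract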
News video captions, which carry the main content of the related news story, play an important role in content-based video analysis and retrieval systems. In this paper, a convolutional neural network (CNN) is applied to the recognition of Chinese caption text in news video. First, color and edge features are used to locate the captions. Then, a segmentation method combining Otsu thresholding and K-means clustering is applied to the caption images before they are sent to the CNN. Notably, we present a method for generating and labeling training images automatically, which avoids complex and time-consuming data collection. Finally, two CNN models trained on different datasets are evaluated in our experiments. With the baseline model, recognition accuracy reaches 93.3% top-1 and 98.58% top-5 on Chinese caption text collected from news video. Averaging the two CNN models improves top-1 accuracy to 95%. The experimental results suggest that CNNs are well suited to the challenging task of Chinese character recognition.
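The evaluation described in the abstract (top-1 and top-5 accuracy, and averaging two CNN models) can be illustrated with a minimal sketch. The abstract does not say what exactly is averaged; this sketch assumes it is the per-class probability outputs of the two models. The function names `average_ensemble` and `top_k_accuracy` and the toy probabilities are hypothetical, invented for illustration; they are not the authors' code or data.

```python
from typing import List, Sequence


def average_ensemble(
    probs_a: Sequence[Sequence[float]],
    probs_b: Sequence[Sequence[float]],
) -> List[List[float]]:
    """Element-wise mean of two models' class-probability vectors, per sample.

    Assumption: both models score the same samples over the same class set.
    """
    if len(probs_a) != len(probs_b):
        raise ValueError("both models must score the same number of samples")
    return [
        [(a + b) / 2.0 for a, b in zip(row_a, row_b)]
        for row_a, row_b in zip(probs_a, probs_b)
    ]


def top_k_accuracy(
    probs: Sequence[Sequence[float]],
    labels: Sequence[int],
    k: int,
) -> float:
    """Fraction of samples whose true label is among the k highest-scored classes."""
    hits = 0
    for row, label in zip(probs, labels):
        # Class indices sorted from highest to lowest score.
        ranked = sorted(range(len(row)), key=lambda c: row[c], reverse=True)
        if label in ranked[:k]:
            hits += 1
    return hits / len(labels)


# Toy scores for 3 samples over 3 classes (made-up numbers, not from the paper).
model_a = [[0.6, 0.3, 0.1], [0.2, 0.5, 0.3], [0.5, 0.1, 0.4]]
model_b = [[0.4, 0.5, 0.1], [0.1, 0.3, 0.6], [0.2, 0.1, 0.7]]
labels = [0, 2, 2]

ensemble = average_ensemble(model_a, model_b)
print("model A top-1:", top_k_accuracy(model_a, labels, 1))
print("model B top-1:", top_k_accuracy(model_b, labels, 1))
print("ensemble top-1:", top_k_accuracy(ensemble, labels, 1))
```

In this toy case each model misclassifies different samples, so the averaged scores recover the correct class for all three. That is the general intuition behind ensembling models trained on different datasets, though the size of the gain on real data depends on how correlated the models' errors are.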
Pages: 658-662
Page count: 5
Related papers
50 in total
  • [31] Event Recognition of Crowd Video using Corner Optical Flow and Convolutional Neural Network
    Zhang, Weihan
    Hou, Yibin
    Wang, Suyu
    [J]. EIGHTH INTERNATIONAL CONFERENCE ON DIGITAL IMAGE PROCESSING (ICDIP 2016), 2016, 10033
  • [32] Effective multiple person recognition in random video sequences using a convolutional neural network
    Niraimathi Puhalanthi
    Daw-Tung Lin
    [J]. Multimedia Tools and Applications, 2020, 79 : 11125 - 11141
  • [33] Effective multiple person recognition in random video sequences using a convolutional neural network
    Puhalanthi, Niraimathi
    Lin, Daw-Tung
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (15-16) : 11125 - 11141
  • [34] Recognition of Pollen-bearing Bees from Video using Convolutional Neural Network
    Rodriguez, Ivan F.
    Megret, Remi
    Acuna, Edgar
    Agosto-Rivera, Jose L.
    Giray, Tugrul
    [J]. 2018 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2018), 2018 : 314 - 322
  • [35] Fish Recognition Using Convolutional Neural Network
    Ding, Guoqing
    Song, Yan
    Guo, Jia
    Feng, Chen
    Li, Guangliang
    He, Bo
    Yan, Tianhong
    [J]. OCEANS 2017 - ANCHORAGE, 2017
  • [36] Iris Recognition Using Convolutional Neural Network
    Zhuang, Yuan
    Chuah, Joon Huang
    Chow, Chee Onn
    Lim, Marcus Guozong
    [J]. 2020 IEEE 10TH INTERNATIONAL CONFERENCE ON SYSTEM ENGINEERING AND TECHNOLOGY (ICSET), 2020 : 134 - 138
  • [37] Content video browsing based on text regions extraction and classification using Convolutional Neural Network
    Bouaziz, Bassein
    Amara, Jihen
    Mahdi, Walid
    [J]. 2017 IEEE/ACS 14TH INTERNATIONAL CONFERENCE ON COMPUTER SYSTEMS AND APPLICATIONS (AICCSA), 2017 : 730 - 737
  • [38] Emotion Recognition Using a Convolutional Neural Network
    Zatarain-Cabada, Ramon
    Lucia Barron-Estrada, Maria
    Gonzalez-Hernandez, Francisco
    Rodriguez-Rangel, Hector
    [J]. ADVANCES IN COMPUTATIONAL INTELLIGENCE, MICAI 2017, PT II, 2018, 10633 : 208 - 219
  • [39] Gait Recognition Using Convolutional Neural Network
    Sheth, Abhishek
    Sharath, Meghana
    Reddy, Sai Charan
    Sindhu, K.
    [J]. INTERNATIONAL JOURNAL OF ONLINE AND BIOMEDICAL ENGINEERING, 2023, 19 (01) : 107 - 118
  • [40] Text image refocusing by using the convolutional neural network
    Wang, Kangkang
    Wang, Keyan
    Li, Yunsong
    [J]. Xi'an Dianzi Keji Daxue Xuebao/Journal of Xidian University, 2018, 45 (04) : 80 - 85