Caption analysis and recognition for building video indexing systems

被引:13
|
作者
Chang, F [1 ]
Chen, GC
Lin, CC
Lin, WH
机构
[1] Acad Sinica, Inst Informat Sci, Taipei, Taiwan
[2] Natl Taipei Univ technol, Dept Elect Engn, Taipei, Taiwan
关键词
background removal; caption tracking; character recognition; support vector machines; prototype classification;
D O I
10.1007/s00530-004-0159-y
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we propose several methods for analyzing and recognizing Chinese video captions, which constitute a very useful information source for video content. Image binarization, performed by combining a global threshold method and a window-based method, is used to obtain clearer images of characters, and a caption-tracking scheme is used to locate caption regions and detect caption changes. The separation of characters from possibly complex backgrounds is achieved by using size and color constraints and by cross examination of multiframe images. To segment individual characters, we use a dynamic split-and-merge strategy. Finally, we propose a character recognition process using a prototype classification method, supplemented by a disambiguation process using support vector machines, to improve recognition outcomes. This is followed by a postprocess that integrates multiple recognition results. The overall accuracy rate for the entire process applied to test video films is 94.11%.
引用
收藏
页码:344 / 355
页数:12
相关论文
共 50 条
  • [31] VADIS: A video analysis, display and indexing system
    Gargi, U
    Antani, S
    Kasturi, R
    1998 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, PROCEEDINGS, 1998, : 965 - 965
  • [32] Video OCR: indexing digital news libraries by recognition of superimposed captions
    Sato, T
    Kanade, T
    Hughes, EK
    Smith, MA
    Satoh, S
    MULTIMEDIA SYSTEMS, 1999, 7 (05) : 385 - 395
  • [33] Recognition and visual feature matching of text region in video for conceptual indexing
    Kurakake, S
    Kuwano, H
    Odaka, K
    STORAGE AND RETRIEVAL FOR IMAGE AND VIDEO DATABASES V, 1997, 3022 : 368 - 379
  • [34] Video OCR: indexing digital news libraries by recognition of superimposed captions
    Toshio Sato
    Takeo Kanade
    Ellen K. Hughes
    Michael A. Smith
    Shin'ichi Satoh
    Multimedia Systems, 1999, 7 : 385 - 395
  • [35] Automatic caption localization in compressed video
    Zhong, Y
    Zhang, HJ
    Jain, AK
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2000, 22 (04) : 385 - 392
  • [36] A Survey On Video Caption Extraction Technology
    Wang, Zhujun
    Yang, Lei
    Wu, Xiaoyu
    Zhang, Ying
    2012 FOURTH INTERNATIONAL CONFERENCE ON MULTIMEDIA INFORMATION NETWORKING AND SECURITY (MINES 2012), 2012, : 713 - 716
  • [37] A Multimodal Framework for Video Caption Generation
    Bhooshan, Reshmi S.
    Suresh, K.
    IEEE ACCESS, 2022, 10 : 92166 - 92176
  • [38] A Multimodal Framework for Video Caption Generation
    Bhooshan, Reshmi S.
    Suresh, K.
    IEEE Access, 2022, 10 : 92166 - 92176
  • [39] A Method of Caption Detection in News Video
    Huang, He
    Shi, Ping
    PROCEEDINGS OF 3RD INTERNATIONAL CONFERENCE ON MULTIMEDIA TECHNOLOGY (ICMT-13), 2013, 84 : 502 - 509
  • [40] A video browser based on closed caption
    Lee, Janghwan
    IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 2006, 52 (03) : 1124 - 1128