Caption analysis and recognition for building video indexing systems

被引:13
|
作者
Chang, F [1 ]
Chen, GC
Lin, CC
Lin, WH
机构
[1] Acad Sinica, Inst Informat Sci, Taipei, Taiwan
[2] Natl Taipei Univ technol, Dept Elect Engn, Taipei, Taiwan
关键词
background removal; caption tracking; character recognition; support vector machines; prototype classification;
D O I
10.1007/s00530-004-0159-y
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we propose several methods for analyzing and recognizing Chinese video captions, which constitute a very useful information source for video content. Image binarization, performed by combining a global threshold method and a window-based method, is used to obtain clearer images of characters, and a caption-tracking scheme is used to locate caption regions and detect caption changes. The separation of characters from possibly complex backgrounds is achieved by using size and color constraints and by cross examination of multiframe images. To segment individual characters, we use a dynamic split-and-merge strategy. Finally, we propose a character recognition process using a prototype classification method, supplemented by a disambiguation process using support vector machines, to improve recognition outcomes. This is followed by a postprocess that integrates multiple recognition results. The overall accuracy rate for the entire process applied to test video films is 94.11%.
引用
收藏
页码:344 / 355
页数:12
相关论文
共 50 条
  • [21] Video Caption Duration Extraction
    Bai, Hongliang
    Sun, Jun
    Naoi, Satoshi
    Katsuyama, Yutaka
    Hotta, Yoshinobu
    Fujimoto, Katsuhito
    19TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOLS 1-6, 2008, : 1332 - +
  • [22] An automatic face detection and recognition system for video indexing applications
    Acosta, E
    Torres, L
    Albiol, A
    Delp, E
    2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 3644 - 3647
  • [23] Face indexing on video data - Extraction, recognition, tracking and modeling
    Ariki, Y
    Sugiyama, Y
    Ishikawa, N
    AUTOMATIC FACE AND GESTURE RECOGNITION - THIRD IEEE INTERNATIONAL CONFERENCE PROCEEDINGS, 1998, : 62 - 69
  • [24] An indexing-based approach to pattern and video clip recognition
    Mikhailov, A. M.
    AUTOMATION AND REMOTE CONTROL, 2014, 75 (12) : 2201 - 2211
  • [25] An indexing-based approach to pattern and video clip recognition
    A. M. Mikhailov
    Automation and Remote Control, 2014, 75 : 2201 - 2211
  • [26] Hybrid approach of video indexing and machine learning for rapid indexing and highly precise object recognition
    Tsutsumi, F
    Nakajima, C
    2001 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOL II, PROCEEDINGS, 2001, : 645 - 648
  • [27] A Space Information-Enhanced Dense Video Caption for Indoor Human Action Recognition
    Chen, Bin
    Nakamura, Yugo
    Fukushima, Shogo
    Arakawa, Yutaka
    2024 8TH INTERNATIONAL CONFERENCE ON ROBOTICS, CONTROL AND AUTOMATION, ICRCA 2024, 2024, : 423 - 427
  • [28] A novel error measure for the evaluation of video indexing systems
    Eickeler, S
    Rigoll, G
    2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS, VOLS I-VI, 2000, : 1991 - 1994
  • [29] Combining hierarchical classifiers with video semantic indexing systems
    Zhou, WS
    Dao, SK
    ADVANCES IN MUTLIMEDIA INFORMATION PROCESSING - PCM 2001, PROCEEDINGS, 2001, 2195 : 78 - 85
  • [30] A NOVEL AUDIOVISUAL ANALYSIS FOR NEWS VIDEO INDEXING
    Huang Yubin
    Dong Yuan
    Dong Chengyu
    Wang Haila
    PROCEEDINGS OF 2009 2ND IEEE INTERNATIONAL CONFERENCE ON BROADBAND NETWORK & MULTIMEDIA TECHNOLOGY, 2009, : 486 - 490