Caption analysis and recognition for building video indexing systems

Cited by: 13
Authors
Chang, F [1]
Chen, GC
Lin, CC
Lin, WH
Affiliations
[1] Acad Sinica, Inst Informat Sci, Taipei, Taiwan
[2] Natl Taipei Univ Technol, Dept Elect Engn, Taipei, Taiwan
Keywords
background removal; caption tracking; character recognition; support vector machines; prototype classification
DOI
10.1007/s00530-004-0159-y
Chinese Library Classification
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
In this paper, we propose several methods for analyzing and recognizing Chinese video captions, which constitute a very useful information source for video content. Image binarization, performed by combining a global threshold method and a window-based method, is used to obtain clearer images of characters, and a caption-tracking scheme is used to locate caption regions and detect caption changes. The separation of characters from possibly complex backgrounds is achieved by using size and color constraints and by cross examination of multiframe images. To segment individual characters, we use a dynamic split-and-merge strategy. Finally, we propose a character recognition process using a prototype classification method, supplemented by a disambiguation process using support vector machines, to improve recognition outcomes. This is followed by a postprocess that integrates multiple recognition results. The overall accuracy rate for the entire process applied to test video films is 94.11%.
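The abstract says binarization combines a global threshold method with a window-based method but does not say how the two are combined. The following is a minimal sketch of one possible combination, not the authors' procedure. It assumes bright caption text on a darker background, uses the image mean as a stand-in for the global threshold, and accepts a pixel only if it exceeds both the global threshold and the mean of its local window. The function names and the `half` and `offset` parameters are hypothetical.

```python
def global_threshold(img):
    """Mean intensity of the whole image (stand-in for any global method)."""
    pixels = [p for row in img for p in row]
    return sum(pixels) / len(pixels)


def local_mean(img, r, c, half):
    """Mean intensity of the window centred on (r, c), clipped at the borders."""
    h, w = len(img), len(img[0])
    rows = range(max(0, r - half), min(h, r + half + 1))
    cols = range(max(0, c - half), min(w, c + half + 1))
    vals = [img[i][j] for i in rows for j in cols]
    return sum(vals) / len(vals)


def binarize(img, half=1, offset=0):
    """Mark a pixel as text (1) only if it passes both tests.

    Assumption: text is brighter than the background. A pixel must be
    strictly above the global threshold and at least `offset` above the
    mean of its (2*half+1)-wide window.
    """
    g = global_threshold(img)
    return [
        [
            1 if p > g and p >= local_mean(img, r, c, half) + offset else 0
            for c, p in enumerate(row)
        ]
        for r, row in enumerate(img)
    ]


if __name__ == "__main__":
    frame = [
        [10, 10, 10, 10],
        [10, 200, 200, 10],
        [10, 10, 10, 10],
    ]
    for row in binarize(frame):
        print(row)
```

Requiring both tests is only one design choice: the global test rejects dim regions outright, while the window test adapts to local brightness changes. Other combinations, such as using the global result to seed the local one, fit the abstract's description equally well.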
Pages: 344-355
Page count: 12