Caption analysis and recognition for building video indexing systems

Cited by: 13

Authors:

Chang, F [1]

Chen, GC

Lin, CC

Lin, WH

Affiliations:

[1] Acad Sinica, Inst Informat Sci, Taipei, Taiwan

[2] Natl Taipei Univ Technol, Dept Elect Engn, Taipei, Taiwan

Source:

MULTIMEDIA SYSTEMS | 2005 / Vol. 10 / No. 04

Keywords:

background removal; caption tracking; character recognition; support vector machines; prototype classification;

DOI:

10.1007/s00530-004-0159-y

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology]

Discipline classification code:

0812

Abstract:

In this paper, we propose several methods for analyzing and recognizing Chinese video captions, which constitute a very useful information source for video content. Image binarization, performed by combining a global threshold method and a window-based method, is used to obtain clearer images of characters, and a caption-tracking scheme is used to locate caption regions and detect caption changes. The separation of characters from possibly complex backgrounds is achieved by using size and color constraints and by cross examination of multiframe images. To segment individual characters, we use a dynamic split-and-merge strategy. Finally, we propose a character recognition process using a prototype classification method, supplemented by a disambiguation process using support vector machines, to improve recognition outcomes. This is followed by a postprocess that integrates multiple recognition results. The overall accuracy rate for the entire process applied to test video films is 94.11%.
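The abstract's binarization step combines a global threshold with a window-based one. The following is a minimal sketch of that general idea, not the authors' implementation. It assumes bright caption pixels on a darker background, uses the frame mean as the global threshold and the mean of a small neighborhood as the window threshold, and keeps a pixel only when it passes both tests. The function name `binarize` and the `radius` parameter are hypothetical.

```python
def binarize(image, radius=1):
    """Mark a pixel as foreground (1) only if it passes both a global and a local test.

    `image` is a list of rows of grayscale intensities. The choice of mean-based
    thresholds is an assumption for illustration; the paper's abstract does not
    specify how the two thresholds are computed or combined.
    """
    height, width = len(image), len(image[0])
    # Global threshold: mean intensity of the whole frame (assumed).
    global_t = sum(sum(row) for row in image) / (height * width)
    result = []
    for y in range(height):
        out_row = []
        for x in range(width):
            # Local threshold: mean intensity in a window clipped to the image borders.
            window = [
                image[j][i]
                for j in range(max(0, y - radius), min(height, y + radius + 1))
                for i in range(max(0, x - radius), min(width, x + radius + 1))
            ]
            local_t = sum(window) / len(window)
            pixel = image[y][x]
            # Keep the pixel only if it is bright both globally and locally.
            out_row.append(1 if pixel > global_t and pixel >= local_t else 0)
        result.append(out_row)
    return result


frame = [
    [10, 10, 10],
    [10, 200, 10],
    [10, 10, 10],
]
mask = binarize(frame)
```

Requiring both tests is one plausible way to combine the two methods: the global test suppresses dim regions, while the local test suppresses pixels that are no brighter than their immediate surroundings.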


Pages: 344-355

Page count: 12

