Story segmentation and detection of commercials in broadcast news video

被引:29
|
作者
Hauptmann, AG [1 ]
Witbrock, MJ [1 ]
机构
[1] Carnegie Mellon Univ, Dept Comp Sci, Pittsburgh, PA 15213 USA
关键词
segmentation; video processing; broadcast news story analysis; closed captioning; digital library; video library creation; speech recognition;
D O I
10.1109/ADL.1998.670392
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The Informedia Digital Library Project [Wactlar96] allows full content indexing and retrieval of text, audio and video material. Segmentation is an integral process in the Informedia digital video library. The success of the Informedia project hinges on two critical assumptions: that we can extract sufficiently accurate speech recognition transcripts from the broadcast audio and that we can segment the broadcast into video paragraphs, or stories, that are useful for information retrieval. In previous papers [Hauptmann97, Witbrock97, Witbrock98], we have shown that speech recognition is sufficient for information retrieval of pre-segmented video news stories. In this paper we address the issue of segmentation and demonstrate that a fully automatic system can extract story boundaries using available audio, video and closed-captioning cues. The story segmentation step for the Informedia Digital Video Library splits full-length news broadcasts into individual news stories. During this phase the system also labels commercials as separate "stories". We explain how the Informedia system takes advantage of the closed captioning frequently broadcast with the news, how it extracts timing information by aligning the closed-captions with the result of the speech recognition, and how the system integrates closed-caption cues with the results of image and audio processing.
引用
收藏
页码:168 / 179
页数:12
相关论文
共 50 条
  • [21] Using Multimodal Analysis for Story Segmentation of News Video
    Liu Hua-Yong
    He Tingting
    FIRST IITA INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2009, : 124 - 127
  • [22] Content-based news video story segmentation and video retrieval
    Liu, HY
    Zhou, DR
    SECOND INTERNATION CONFERENCE ON IMAGE AND GRAPHICS, PTS 1 AND 2, 2002, 4875 : 1038 - 1044
  • [23] Subword Lexical Chaining for Automatic Story Segmentation in Chinese Broadcast News
    Xie, Lei
    Yang, Yulian
    Zeng, Jia
    ADVANCES IN MULTIMEDIA INFORMATION PROCESSING - PCM 2008, 9TH PACIFIC RIM CONFERENCE ON MULTIMEDIA, 2008, 5353 : 248 - +
  • [24] Broadcast news story segmentation using sticky hierarchical dirichlet process
    Jia Yu
    Hongxiang Shao
    Applied Intelligence, 2022, 52 : 12788 - 12800
  • [25] Broadcast news story segmentation using sticky hierarchical dirichlet process
    Yu, Jia
    Shao, Hongxiang
    APPLIED INTELLIGENCE, 2022, 52 (11) : 12788 - 12800
  • [26] BROADCAST NEWS STORY SEGMENTATION USING LATENT TOPICS ON DATA MANIFOLD
    Lu, Xiaoming
    Leung, Cheung-Chi
    Xie, Lei
    Ma, Bin
    Li, Haizhou
    2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 8465 - 8469
  • [27] Modeling Latent Topics and Temporal Distance for Story Segmentation of Broadcast News
    Chen, Hongjie
    Xie, Lei
    Leung, Cheung-Chi
    Lu, Xiaoming
    Ma, Bin
    Li, Haizhou
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2017, 25 (01) : 112 - 123
  • [28] Unsupervised video-shot segmentation and model-free, anchorperson detection for news video story parsing
    Gao, XB
    Tang, X
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2002, 12 (09) : 765 - 776
  • [29] Player detection, tracking and segmentation in broadcast tennis video
    Hsieh, C.-H., 1600, Chung Cheng Institute of Technology (43):
  • [30] news video story segmentation silence clip shot detection audio-visual fusion
    Song, Yu
    Wang, Wenhong
    Guo, Fengjuan
    ICCSSE 2009: PROCEEDINGS OF 2009 4TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE & EDUCATION, 2009, : 1065 - +