Story segmentation and detection of commercials in broadcast news video

被引:29
|
作者
Hauptmann, AG [1 ]
Witbrock, MJ [1 ]
机构
[1] Carnegie Mellon Univ, Dept Comp Sci, Pittsburgh, PA 15213 USA
关键词
segmentation; video processing; broadcast news story analysis; closed captioning; digital library; video library creation; speech recognition;
D O I
10.1109/ADL.1998.670392
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The Informedia Digital Library Project [Wactlar96] allows full content indexing and retrieval of text, audio and video material. Segmentation is an integral process in the Informedia digital video library. The success of the Informedia project hinges on two critical assumptions: that we can extract sufficiently accurate speech recognition transcripts from the broadcast audio and that we can segment the broadcast into video paragraphs, or stories, that are useful for information retrieval. In previous papers [Hauptmann97, Witbrock97, Witbrock98], we have shown that speech recognition is sufficient for information retrieval of pre-segmented video news stories. In this paper we address the issue of segmentation and demonstrate that a fully automatic system can extract story boundaries using available audio, video and closed-captioning cues. The story segmentation step for the Informedia Digital Video Library splits full-length news broadcasts into individual news stories. During this phase the system also labels commercials as separate "stories". We explain how the Informedia system takes advantage of the closed captioning frequently broadcast with the news, how it extracts timing information by aligning the closed-captions with the result of the speech recognition, and how the system integrates closed-caption cues with the results of image and audio processing.
引用
收藏
页码:168 / 179
页数:12
相关论文
共 50 条
  • [31] A multi-modal approach to story segmentation for news video
    Chaisorn, L
    Chua, TS
    Lee, CH
    WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS, 2003, 6 (02): : 187 - 208
  • [32] A Multi-Modal Approach to Story Segmentation for News Video
    Lekha Chaisorn
    Tat-Seng Chua
    Chin-Hui Lee
    World Wide Web, 2003, 6 : 187 - 208
  • [33] Caption-based news video story segmentation and retrieval
    Institute of Information Technology, Beijing Forestry University, Beijing 100081, China
    不详
    J. Inf. Comput. Sci., 2008, 2 (613-619):
  • [34] On the effectiveness of subwords for lexical cohesion based story segmentation of Chinese broadcast news
    Xie, L.
    Yang, Y. -L.
    Liu, Z. -Q.
    INFORMATION SCIENCES, 2011, 181 (13) : 2873 - 2891
  • [35] Multi-scale TextTiling for automatic story segmentation in Chinese broadcast news
    Xie, Lei
    Zeng, Jia
    Feng, Wei
    INFORMATION RETRIEVAL TECHNOLOGY, 2008, 4993 : 345 - +
  • [36] Broadcast News Story Segmentation Using Conditional Random Fields and Multimodal Features
    Wang, Xiaoxuan
    Xie, Lei
    Lu, Mimi
    Ma, Bin
    Chng, Eng Siong
    Li, Haizhou
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2012, E95D (05) : 1206 - 1215
  • [37] A Subword Normalized Cut Approach to Automatic Story Segmentation of Chinese Broadcast News
    Zhang, Jin
    Xie, Lei
    Feng, Wei
    Zhang, Yanning
    INFORMATION RETRIEVAL TECHNOLOGY, PROCEEDINGS, 2009, 5839 : 136 - +
  • [38] Only overlay text: novel features for TV news broadcast video segmentation
    Raghvendra Kannao
    Prithwijit Guha
    Bidyut B. Chaudhuri
    Multimedia Tools and Applications, 2022, 81 : 30493 - 30517
  • [39] Only overlay text: novel features for TV news broadcast video segmentation
    Kannao, Raghvendra
    Guha, Prithwijit
    Chaudhuri, Bidyut B.
    MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (21) : 30493 - 30517
  • [40] Automatic Story Segmentation for TV News Video Using Multiple Modalities
    Dumont, Emilie
    Quenot, Georges
    INTERNATIONAL JOURNAL OF DIGITAL MULTIMEDIA BROADCASTING, 2012, 2012