Video scene segmentation using video and audio features

被引:0
|
作者
Sundaram, H [1 ]
Chang, SF [1 ]
机构
[1] Columbia Univ, Dept Elect Engn, New York, NY 10027 USA
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper we present a novel algorithm for video scene segmentation. We model a scene as a semantically consistent chunk of audio-visual data. Central to the segmentation framework is the idea of a finite-memory model. We separately segment the audio and video data into scenes, using data in the memory. The audio segmentation algorithm determines the correlations amongst the envelopes of audio features. The video segmentation algorithm determines the correlations amongst shot key-frames. The scene boundaries in both cases are determined using local correlation minima. Then, we fuse the resulting segments using a nearest neighbor algorithm that is further refined using a time-alignment distribution derived from the ground truth. The algorithm was tested on a difficult data set; the first hour of a commercial film with good results. It achieves a scene segmentation accuracy of 84%.
引用
收藏
页码:1145 / 1148
页数:4
相关论文
共 50 条
  • [1] Integration of audio and video semantic features for news video scene segmentation
    Xu, J
    Liu, HB
    Zhou, DR
    [J]. VISUALIZATION AND OPTIMIZATION TECHNIQUES, 2001, 4553 : 227 - 232
  • [2] Audio scene segmentation for video with generic content
    Niu, Feng
    Goela, Naveen
    Divakaran, Ajay
    Abdel-Mottaleb, Mohamed
    [J]. MULTIMEDIA CONTENT ACCESS: ALGORITHMS AND SYSTEMS II, 2008, 6820
  • [3] Indoor/Outdoor scene classification using audio and video features
    Lopes, Jose
    Singh, Sameer
    [J]. PROGRESS IN PATTERN RECOGNITION, 2007, : 232 - +
  • [4] Scene Determination Based on Video and Audio Features
    Silvia Pfeiffer
    Rainer Lienhart
    Wolfgang Efflsberg
    [J]. Multimedia Tools and Applications, 2001, 15 : 59 - 81
  • [5] Scene determination based on video and audio features
    Lienhart, R
    Pfeiffer, S
    Effelsberg, W
    [J]. IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA COMPUTING AND SYSTEMS, PROCEEDINGS VOL 1, 1999, : 685 - 690
  • [6] Scene determination based on video and audio features
    Pfeiffer, S
    Lienhart, R
    Efflsberg, W
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2001, 15 (01) : 59 - 81
  • [7] A hidden Markov model framework for video segmentation using audio and image features
    Boreczky, JS
    Wilcox, LD
    [J]. PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-6, 1998, : 3741 - 3744
  • [8] Automatic segmentation of news items based on video and audio features
    Wang, WQ
    Gao, W
    [J]. ADVANCES IN MUTLIMEDIA INFORMATION PROCESSING - PCM 2001, PROCEEDINGS, 2001, 2195 : 498 - 505
  • [9] Automatic segmentation of news items based on video and audio features
    Weiqiang Wang
    Wen Gao
    [J]. Journal of Computer Science and Technology, 2002, 17 : 189 - 195
  • [10] Automatic segmentation of news items based on video and audio features
    Wang, WQ
    Gao, W
    [J]. JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2002, 17 (02) : 189 - 195