Audio Feature Extraction and Analysis for Scene Segmentation and Classification

Authors
Zhu Liu
Yao Wang
Tsuhan Chen
Affiliations
[1] Polytechnic University
[2] Carnegie Mellon University
Keywords
Audio Signal; Audio Feature; Scene Change; Football Game; Audio Clip
DOI
Not available
Abstract
Understanding the scene content of a video sequence is very important for content-based indexing and retrieval of multimedia databases. Research in this area over the past several years has focused on the use of speech recognition and image analysis techniques. As a complementary effort to the prior work, we have focused on using the associated audio information (mainly the nonspeech portion) for video scene analysis. As an example, we consider the problem of discriminating five types of TV programs, namely commercials, basketball games, football games, news reports, and weather forecasts. A set of low-level audio features is proposed for characterizing the semantic content of short audio clips. The linear separability of different classes under the proposed feature space is examined using a clustering analysis. The effective features are identified by evaluating the intracluster and intercluster scattering matrices of the feature space. Using these features, a neural net classifier was successful in separating the above five types of TV programs. By evaluating the changes between the feature vectors of adjacent clips, we can also identify scene breaks in an audio sequence quite accurately. These results demonstrate the capability of the proposed audio features for characterizing the semantic content of an audio sequence.
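The two analysis steps named in the abstract (ranking features by intracluster versus intercluster scatter, and flagging scene breaks from changes between adjacent clips) can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the use of only the diagonal of the scatter matrices, the Euclidean distance, and the fixed threshold are all assumptions made for illustration, and the feature values are made up.

```python
from math import sqrt


def feature_scatter_ratios(clips_by_class):
    """Per feature, return between-class scatter divided by within-class scatter.

    clips_by_class maps a class label to a list of equal-length feature vectors.
    Assumption: only the diagonals of the scatter matrices are used, so each
    feature is scored independently. A larger ratio suggests a more useful feature.
    """
    all_clips = [v for clips in clips_by_class.values() for v in clips]
    dim = len(all_clips[0])
    global_mean = [sum(v[d] for v in all_clips) / len(all_clips) for d in range(dim)]

    within = [0.0] * dim
    between = [0.0] * dim
    for clips in clips_by_class.values():
        n = len(clips)
        mean = [sum(v[d] for v in clips) / n for d in range(dim)]
        for d in range(dim):
            # Spread of the clips around their own class mean.
            within[d] += sum((v[d] - mean[d]) ** 2 for v in clips)
            # Spread of the class mean around the global mean, weighted by class size.
            between[d] += n * (mean[d] - global_mean[d]) ** 2

    return [b / w if w > 0 else float("inf") for b, w in zip(between, within)]


def scene_breaks(clip_features, threshold):
    """Return indices i where clip i differs strongly from clip i - 1.

    Assumption: the change is measured as Euclidean distance between adjacent
    feature vectors and compared with a hand-picked threshold.
    """
    breaks = []
    for i in range(1, len(clip_features)):
        prev, cur = clip_features[i - 1], clip_features[i]
        dist = sqrt(sum((a - b) ** 2 for a, b in zip(prev, cur)))
        if dist > threshold:
            breaks.append(i)
    return breaks


# Hypothetical two-feature clips: feature 0 separates the classes, feature 1 does not.
example_classes = {
    "news": [[1.0, 5.0], [1.2, 1.0]],
    "sports": [[3.0, 4.9], [3.2, 1.1]],
}
ratios = feature_scatter_ratios(example_classes)

# Hypothetical clip sequence with one abrupt change between clips 1 and 2.
example_sequence = [[0.0, 0.0], [0.1, 0.0], [5.0, 5.0], [5.1, 5.0]]
breaks = scene_breaks(example_sequence, threshold=1.0)
```

In this toy data the first feature scores far higher than the second, and the only break is reported at index 2. In practice the threshold and the distance measure would need tuning against labeled material.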
Pages: 61-79
Page count: 18
Related papers
  • [1] Audio feature extraction and analysis for scene segmentation and classification
    Liu, Z
    Wang, Y
    Chen, TH
    [J]. JOURNAL OF VLSI SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 1998, 20 (1-2): 61-79
  • [2] Audio feature extraction and analysis for scene segmentation and classification
    Polytechnic Univ, Brooklyn, United States
    [J]. J VLSI Signal Process Syst Signal Image Video Technol, 1-2 (61-79):
  • [3] Audio feature extraction & analysis for scene classification
    Liu, Z
    Huang, JC
    Wang, Y
    Chen, TH
    [J]. 1997 IEEE FIRST WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING, 1997: 343-348
  • [4] Feature analysis and extraction for audio automatic classification
    Liang, B
    Hu, YL
    Lao, SY
    Chen, JY
    Wu, LD
    [J]. INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS, VOL 1-4, PROCEEDINGS, 2005: 767-772
  • [5] LARGE-SCALE AUDIO FEATURE EXTRACTION AND SVM FOR ACOUSTIC SCENE CLASSIFICATION
    Geiger, Juergen T.
    Schuller, Bjoern
    Rigoll, Gerhard
    [J]. 2013 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS (WASPAA), 2013
  • [6] Deep Feature Embedding and Hierarchical Classification for Audio Scene Classification
    Pham, Lam
    McLoughlin, Ian
    Phan, Huy
    Palaniappan, R.
    Merlins, Alfred
    [J]. 2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020
  • [7] Feature extraction, image segmentation, and scene reconstruction
    Lester, ED
    Whitaker, RT
    Abidi, MA
    [J]. SENSOR FUSION AND DECENTRALIZED CONTROL IN AUTONOMOUS ROBOTIC SYSTEMS, 1997, 3209: 250-260
  • [8] Feature Analysis for Audio Classification
    Bengolea, Gaston
    Acevedo, Daniel
    Rais, Martin
    Mejail, Marta
    [J]. PROGRESS IN PATTERN RECOGNITION IMAGE ANALYSIS, COMPUTER VISION, AND APPLICATIONS, CIARP 2014, 2014, 8827: 239-246
  • [9] Audio feature extraction for effective emotion classification
    Han, Euihwan
    Cha, Hyungtai
    [J]. IEIE Transactions on Smart Processing and Computing, 2019, 8 (02): 100-107
  • [10] Audio signal segmentation and classification for scene-cut detection
    Nitanda, N
    Haseyama, M
    Kitajima, H
    [J]. 2005 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), VOLS 1-6, CONFERENCE PROCEEDINGS, 2005: 4030-4033