Improvement to speech-music discrimination using sinusoidal model based features

被引:0
|
作者
Jalil Shirazi
Shahrokh Ghaemmaghami
机构
[1] Islamic Azad University,Science & Research Branch
[2] Sharif University of Technology,undefined
来源
关键词
Audio classification; Sinusoidal model;
D O I
暂无
中图分类号
学科分类号
摘要
This paper addresses a model-based audio content analysis for classification of speech-music mixed audio signals into speech and music. A set of new features is presented and evaluated based on sinusoidal modeling of audio signals. The new feature set, including variance of the birth frequencies and duration of the longest frequency track in sinusoidal model, as a measure of the harmony and signal continuity, is introduced and discussed in detail. These features are used and compared to typical features as inputs to an audio classifier. Performance of these sinusoidal model features is evaluated through classification of audio into speech and music using both the GMM (Gaussian Mixture Model) and the SVM (Support Vector Machine) classifiers. Experimental results show that the proposed features are quite successful in speech/music discrimination. By using only a set of two sinusoidal model features, extracted from 1-s segments of the signal, we achieved 96.84% accuracy in the audio classification. Experimental comparisons also confirm superiority of the sinusoidal model features to the popular time domain and frequency domain features in audio classification.
引用
收藏
页码:415 / 435
页数:20
相关论文
共 50 条
  • [1] Improvement to speech-music discrimination using sinusoidal model based features
    Shirazi, Jalil
    Ghaemmaghami, Shahrokh
    MULTIMEDIA TOOLS AND APPLICATIONS, 2010, 50 (02) : 415 - 435
  • [2] A speech-music discriminator using HILN model based features
    Thoshkahna, Balaji
    Sudha, V
    Ramakrishnan, K. R.
    2006 IEEE International Conference on Acoustics, Speech and Signal Processing, Vols 1-13, 2006, : 5283 - 5286
  • [3] Speech-music discrimination using deep visual feature extractors
    Papakostas, Michalis
    Giannakopoulos, Theodoros
    EXPERT SYSTEMS WITH APPLICATIONS, 2018, 114 : 334 - 344
  • [4] SPEECH-MUSIC DISCRIMINATION: A DEEP LEARNING PERSPECTIVE
    Pikrakis, Aggelos
    Theodoridis, Sergios
    2014 PROCEEDINGS OF THE 22ND EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2014, : 616 - 620
  • [5] An RNN-Based Speech-Music Discrimination Used for Hybrid Audio Coder
    Yang, Wanzhao
    Tu, Weiping
    Zheng, Jiaxi
    Zhang, Xiong
    Yang, Yuhong
    Song, Yucheng
    MULTIMEDIA MODELING, MMM 2018, PT I, 2018, 10704 : 81 - 92
  • [6] Rhythm detection for speech-music discrimination in MPEG compressed domain
    Jarina, R
    O'Connor, N
    Marlow, S
    Murphy, N
    DSP 2002: 14TH INTERNATIONAL CONFERENCE ON DIGITAL SIGNAL PROCESSING PROCEEDINGS, VOLS 1 AND 2, 2002, : 129 - 132
  • [7] Pitch estimation using music algorithm based on the sinusoidal speech model
    Amirkabir University of Technology, Electrical Engineering Department, Hafez Avenue, 15914 Tehran, Iran
    不详
    Advances in Communications and Software Technologies, 2002, : 255 - 258
  • [8] Speech-Music Classification Model Based on Improved Neural Network and Beat Spectrum
    Huang, Chun
    Wei, HeFu
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2023, 14 (07) : 52 - 64
  • [9] MUSIC TONALITY FEATURES FOR SPEECH/MUSIC DISCRIMINATION
    Sell, Gregory
    Clark, Pascal
    2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [10] From close listening to distant listening: Developing tools for Speech-Music discrimination of Danish music radio
    Have, Iben
    Enevoldsen, Kenneth
    DIGITAL HUMANITIES QUARTERLY, 2021, 15 (01):