Speech/Music Discrimination Based on Discrete Wavelet Transform

被引:0
|
作者
Ntalampiras, Stavros [1 ]
Fakotakis, Nikos [1 ]
机构
[1] Univ Patras, Wire Commun Lab, Dept Elect & Comp Engn, Rion 26500, Greece
关键词
Computer audition; content-based audio classification; discrete wavelet transform; Gaussian mixture model;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper we present an effective approach which addresses the issue of speech/music discrimination. Our architecture focuses on the matter from the scope of improving the performance of a speech recognition system by excluding the processing of information which is not speech. Multiresolution analysis is applied to the input signal while the most significant statistical features are calculated over a predefined texture size. These characteristics are then modeled using a state of the art technique for probability density function estimation, Gaussian mixture models (GMM). A classification scheme consisting of a conventional maximum likelihood decision methodology constitutes the next step of our implementation. Despite the fact that our system is based solely on wavelet signal processing, it demonstrated very good performance achieving 91.8% recognition rate.
引用
收藏
页码:205 / 211
页数:7
相关论文
共 50 条
  • [1] A wavelet-based parameterization for speech/music discrimination
    Didiot, E.
    Illina, I.
    Fohr, D.
    Mella, O.
    [J]. COMPUTER SPEECH AND LANGUAGE, 2010, 24 (02): : 341 - 357
  • [2] Psychoacoustic Music Analysis Based on the Discrete Wavelet Packet Transform
    He, Xing
    Scordilis, Michael S.
    [J]. JOURNAL OF ELECTRICAL AND COMPUTER ENGINEERING, 2008, 2008
  • [3] A Discrete Wavelet Transform Based Approach to Hindi Speech Recognition
    Ranjan, Shivesh
    [J]. 2010 INTERNATIONAL CONFERENCE ON SIGNAL ACQUISITION AND PROCESSING: ICSAP 2010, PROCEEDINGS, 2010, : 345 - 348
  • [4] Discrete Fourier Transform and Discrete Wavelet Packet Transform in Speech Denoising
    Wang, Zhanfeng
    Li, Suping
    [J]. 2012 5TH INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING (CISP), 2012, : 1588 - 1591
  • [5] Comparison OF Wavelet Based Feature Extraction Methods for Speech/Music Discrimination
    Duzenli, Timur
    Ozkurt, Nalan
    [J]. ISTANBUL UNIVERSITY-JOURNAL OF ELECTRICAL AND ELECTRONICS ENGINEERING, 2011, 11 (01): : 1355 - 1362
  • [6] Discrete wavelet transform techniques in speech processing
    Agbinya, JI
    [J]. 1996 IEEE TENCON - DIGITAL SIGNAL PROCESSING APPLICATIONS PROCEEDINGS, VOLS 1 AND 2, 1996, : 514 - 519
  • [7] Speech compression and encryption based on discrete wavelet transform and chaotic signals
    Abbas Salman Hameed
    [J]. Multimedia Tools and Applications, 2021, 80 : 13663 - 13676
  • [8] Speech compression and encryption based on discrete wavelet transform and chaotic signals
    Hameed, Abbas Salman
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (09) : 13663 - 13676
  • [9] Speech and image compression using discrete wavelet transform
    Junejo, N
    Ahmed, N
    Unar, MA
    Rajput, AQK
    [J]. 2005 IEEE SARNOFF SYMPOSIUM ON ADVANCES IN WIRED AND WIRELESS COMMUNICATION, 2005, : 106 - 109
  • [10] Combined discrete wavelet transform and wavelet packet decomposition for speech enhancement
    Wang, Zhen-li
    Yang, Jie
    Zhang, Xiong-wei
    [J]. 2006 8TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, VOLS 1-4, 2006, : 1107 - +