Speech/Music Discrimination Based on Discrete Wavelet Transform

Authors
Ntalampiras, Stavros [1]
Fakotakis, Nikos [1]
Affiliations
[1] Univ Patras, Wire Commun Lab, Dept Elect & Comp Engn, Rion 26500, Greece
Keywords
Computer audition; content-based audio classification; discrete wavelet transform; Gaussian mixture model
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
In this paper, we present an effective approach to speech/music discrimination. Our architecture approaches the problem with the aim of improving the performance of a speech recognition system by excluding non-speech information from processing. Multiresolution analysis is applied to the input signal, and the most significant statistical features are calculated over a predefined texture size. These features are then modeled using a state-of-the-art technique for probability density function estimation, Gaussian mixture models (GMMs). A classification scheme based on a conventional maximum likelihood decision methodology constitutes the next step of our implementation. Although our system is based solely on wavelet signal processing, it demonstrated very good performance, achieving a 91.8% recognition rate.
Pages: 205-211
Number of pages: 7
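
The pipeline outlined in the abstract (wavelet decomposition, statistical features per window, one density model per class, maximum likelihood decision) can be illustrated with a minimal sketch. The abstract does not state the wavelet family, the feature set, the texture size, or the number of mixture components, so everything below is an assumption: a Haar wavelet, the mean and standard deviation of absolute coefficients per subband, and a single diagonal Gaussian per class standing in for a full GMM (it is the one-component special case). The `toy_window` helper and the "tone"/"noise" classes are hypothetical synthetic stand-ins, not speech or music data, and the sketch is not the authors' implementation.

```python
import math
import random
import statistics


def haar_dwt(signal):
    """One level of the Haar DWT: returns (approximation, detail)."""
    s = math.sqrt(2.0)
    pairs = range(0, len(signal) - 1, 2)
    approx = [(signal[i] + signal[i + 1]) / s for i in pairs]
    detail = [(signal[i] - signal[i + 1]) / s for i in pairs]
    return approx, detail


def wavelet_features(window, levels=3):
    """Mean and std of |coefficients| for each detail band and the final approximation."""
    feats = []
    approx = list(window)
    for _ in range(levels):
        approx, detail = haar_dwt(approx)
        mags = [abs(c) for c in detail]
        feats += [statistics.fmean(mags), statistics.pstdev(mags)]
    mags = [abs(c) for c in approx]
    feats += [statistics.fmean(mags), statistics.pstdev(mags)]
    return feats


def fit_gaussian(vectors):
    """Assumption: one diagonal Gaussian per class (a one-component GMM)."""
    dims = list(zip(*vectors))
    means = [statistics.fmean(d) for d in dims]
    variances = [statistics.pvariance(d) + 1e-6 for d in dims]  # small variance floor
    return means, variances


def log_likelihood(x, model):
    """Log density of feature vector x under a diagonal Gaussian."""
    means, variances = model
    return sum(
        -0.5 * (math.log(2 * math.pi * v) + (xi - m) ** 2 / v)
        for xi, m, v in zip(x, means, variances)
    )


def classify(window, models):
    """Maximum likelihood decision: pick the class whose model scores highest."""
    feats = wavelet_features(window)
    return max(models, key=lambda label: log_likelihood(feats, models[label]))


def toy_window(kind, rng, n=256):
    """Hypothetical synthetic data: a noisy low-frequency tone or white noise."""
    if kind == "tone":
        phase = rng.uniform(0, 2 * math.pi)
        return [math.sin(2 * math.pi * 4 * t / n + phase) + 0.05 * rng.gauss(0, 1)
                for t in range(n)]
    return [rng.gauss(0, 1) for _ in range(n)]


rng = random.Random(0)
models = {
    kind: fit_gaussian([wavelet_features(toy_window(kind, rng)) for _ in range(20)])
    for kind in ("tone", "noise")
}
print(classify(toy_window("noise", rng), models))
```

In practice the per-class model would be a multi-component mixture fitted with expectation-maximization, and the windows would be frames of real audio; the single Gaussian here only keeps the sketch short.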