Speech/Music Discrimination Based on Discrete Wavelet Transform

Cited by: 0

Authors:

Ntalampiras, Stavros [1]

Fakotakis, Nikos [1]

Affiliations:

[1] Univ Patras, Wire Commun Lab, Dept Elect & Comp Engn, Rion 26500, Greece

Source:

ARTIFICIAL INTELLIGENCE: THEORIES, MODELS AND APPLICATIONS, SETN 2008 | 2008 / Vol. 5138

Keywords:

Computer audition; content-based audio classification; discrete wavelet transform; Gaussian mixture model

DOI:

Not available

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Subject classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

In this paper we present an effective approach to speech/music discrimination. Our architecture approaches the problem from the standpoint of improving the performance of a speech recognition system by excluding non-speech information from processing. Multiresolution analysis is applied to the input signal, and the most significant statistical features are calculated over a predefined texture size. These characteristics are then modeled using a state-of-the-art technique for probability density function estimation, Gaussian mixture models (GMM). A classification scheme based on a conventional maximum likelihood decision methodology constitutes the next step of our implementation. Although our system is based solely on wavelet signal processing, it demonstrated very good performance, achieving a 91.8% recognition rate.
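
The pipeline outlined in the abstract (wavelet decomposition, statistical features per texture window, probabilistic modeling, maximum likelihood decision) can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration and is not taken from the paper: the Haar wavelet, three decomposition levels, the two statistics per subband, the 256-sample window, the synthetic "tone" and "noise" classes that stand in for real audio, and the use of a single diagonal Gaussian per class in place of a full multi-component GMM.

```python
import math
import random

def haar_dwt(signal):
    """One level of the Haar DWT (assumed wavelet): returns (approximation, detail)."""
    s = 1.0 / math.sqrt(2.0)
    idx = range(0, len(signal) - 1, 2)
    approx = [(signal[i] + signal[i + 1]) * s for i in idx]
    detail = [(signal[i] - signal[i + 1]) * s for i in idx]
    return approx, detail

def band_stats(band):
    """Mean absolute value and standard deviation of one subband."""
    n = len(band)
    mean = sum(band) / n
    mean_abs = sum(abs(x) for x in band) / n
    std = math.sqrt(sum((x - mean) ** 2 for x in band) / n)
    return [mean_abs, std]

def subband_features(window, levels=3):
    """Statistics of each detail subband plus the final approximation subband."""
    feats, approx = [], list(window)
    for _ in range(levels):
        approx, detail = haar_dwt(approx)
        feats.extend(band_stats(detail))
    feats.extend(band_stats(approx))
    return feats

def fit_gaussian(vectors):
    """Diagonal Gaussian (a one-component stand-in for a GMM): (means, variances)."""
    n, dims = len(vectors), len(vectors[0])
    means = [sum(v[d] for v in vectors) / n for d in range(dims)]
    variances = [max(sum((v[d] - means[d]) ** 2 for v in vectors) / n, 1e-4)
                 for d in range(dims)]  # variance floor avoids division by zero
    return means, variances

def log_likelihood(x, model):
    means, variances = model
    return sum(-0.5 * (math.log(2.0 * math.pi * var) + (xi - m) ** 2 / var)
               for xi, m, var in zip(x, means, variances))

def classify(x, models):
    """Maximum likelihood decision: pick the class whose model scores highest."""
    return max(models, key=lambda label: log_likelihood(x, models[label]))

# Hypothetical synthetic classes, used only so the sketch runs without audio files.
def make_tone(n=256):
    phase = random.uniform(0.0, 2.0 * math.pi)
    return [math.sin(2.0 * math.pi * 4 * i / n + phase) + random.gauss(0.0, 0.05)
            for i in range(n)]

def make_noise(n=256):
    return [random.gauss(0.0, 0.5) for _ in range(n)]

random.seed(0)
models = {
    "tone": fit_gaussian([subband_features(make_tone()) for _ in range(20)]),
    "noise": fit_gaussian([subband_features(make_noise()) for _ in range(20)]),
}

print(classify(subband_features(make_tone()), models))
print(classify(subband_features(make_noise()), models))
```

In practice one would likely replace the hand-written transform and the single Gaussian with library implementations of a multi-level DWT and a multi-component mixture model, and train on labeled speech and music recordings; the sketch only shows how the stages connect.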


Pages: 205-211

Page count: 7

50 entries in total

[1] A wavelet-based parameterization for speech/music discrimination
Didiot, E.
Illina, I.
Fohr, D.
Mella, O.
[J]. COMPUTER SPEECH AND LANGUAGE, 2010, 24 (02) : 341 - 357
[2] Psychoacoustic Music Analysis Based on the Discrete Wavelet Packet Transform
He, Xing
Scordilis, Michael S.
[J]. JOURNAL OF ELECTRICAL AND COMPUTER ENGINEERING, 2008, 2008
[3] A Discrete Wavelet Transform Based Approach to Hindi Speech Recognition
Ranjan, Shivesh
[J]. 2010 INTERNATIONAL CONFERENCE ON SIGNAL ACQUISITION AND PROCESSING: ICSAP 2010, PROCEEDINGS, 2010, : 345 - 348
[4] Discrete Fourier Transform and Discrete Wavelet Packet Transform in Speech Denoising
Wang, Zhanfeng
Li, Suping
[J]. 2012 5TH INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING (CISP), 2012, : 1588 - 1591
[5] Comparison of Wavelet Based Feature Extraction Methods for Speech/Music Discrimination
Duzenli, Timur
Ozkurt, Nalan
[J]. ISTANBUL UNIVERSITY-JOURNAL OF ELECTRICAL AND ELECTRONICS ENGINEERING, 2011, 11 (01) : 1355 - 1362
[6] Discrete wavelet transform techniques in speech processing
Agbinya, JI
[J]. 1996 IEEE TENCON - DIGITAL SIGNAL PROCESSING APPLICATIONS PROCEEDINGS, VOLS 1 AND 2, 1996, : 514 - 519
[7] Speech compression and encryption based on discrete wavelet transform and chaotic signals
Abbas Salman Hameed
[J]. Multimedia Tools and Applications, 2021, 80 : 13663 - 13676
[8] Speech compression and encryption based on discrete wavelet transform and chaotic signals
Hameed, Abbas Salman
[J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (09) : 13663 - 13676
[9] Speech and image compression using discrete wavelet transform
Junejo, N
Ahmed, N
Unar, MA
Rajput, AQK
[J]. 2005 IEEE SARNOFF SYMPOSIUM ON ADVANCES IN WIRED AND WIRELESS COMMUNICATION, 2005, : 106 - 109
[10] Combined discrete wavelet transform and wavelet packet decomposition for speech enhancement
Wang, Zhen-li
Yang, Jie
Zhang, Xiong-wei
[J]. 2006 8TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, VOLS 1-4, 2006, : 1107 - +
