Histogram Equalization-Based Features for Speech, Music, and Song Discrimination

被引：11

作者：

Gallardo-Antolin, Ascension ^{[1
]}

Montero, Juan M. ^{[2
]}

机构：

[1] Univ Carlos III Madrid, Dept Signal Theory & Commun, Madrid, Spain

[2] Univ Politecn Madrid, Dept Elect Engn, Speech Technol Grp, Madrid, Spain

来源：

IEEE SIGNAL PROCESSING LETTERS | 2010年 / 17卷 / 07期

关键词：

Acoustic features; audio classification; HEQ-based features; parameterization; speech/music/song discrimination; CLASSIFICATION;

D O I：

10.1109/LSP.2010.2049877

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

In this letter, we present a new class of segment-based features for speech, music and song discrimination. These features, called PHEQ (Polynomial-Fit Histogram Equalization), are derived from the nonlinear relationship between the short-term feature distributions computed at segment level and a reference distribution. Results show that PHEQ characteristics outperform short-term features such as Mel Frequency Cepstrum Coefficients (MFCC) and conventional segment-based ones such as MFCC mean and variance. Furthermore, the combination of short-term and PHEQ features significantly improves the performance of the whole system.

引用

页码：659 / 662

页数：4

共 50 条

[31] HISTOGRAM EQUALIZATION AND NOISE MASKING FOR ROBUST SPEECH RECOGNITION
Zhang, Xueru
Demuynck, Kris
Van Hamme, Hugo
2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 4578 - 4581
[32] Probabilistic class histogram equalization for robust speech recognition
Suh, Youngjoo
Ji, Mikyong
Kim, Hoirin
IEEE SIGNAL PROCESSING LETTERS, 2007, 14 (04) : 287 - 290
[33] Histogram Equalization to Model Adaptation for Robust Speech Recognition
Youngjoo Suh
Hoirin Kim
EURASIP Journal on Advances in Signal Processing, 2010
[34] Empirical mode decomposition based statistical features for discrimination of speech and low frequency music signal
Kumar, Arvind
Chandra, Mahesh
MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (01) : 33 - 58
[35] Empirical mode decomposition based statistical features for discrimination of speech and low frequency music signal
Arvind Kumar
Mahesh Chandra
Multimedia Tools and Applications, 2023, 82 : 33 - 58
[36] Equalization-Based Beamforming for Secure Multicasting in Multicast Wiretap Channels
Hwang, Duckdong
Yang, Janghoon
Kwon, Kuhyung
Joung, Jingon
Song, Hyoung-Kyu
IEEE ACCESS, 2021, 9 : 33826 - 33835
[37] Equalization-Based Beamforming for Secure Multicasting in Multicast Wiretap Channels
Hwang, Duckdong
Yang, Janghoon
Kwon, Kuhyung
Joung, Jingon
Song, Hyoung-Kyu
IEEE Access, 2021, 9 : 33826 - 33835
[38] Enhancing Speech and Music Discrimination Through the Integration of Static and Dynamic Features
Chen, Liangwei
Zhou, Xiren
Tut, Qiang
Chen, Huanhuan
INTERSPEECH 2024, 2024, : 4318 - 4322
[39] Equalization-Based Digital Background Calibration Technique for Pipelined ADCs
Zeinali, Behzad
Moosazadeh, Tohid
Yavari, Mohammad
Rodriguez-Vazquez, Angel
IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, 2014, 22 (02) : 322 - 333
[40] Speech/music discrimination based on wavelets for broadcast programs
Didiot, E.
Illina, I.
Mella, O.
Fohr, D.
Haton, J. -P
SIGMAP 2006: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND MULTIMEDIA APPLICATIONS, 2006, : 151 - +

← 1 2 3 4 5 →