Histogram Equalization-Based Features for Speech, Music, and Song Discrimination

被引：11

作者：

Gallardo-Antolin, Ascension ^{[1
]}

Montero, Juan M. ^{[2
]}

机构：

[1] Univ Carlos III Madrid, Dept Signal Theory & Commun, Madrid, Spain

[2] Univ Politecn Madrid, Dept Elect Engn, Speech Technol Grp, Madrid, Spain

来源：

IEEE SIGNAL PROCESSING LETTERS | 2010年 / 17卷 / 07期

关键词：

Acoustic features; audio classification; HEQ-based features; parameterization; speech/music/song discrimination; CLASSIFICATION;

D O I：

10.1109/LSP.2010.2049877

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

In this letter, we present a new class of segment-based features for speech, music and song discrimination. These features, called PHEQ (Polynomial-Fit Histogram Equalization), are derived from the nonlinear relationship between the short-term feature distributions computed at segment level and a reference distribution. Results show that PHEQ characteristics outperform short-term features such as Mel Frequency Cepstrum Coefficients (MFCC) and conventional segment-based ones such as MFCC mean and variance. Furthermore, the combination of short-term and PHEQ features significantly improves the performance of the whole system.

引用

页码：659 / 662

页数：4

共 50 条

[1] Histogram Equalization-Based Thresholding
Kwon, Soon Hak
Jeong, Hye Cheuan
Seo, Suk Tae
Lee, In Keun
Son, Chang Sik
IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2008, E91D (11) : 2751 - 2753
[2] Phase Equalization-Based Autoregressive Model of Speech Signals
Hiroya, Sadao
Mochida, Takemi
11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 1-2, 2010, : 42 - 45
[3] Histogram equalization of contextual statistics of speech features for robust speech recognition
Hsieh, Hsin-Ju
Chen, Berlin
Hung, Jeih-weih
MULTIMEDIA TOOLS AND APPLICATIONS, 2015, 74 (17) : 6769 - 6795
[4] Histogram equalization of contextual statistics of speech features for robust speech recognition
Hsin-Ju Hsieh
Berlin Chen
Jeih-weih Hung
Multimedia Tools and Applications, 2015, 74 : 6769 - 6795
[5] A new image quality measure for assessment of histogram equalization-based contrast enhancement techniques
Chen, Soong-Der
DIGITAL SIGNAL PROCESSING, 2012, 22 (04) : 640 - 647
[6] MUSIC TONALITY FEATURES FOR SPEECH/MUSIC DISCRIMINATION
Sell, Gregory
Clark, Pascal
2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
[7] Curvelet transform and contrast adaptive clip histogram equalization-based image defogging algorithm
Wang Qi
Wang Shigang
Jia Bowen
Du Hailong
The Journal of China Universities of Posts and Telecommunications, 2018, 25 (02) : 96 - 104
[8] Curvelet transform and contrast adaptive clip histogram equalization-based image defogging algorithm
Qi W.
Shigang W.
Bowen J.
Hailong D.
Shigang, Wang (wangshigang@vip.sina.com), 2018, Beijing University of Posts and Telecommunications (25): : 96 - 104
[9] Equalization-Based Enterprise Service Selection
Xue Xiao
Wang Shufang
2014 IEEE COMPUTING, COMMUNICATIONS AND IT APPLICATIONS CONFERENCE (COMCOMAP), 2014, : 40 - 45
[10] Histogram Equalization-Based Thresholding (vol E91D, pg 2751, 2008)
Kwon, S. H.
Jeong, H. C.
Seo, S. T.
Lee, I. K.
Son, C. S.
IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2008, E91D (12) : 2915 - 2915

← 1 2 3 4 5 →