Histogram Equalization-Based Features for Speech, Music, and Song Discrimination

Cited by: 11
Authors
Gallardo-Antolin, Ascension [1]
Montero, Juan M. [2]
Affiliations
[1] Univ Carlos III Madrid, Dept Signal Theory & Commun, Madrid, Spain
[2] Univ Politecn Madrid, Dept Elect Engn, Speech Technol Grp, Madrid, Spain
Keywords
Acoustic features; audio classification; HEQ-based features; parameterization; speech/music/song discrimination; CLASSIFICATION;
DOI
10.1109/LSP.2010.2049877
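Chinese Library Classification (CLC) number
TM [Electrical Engineering]; TN [Electronics and Communication Technology];
Subject classification codes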
0808 ; 0809 ;
Abstract
In this letter, we present a new class of segment-based features for speech, music and song discrimination. These features, called PHEQ (Polynomial-Fit Histogram Equalization), are derived from the nonlinear relationship between the short-term feature distributions computed at segment level and a reference distribution. Results show that PHEQ characteristics outperform short-term features such as Mel Frequency Cepstrum Coefficients (MFCC) and conventional segment-based ones such as MFCC mean and variance. Furthermore, the combination of short-term and PHEQ features significantly improves the performance of the whole system.
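The abstract describes PHEQ features only at a high level, so the following is a minimal sketch of one plausible reading, not the authors' implementation. It assumes that, for each feature dimension in a segment, the sorted short-term values are mapped by histogram equalization onto the quantiles of a reference distribution, that a low-order polynomial is fitted to this mapping, and that the polynomial coefficients serve as the segment-level features. The standard Gaussian reference, the polynomial order of 3, and the names `pheq_like_features`, `polyfit`, and `solve_linear_system` are all assumptions introduced for illustration.

```python
from statistics import NormalDist


def solve_linear_system(a, b):
    """Solve a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    # Augmented matrix, so row operations update both sides together.
    m = [row[:] + [rhs] for row, rhs in zip(a, b)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    x = [0.0] * n
    for i in reversed(range(n)):
        s = sum(m[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (m[i][n] - s) / m[i][i]
    return x


def polyfit(xs, ys, order):
    """Least-squares polynomial coefficients, lowest degree first."""
    size = order + 1
    a = [[sum(x ** (i + j) for x in xs) for j in range(size)]
         for i in range(size)]
    b = [sum(y * x ** i for x, y in zip(xs, ys)) for i in range(size)]
    return solve_linear_system(a, b)


def pheq_like_features(frames, order=3):
    """Hypothetical PHEQ-style features for one segment.

    frames: list of per-frame feature vectors (e.g. MFCCs).
    Returns order + 1 coefficients per feature dimension, concatenated.
    Needs more than `order` distinct values per dimension; otherwise
    the least-squares system is singular.
    """
    reference = NormalDist()  # assumed reference distribution: N(0, 1)
    n = len(frames)
    # Reference quantile at the empirical CDF level of the k-th smallest value.
    targets = [reference.inv_cdf((k + 0.5) / n) for k in range(n)]
    features = []
    for dim in range(len(frames[0])):
        values = sorted(frame[dim] for frame in frames)
        # Fit the equalization mapping: sorted value -> reference quantile.
        features.extend(polyfit(values, targets, order))
    return features
```

With 13 MFCCs per frame and the assumed order of 3, this would give a 52-dimensional vector per segment, which could be used alone or alongside short-term features. The paper's actual reference distribution, polynomial order, and segment length are not stated in the abstract.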
Pages: 659-662
Page count: 4