Histogram Equalization-Based Features for Speech, Music, and Song Discrimination

被引:11
|
作者
Gallardo-Antolin, Ascension [1 ]
Montero, Juan M. [2 ]
机构
[1] Univ Carlos III Madrid, Dept Signal Theory & Commun, Madrid, Spain
[2] Univ Politecn Madrid, Dept Elect Engn, Speech Technol Grp, Madrid, Spain
关键词
Acoustic features; audio classification; HEQ-based features; parameterization; speech/music/song discrimination; CLASSIFICATION;
D O I
10.1109/LSP.2010.2049877
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In this letter, we present a new class of segment-based features for speech, music and song discrimination. These features, called PHEQ (Polynomial-Fit Histogram Equalization), are derived from the nonlinear relationship between the short-term feature distributions computed at segment level and a reference distribution. Results show that PHEQ characteristics outperform short-term features such as Mel Frequency Cepstrum Coefficients (MFCC) and conventional segment-based ones such as MFCC mean and variance. Furthermore, the combination of short-term and PHEQ features significantly improves the performance of the whole system.
引用
收藏
页码:659 / 662
页数:4
相关论文
共 50 条
  • [31] HISTOGRAM EQUALIZATION AND NOISE MASKING FOR ROBUST SPEECH RECOGNITION
    Zhang, Xueru
    Demuynck, Kris
    Van Hamme, Hugo
    2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 4578 - 4581
  • [32] Probabilistic class histogram equalization for robust speech recognition
    Suh, Youngjoo
    Ji, Mikyong
    Kim, Hoirin
    IEEE SIGNAL PROCESSING LETTERS, 2007, 14 (04) : 287 - 290
  • [33] Histogram Equalization to Model Adaptation for Robust Speech Recognition
    Youngjoo Suh
    Hoirin Kim
    EURASIP Journal on Advances in Signal Processing, 2010
  • [34] Empirical mode decomposition based statistical features for discrimination of speech and low frequency music signal
    Kumar, Arvind
    Chandra, Mahesh
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (01) : 33 - 58
  • [35] Empirical mode decomposition based statistical features for discrimination of speech and low frequency music signal
    Arvind Kumar
    Mahesh Chandra
    Multimedia Tools and Applications, 2023, 82 : 33 - 58
  • [36] Equalization-Based Beamforming for Secure Multicasting in Multicast Wiretap Channels
    Hwang, Duckdong
    Yang, Janghoon
    Kwon, Kuhyung
    Joung, Jingon
    Song, Hyoung-Kyu
    IEEE ACCESS, 2021, 9 : 33826 - 33835
  • [37] Equalization-Based Beamforming for Secure Multicasting in Multicast Wiretap Channels
    Hwang, Duckdong
    Yang, Janghoon
    Kwon, Kuhyung
    Joung, Jingon
    Song, Hyoung-Kyu
    IEEE Access, 2021, 9 : 33826 - 33835
  • [38] Enhancing Speech and Music Discrimination Through the Integration of Static and Dynamic Features
    Chen, Liangwei
    Zhou, Xiren
    Tut, Qiang
    Chen, Huanhuan
    INTERSPEECH 2024, 2024, : 4318 - 4322
  • [39] Equalization-Based Digital Background Calibration Technique for Pipelined ADCs
    Zeinali, Behzad
    Moosazadeh, Tohid
    Yavari, Mohammad
    Rodriguez-Vazquez, Angel
    IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, 2014, 22 (02) : 322 - 333
  • [40] Speech/music discrimination based on wavelets for broadcast programs
    Didiot, E.
    Illina, I.
    Mella, O.
    Fohr, D.
    Haton, J. -P
    SIGMAP 2006: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND MULTIMEDIA APPLICATIONS, 2006, : 151 - +