Histogram Equalization-Based Features for Speech, Music, and Song Discrimination

被引:11
|
作者
Gallardo-Antolin, Ascension [1 ]
Montero, Juan M. [2 ]
机构
[1] Univ Carlos III Madrid, Dept Signal Theory & Commun, Madrid, Spain
[2] Univ Politecn Madrid, Dept Elect Engn, Speech Technol Grp, Madrid, Spain
关键词
Acoustic features; audio classification; HEQ-based features; parameterization; speech/music/song discrimination; CLASSIFICATION;
D O I
10.1109/LSP.2010.2049877
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In this letter, we present a new class of segment-based features for speech, music and song discrimination. These features, called PHEQ (Polynomial-Fit Histogram Equalization), are derived from the nonlinear relationship between the short-term feature distributions computed at segment level and a reference distribution. Results show that PHEQ characteristics outperform short-term features such as Mel Frequency Cepstrum Coefficients (MFCC) and conventional segment-based ones such as MFCC mean and variance. Furthermore, the combination of short-term and PHEQ features significantly improves the performance of the whole system.
引用
收藏
页码:659 / 662
页数:4
相关论文
共 50 条
  • [21] Steganalysis of Compressed Speech Based on Histogram Features
    Ding Qi
    Ping Xijian
    2010 6TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS NETWORKING AND MOBILE COMPUTING (WICOM), 2010,
  • [22] Histogram Equalization Detection Based on Statistical Features in Digital Image
    Bi X.-L.
    Qiu Y.-M.
    Xiao B.
    Li W.-S.
    Ma J.-F.
    Jisuanji Xuebao/Chinese Journal of Computers, 2021, 44 (02): : 292 - 303
  • [23] The Influence of Listeners' Mood on Equalization-Based Listening Experience
    Dourou, Nefeli
    Bruschi, Valeria
    Spinsante, Susanna
    Cecchi, Stefania
    ACOUSTICS, 2022, 4 (03): : 746 - 763
  • [24] SPECTROGRAM BASED FEATURES SELECTION USING MULTIPLE KERNEL LEARNING FOR SPEECH/MUSIC DISCRIMINATION
    Nilufar, Sharmin
    Ray, Nilanjan
    Molla, M. K. Islam
    Hirose, Keikichi
    2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 501 - 504
  • [25] Performance evaluation of HHT based features for Speech/Music Discrimination under Noisy condition
    Kumar, Arvind
    Kishore, Kamlesh
    Chandra, Mahesh
    PROCEEDINGS OF THE 2020 5TH INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATION AND SECURITY (ICCCS-2020), 2020,
  • [26] Speech music discrimination using class-specific features
    Beierholm, T
    Baggenstoss, PM
    PROCEEDINGS OF THE 17TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 2, 2004, : 379 - 382
  • [27] Sub-band based histogram equalization in cepstral domain for speech recognition
    Joshi, Vikas
    Bilgi, Raghvendra
    Umesh, S.
    Garcia, Luz
    Benitez, Carmen
    SPEECH COMMUNICATION, 2015, 69 : 46 - 65
  • [28] Quantile based histogram equalization for noise robust large vocabulary speech recognition
    Hilger, F
    Ney, H
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2006, 14 (03): : 845 - 854
  • [29] Effect of histogram equalization in face detection using spatial histogram features
    Parvin H.
    Alizadeh H.
    Journal of Convergence Information Technology, 2011, 6 (09) : 296 - 301
  • [30] Histogram Equalization to Model Adaptation for Robust Speech Recognition
    Suh, Youngjoo
    Kim, Hoirin
    EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING, 2010,