A comparison of features for speech, music discrimination.

被引:79
|
作者
Carey, MJ [1 ]
Parris, ES [1 ]
Lloyd-Thomas, H [1 ]
机构
[1] Ensigma Ltd, Chepstow, Mons, England
关键词
D O I
10.1109/ICASSP.1999.758084
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Several approaches have previously been taken to the problem of discriminating between speech and music signals. These have used different features as the input to the classifier and have tested and trained on different material. In this paper we examine the discrimination achieved by several different features using common training and test sets and the same classifier. The database assembled for these tests includes speech from thirteen languages and music from all over the world. In each case the distributions in the feature space were modelled by a Gaussian mixture model. Experiments were carried out on four types of feature, amplitude, cepstra, pitch and zero-crossings. In each case the derivative of the feature was also used and found to improve performance. The best performance resulted from using the cepstra and delta cepstra which gave an equal error rate (EER) of 1.2%. This was closely followed by normalised amplitude and delta amplitude. This however used a much less complex model. The pitch and delta pitch gave an EER of 4% which was better than the zero-crossing which produced an EER of 6%.
引用
收藏
页码:149 / 152
页数:4
相关论文
共 50 条
  • [21] Feature extraction for speech and music discrimination
    Hou, Huiyu
    Sadka, Abdul
    Jiang, Richard M.
    2008 INTERNATIONAL WORKSHOP ON CONTENT-BASED MULTIMEDIA INDEXING, 2008, : 154 - 157
  • [22] The social psychology of interpersonal discrimination.
    Coren, S
    POLITICAL PSYCHOLOGY, 1999, 20 (01) : 227 - 230
  • [23] Speech/music discrimination for multimedia applications
    El-Maleh, K
    Klein, M
    Petrucci, G
    Kabal, P
    2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS, VOLS I-VI, 2000, : 2445 - 2448
  • [24] SPECTROGRAM BASED FEATURES SELECTION USING MULTIPLE KERNEL LEARNING FOR SPEECH/MUSIC DISCRIMINATION
    Nilufar, Sharmin
    Ray, Nilanjan
    Molla, M. K. Islam
    Hirose, Keikichi
    2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 501 - 504
  • [25] Performance evaluation of HHT based features for Speech/Music Discrimination under Noisy condition
    Kumar, Arvind
    Kishore, Kamlesh
    Chandra, Mahesh
    PROCEEDINGS OF THE 2020 5TH INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATION AND SECURITY (ICCCS-2020), 2020,
  • [26] Hate Speech, Symbolic Violence, and Racial Discrimination. Antigypsyism: What Responses for the Next Decade?
    Cortes, Ismael
    SOCIAL SCIENCES-BASEL, 2021, 10 (10):
  • [27] Exploring the concept of genetic discrimination.
    Otlowski M.
    Journal of Bioethical Inquiry, 2005, 2 (3) : 165 - 176
  • [28] Contrast discrimination can explain orientation discrimination.
    Itti, L
    Braun, J
    Koch, C
    INVESTIGATIVE OPHTHALMOLOGY & VISUAL SCIENCE, 1999, 40 (04) : S572 - S572
  • [29] STRUCTURAL ASPECTS OF DIASTEREOMERIC DISCRIMINATION.
    Valente, Edward J.
    Miller, Christopher
    Ruggiero, Gerard
    Eggleston, Drake S.
    ACTA CRYSTALLOGRAPHICA A-FOUNDATION AND ADVANCES, 1996, 52 : C452 - C452
  • [30] The localisation of the sense of dietary discrimination.
    Pieron, H
    COMPTES RENDUS HEBDOMADAIRES DES SEANCES DE L ACADEMIE DES SCIENCES, 1908, 147 : 279 - 280