MUSICAL INSTRUMENT IDENTIFICATION USING MULTISCALE MEL-FREQUENCY CEPSTRAL COEFFICIENTS

Cited by: 0
Authors
Sturm, Bob L. [1 ]
Morvidone, Marcela [2 ]
Daudet, Laurent [3 ]
Affiliations
[1] Aalborg Univ, Dept Architecture Design & Media Technol, DK-2750 Ballerup, Denmark
[2] Univ Tecnol Nacl, Fac Reg Buenos Aires, Dept Ingn Elect, Buenos Aires, DF, Argentina
[3] Univ Paris Diderot, Inst Langevin LOA, UMR 7587, F-75231 Paris 05, France
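Keywords
DOI
Not available
Chinese Library Classification
TM [Electrical Engineering]; TN [Electronic Technology, Communication Technology];
Discipline classification code
0808; 0809;
Abstract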
We investigate the benefits of evaluating Mel-frequency cepstral coefficients (MFCCs) over several time scales in the context of automatic musical instrument identification for signals that are monophonic but derived from real musical settings. We define several sets of features derived from MFCCs computed using multiple time resolutions, and compare their performance against features computed using a single time resolution, such as MFCCs and derivatives of MFCCs. We find that in each task (pairwise discrimination and one-vs.-all classification), the features involving multiscale decompositions perform significantly better than features computed using a single time resolution.
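As a rough illustration of what "MFCCs computed using multiple time resolutions" can mean, the following is a minimal NumPy sketch. It is not the authors' feature definition: the abstract does not say how the scales are chosen or combined, so the window lengths (256, 1024, 4096 samples), the 26-filter mel bank, the 13 coefficients, and the mean-pooling followed by concatenation are all assumptions made for this example, and the function names are hypothetical.

```python
import numpy as np

def hz_to_mel(f):
    return 2595.0 * np.log10(1.0 + f / 700.0)

def mel_to_hz(m):
    return 700.0 * (10.0 ** (m / 2595.0) - 1.0)

def mel_filterbank(n_filters, n_fft, sr):
    # Triangular filters whose edges are equally spaced on the mel scale.
    hz_pts = mel_to_hz(np.linspace(hz_to_mel(0.0), hz_to_mel(sr / 2.0), n_filters + 2))
    freqs = np.fft.rfftfreq(n_fft, d=1.0 / sr)
    fb = np.zeros((n_filters, freqs.size))
    for i in range(n_filters):
        lo, ctr, hi = hz_pts[i], hz_pts[i + 1], hz_pts[i + 2]
        rising = (freqs - lo) / (ctr - lo)
        falling = (hi - freqs) / (hi - ctr)
        fb[i] = np.maximum(0.0, np.minimum(rising, falling))
    return fb

def mfcc(signal, sr, win_len, hop, n_filters=26, n_coeffs=13):
    # Assumes len(signal) >= win_len; returns an (n_frames, n_coeffs) array.
    n_frames = 1 + (len(signal) - win_len) // hop
    idx = np.arange(win_len)[None, :] + hop * np.arange(n_frames)[:, None]
    frames = signal[idx] * np.hanning(win_len)
    power = np.abs(np.fft.rfft(frames, axis=1)) ** 2
    # Small floor avoids log(0) when a filter captures no energy.
    log_mel = np.log(power @ mel_filterbank(n_filters, win_len, sr).T + 1e-10)
    # Unnormalized DCT-II over the filter axis, keeping the first n_coeffs rows.
    n = np.arange(n_filters)
    k = np.arange(n_coeffs)[:, None]
    dct = np.cos(np.pi * k * (2 * n + 1) / (2 * n_filters))
    return log_mel @ dct.T

def multiscale_mfcc(signal, sr, win_lens=(256, 1024, 4096), n_coeffs=13):
    # Assumed combination rule: average the MFCCs over time at each
    # window length, then concatenate the per-scale mean vectors.
    feats = [mfcc(signal, sr, w, w // 2, n_coeffs=n_coeffs).mean(axis=0)
             for w in win_lens]
    return np.concatenate(feats)

# Usage: one second of a 440 Hz tone sampled at 16 kHz.
sr = 16000
tone = np.sin(2 * np.pi * 440.0 * np.arange(sr) / sr)
features = multiscale_mfcc(tone, sr)
print(features.shape)
```

A single-resolution baseline in the same setup would be one call to `mfcc` with a fixed `win_len`; the multiscale vector simply stacks several such descriptions so that a classifier sees both short-time and long-time spectral shape.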
Pages: 477-481
Number of pages: 5
Related papers
50 in total
  • [1] Identification of Language using Mel-Frequency Cepstral Coefficients (MFCC)
    Koolagudi, Shashidhar G.
    Rastogi, Deepika
    Rao, K. Sreenivasa
    [J]. INTERNATIONAL CONFERENCE ON MODELLING OPTIMIZATION AND COMPUTING, 2012, 38 : 3391 - 3398
  • [2] Mel-frequency Cepstral Coefficients for Eye Movement Identification
    Nguyen Viet Cuong
    Vu Dinh
    Lam Si Tung Ho
    [J]. 2012 IEEE 24TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2012), VOL 1, 2012, : 253 - 260
  • [3] Fingerprint Recognition Using Mel-Frequency Cepstral Coefficients
    Hashad F.G.
    Halim T.M.
    Diab S.M.
    Sallam B.M.
    El-Samie F.E.A.
    [J]. Pattern Recognition and Image Analysis, 2010, 20 (03) : 360 - 369
  • [4] Using Mel-Frequency Cepstral Coefficients in Missing Data Technique
    Zhang Jun
    Sam Kwong
    Wei Gang
    Qingyang Hong
    [J]. EURASIP Journal on Advances in Signal Processing, 2004
  • [5] Using Mel-Frequency Cepstral Coefficients in Missing Data Technique
    [J]. Jun, Z. (zhj_angun@sina.com.cn), 1600, Hindawi Publishing Corporation (2004):
  • [6] Voice Recognition and Marking Using Mel-frequency Cepstral Coefficients
    Sheu, Jia-Shing
    Chen, Ching-Wen
    [J]. SENSORS AND MATERIALS, 2020, 32 (10) : 3209 - 3220
  • [7] Using Mel-frequency cepstral coefficients in missing data technique
    Jun, Z
    Kwong, S
    Gang, W
    Hong, QY
    [J]. EURASIP JOURNAL ON APPLIED SIGNAL PROCESSING, 2004, 2004 (03) : 340 - 346
  • [8] Computing Mel-frequency cepstral coefficients on the power spectrum
    Molau, S
    Pitz, M
    Schlüter, R
    Ney, H
    [J]. 2001 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-VI, PROCEEDINGS: VOL I: SPEECH PROCESSING 1; VOL II: SPEECH PROCESSING 2 IND TECHNOL TRACK DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS NEURALNETWORKS FOR SIGNAL PROCESSING; VOL III: IMAGE & MULTIDIMENSIONAL SIGNAL PROCESSING MULTIMEDIA SIGNAL PROCESSING, 2001, : 73 - 76
  • [9] PPG-based human identification using Mel-frequency cepstral coefficients and neural networks
    Ali I. Siam
    Atef Abou Elazm
    Nirmeen A. El-Bahnasawy
    Ghada M. El Banby
    Fathi E. Abd El-Samie
    [J]. Multimedia Tools and Applications, 2021, 80 : 26001 - 26019
  • [10] PPG-based human identification using Mel-frequency cepstral coefficients and neural networks
    Siam, Ali I.
    Elazm, Atef Abou
    El-Bahnasawy, Nirmeen A.
    El Banby, Ghada M.
    Abd El-Samie, Fathi E.
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (17) : 26001 - 26019