A template-based algorithm by geometric means for the automatic and efficient recognition of music chords

Cited by: 0
Authors
Hernandez, Ruben [1]
Guerrero, Antonio [2]
Macias-Diaz, Jorge E. [3, 4]
Affiliations
[1] Univ Autonoma Aguascalientes, Ctr Ciencias Basicas, Ave Univ 940,Ciudad Univ, Aguascalientes 20100, Aguascalientes, Mexico
[2] Univ Autonoma Aguascalientes, Dept Estadist, Ave Univ 940,Ciudad Univ, Aguascalientes 20100, Aguascalientes, Mexico
[3] Tallinn Univ, Dept Math & Didact Math, Narva Mnt 25, EE-10120 Tallinn, Estonia
[4] Univ Autonoma Aguascalientes, Dept Matemat & Fis, Ave Univ 940,Ciudad Univ, Aguascalientes 20100, Aguascalientes, Mexico
Keywords
Music chords recognition; Automatic classification; Geometric means; Signal processing; Gabor filters
DOI
10.1007/s12065-022-00771-6
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
In this work, we introduce a template-based computational method to recognize chords from an audio recording of a musical instrument. The algorithm is based on a time-frequency analysis using Gabor filter banks. These filters are centered on the frequencies of musical notes in different octaves, adjusted to account for the detuning of the recording. From the output of the filtering stage, a geometric mean is calculated for each chord. These statistics are calculated from the combinations of notes that form each chord, which are automatically grouped as templates. The presence of chords is determined from these metrics. Several experiments are carried out for major, minor, augmented, diminished and suspended chords played on acoustic guitar, classical guitar, electric guitar, piano and ukulele. A comparative study against machine-learning classifiers is presented. The results show a superior performance of the present approach. In addition, the proposed method has the advantage that it does not require a training stage, in contrast with methods based on machine-learning algorithms. This significantly reduces the storage and time required for processing.
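The pipeline summarized in the abstract (Gabor filters at note frequencies, then a geometric mean per chord template) can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the relative bandwidth `rel_bandwidth`, the octave range, the single-frame analysis, and the use of one `a4_hz` reference to model detuning are all assumptions made for illustration, and only a `sus4` template stands in for the suspended chords.

```python
import cmath
import math

NOTE_NAMES = ["C", "C#", "D", "D#", "E", "F", "F#", "G", "G#", "A", "A#", "B"]
# Semitone intervals above the root for the chord families named in the abstract.
CHORD_INTERVALS = {
    "maj": (0, 4, 7),
    "min": (0, 3, 7),
    "aug": (0, 4, 8),
    "dim": (0, 3, 6),
    "sus4": (0, 5, 7),
}


def note_frequency(midi_note, a4_hz=440.0):
    """Equal-tempered frequency; a4_hz may be shifted to follow a detuned recording."""
    return a4_hz * 2.0 ** ((midi_note - 69) / 12.0)


def gabor_magnitude(frame, sample_rate, freq_hz, rel_bandwidth=0.015):
    """Magnitude of one Gabor coefficient: the frame projected onto a
    Gaussian-windowed complex sinusoid centered in the middle of the frame.
    rel_bandwidth (filter bandwidth / center frequency) is an assumed value."""
    sigma_t = 1.0 / (2.0 * math.pi * rel_bandwidth * freq_hz)
    center = len(frame) / 2.0
    coeff, norm = 0j, 0.0
    for n, sample in enumerate(frame):
        t = (n - center) / sample_rate
        w = math.exp(-0.5 * (t / sigma_t) ** 2)
        coeff += sample * w * cmath.exp(-2j * math.pi * freq_hz * t)
        norm += w
    return abs(coeff) / norm


def pitch_class_profile(frame, sample_rate, octaves=(3, 4, 5), a4_hz=440.0):
    """Sum the filter responses of each pitch class over several octaves."""
    profile = [0.0] * 12
    for octave in octaves:
        for pc in range(12):
            midi_note = 12 * (octave + 1) + pc  # MIDI convention: C4 = 60
            freq = note_frequency(midi_note, a4_hz)
            profile[pc] += gabor_magnitude(frame, sample_rate, freq)
    return profile


def chord_scores(profile, eps=1e-12):
    """Geometric mean of the note responses for every chord template."""
    scores = {}
    for root in range(12):
        for quality, intervals in CHORD_INTERVALS.items():
            logs = [math.log(profile[(root + i) % 12] + eps) for i in intervals]
            scores[f"{NOTE_NAMES[root]}:{quality}"] = math.exp(sum(logs) / len(logs))
    return scores


def recognize_chord(frame, sample_rate):
    """Return the template label with the largest geometric mean."""
    scores = chord_scores(pitch_class_profile(frame, sample_rate))
    return max(scores, key=scores.get)


def synth_chord(midi_notes, sample_rate=4000, seconds=1.0):
    """Synthetic test signal: a sum of pure sinusoids, not an instrument recording."""
    n_samples = int(sample_rate * seconds)
    return [
        sum(math.sin(2 * math.pi * note_frequency(m) * n / sample_rate)
            for m in midi_notes)
        for n in range(n_samples)
    ]


if __name__ == "__main__":
    print(recognize_chord(synth_chord([60, 64, 67]), 4000))  # notes C4, E4, G4
```

Because a geometric mean collapses toward zero when any one of its factors is small, a template scores well only if all of its notes are present, which is what lets the sketch separate C major from templates that share two of its three notes. Real recordings contain harmonics, noise and onsets that this toy signal does not, and the augmented templates are symmetric (C, E and G# augmented share the same notes), so ties there are resolved arbitrarily.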
Pages: 467-481
Page count: 15
Related papers
50 in total
  • [1] A template-based algorithm by geometric means for the automatic and efficient recognition of music chords
    Rubén Hernández
    Antonio Guerrero
    Jorge E. Macías-Díaz
    [J]. Evolutionary Intelligence, 2024, 17 : 467 - 481
  • [2] Computationally Efficient Template-Based Face Recognition
    Wu, Yue
    AbdAlmageed, Wael
    Rawls, Stephen
    Natarajan, Prem
    [J]. 2016 23RD INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2016, : 1424 - 1429
  • [3] Data Pruning for Template-based Automatic Speech Recognition
    Seppi, Dino
    Van Compernolle, Dirk
    [J]. 11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 1-2, 2010, : 901 - 904
  • [4] Template-based Automatic Speech Recognition meets Prosody
    Seppi, Dino
    Demuynck, Kris
    Van Compernolle, Dirk
    [J]. 12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 552 - 555
  • [5] Template-based automatic recognition of birdsong syllables from continuous recordings
    Anderson, SE
    Dave, AS
    Margoliash, D
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1996, 100 (02): : 1209 - 1219
  • [6] Face recognition is not template-based
    Carbon, CC
    Leder, H
    [J]. PERCEPTION, 2004, 33 : 103 - 103
  • [7] COMPARING CQT AND REASSIGNMENT BASED CHROMA FEATURES FOR TEMPLATE-BASED AUTOMATIC CHORD RECOGNITION
    O'Hanlon, Ken
    Sandler, Mark B.
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 860 - 864
  • [8] SAR Automatic Target Recognition Using Maximum Likelihood Template-based Classifiers
    Saghri, John A.
    [J]. APPLICATIONS OF DIGITAL IMAGE PROCESSING XXXI, 2008, 7073
  • [9] Template-based online character recognition
    Connell, SD
    Jain, AK
    [J]. PATTERN RECOGNITION, 2001, 34 (01) : 1 - 14
  • [10] Template-based continuous speech recognition
    De Wachter, Mathias
    Matton, Mike
    Demuynck, Kris
    Wambacq, Patrick
    Cools, Ronald
    Van Compernolle, Dirk
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2007, 15 (04): : 1377 - 1390