A system for automatic chord transcription from audio using genre-specific hidden Markov models

被引:0
|
作者
Lee, Kyogu [1 ]
机构
[1] Stanford Univ, Ctr Comp Res Mus & Acoust, Stanford, CA 94305 USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We describe a system for automatic chord transcription from the raw audio using genre-specific hidden Markov models trained on audio-from-symbolic data. In order to avoid enormous amount of human labor required to manually annotate the chord labels for ground-truth, we use symbolic data such as MIDI files to automate the labeling process. In parallel, we synthesize the same symbolic files to provide the models with the sufficient amount of observation feature vectors along with the automatically generated annotations for training. In doing so, we build different models for various musical genres, whose model parameters reveal characteristics specific to their corresponding genre. The experimental results show that the HMMs trained on synthesized data perform very well on real acoustic recordings. It is also shown that when the correct genre is chosen, simpler, genre-specific model yields performance better than or comparable to that of more complex model that is genre-independent. Furthermore, we also demonstrate the potential application of the proposed model to the genre classification task.
引用
收藏
页码:134 / 146
页数:13
相关论文
共 50 条
  • [31] Indirect Acquisition of Violin Instrumental Controls from Audio Signal with Hidden Markov Models
    Perez-Carrillo, Alfonso
    Wanderley, Marcelo M.
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2015, 23 (05) : 932 - 940
  • [32] Automatic Segmentation of the Second Cardiac Sound by Using Wavelets and Hidden Markov Models
    Lima, C. S.
    Barbosa, D.
    2008 30th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, Vols 1-8, 2008, : 334 - 337
  • [33] Using Hidden Markov Models to build an automatic, continuous and probabilistic sleep stager
    Flexer, A
    Sykacek, P
    Rezek, I
    Dorffner, G
    IJCNN 2000: PROCEEDINGS OF THE IEEE-INNS-ENNS INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOL III, 2000, : 627 - 631
  • [34] Segmentation of expiratory and inspiratory sounds in baby cry audio recordings using hidden Markov models
    Aucouturier, Jean-Julien
    Nonaka, Yulri
    Katahira, Kentaro
    Okanoya, Kazuo
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2011, 130 (05): : 2969 - 2977
  • [35] Acoustic chord transcription and key extraction from audio using key-dependent HMMs trained on synthesized audio
    Lee, Kyogu
    Slaney, Malcolm
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2008, 16 (02): : 291 - 301
  • [36] Automatic English Stop Consonants Classification using Wavelet Analysis and Hidden Markov Models
    Kuehne, Marco
    Togneri, Roberto
    INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 637 - 640
  • [37] Automatic detection of alpine rockslides in continuous seismic data using hidden Markov models
    Dammeier, Franziska
    Moore, Jeffrey R.
    Hammer, Conny
    Haslinger, Florian
    Loew, Simon
    JOURNAL OF GEOPHYSICAL RESEARCH-EARTH SURFACE, 2016, 121 (02) : 351 - 371
  • [38] Automatic Analysis of Large-scale Nanopore Data Using Hidden Markov Models
    Zhang, Jianhua
    Liu, Xiuling
    IFAC PAPERSONLINE, 2020, 53 (02): : 16759 - 16766
  • [39] Modelling asynchrony in automatic speech recognition using loosely coupled hidden Markov models
    Nock, HJ
    Young, SJ
    COGNITIVE SCIENCE, 2002, 26 (03) : 283 - 301
  • [40] Automatic detection of snow avalanches in continuous seismic data using hidden Markov models
    Heck, Matthias
    Hammer, Conny
    van Herwijnen, Alec
    Schweizer, Juerg
    Faeh, Donat
    NATURAL HAZARDS AND EARTH SYSTEM SCIENCES, 2018, 18 (01) : 383 - 396