Rhythm Transcription of Polyphonic Piano Music Based on Merged-Output HMM for Multiple Voices

被引:20
|
作者
Nakamura, Eita [1 ]
Yoshii, Kazuyoshi [1 ]
Sagayama, Shigeki [2 ]
机构
[1] Kyoto Univ, Grad Sch Informat, Kyoto 6068501, Japan
[2] Meiji Univ, Grad Sch Adv Math Sci, Tokyo 1648525, Japan
基金
日本学术振兴会;
关键词
Hidden Markov models; model for polyphonic music scores; music performance model; rhythm transcription; statistical music language model; TEMPO TRACKING; PERFORMANCE; INFORMATION; MODEL;
D O I
10.1109/TASLP.2017.2662479
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In a recent conference paper, we have reported a rhythm transcription method based on a merged-output hidden Markov model (HMM) that explicitly describes the multiple-voice structure of polyphonic music. This model solves a major problem of conventional methods that could not properly describe the nature of multiple voices as in polyrhythmic scores or in the phenomenon of loose synchrony between voices. In this paper, we present a complete description of the proposed model and develop an inference technique, which is valid for any merged-output HMMs, for which output probabilities depend on past events. We also examine the influence of the architecture and parameters of the method in terms of accuracies of rhythm transcription and voice separation and perform comparative evaluations with six other algorithms. Using MIDI recordings of classical piano pieces, we found that the proposed model outperformed other methods by more than 12 points in the accuracy for polyrhythmic performances and performed almost as good as the best one for non-polyrhythmic performances. This reveals the state-of-the-art methods of rhythm transcription for the first time in the literature. Publicly available source codes are also provided for future comparisons.
引用
收藏
页码:794 / 806
页数:13
相关论文
共 39 条
  • [1] Event based transcription system for polyphonic piano music
    Costantini, Giovanni
    Perfetti, Renzo
    Todisco, Massimiliano
    [J]. SIGNAL PROCESSING, 2009, 89 (09) : 1798 - 1811
  • [2] Automatic transcription of piano polyphonic music
    Kobzantsev, A
    Chazan, D
    Zeevi, Y
    [J]. ISPA 2005: Proceedings of the 4th International Symposium on Image and Signal Processing and Analysis, 2005, : 414 - 418
  • [3] Transcription of polyphonic piano music with neural networks
    Marolt, M
    [J]. MELECON 2000: INFORMATION TECHNOLOGY AND ELECTROTECHNOLOGY FOR THE MEDITERRANEAN COUNTRIES, VOLS 1-3, PROCEEDINGS, 2000, : 512 - 515
  • [4] Polyphonic Piano Transcription with a Note-Based Music Language Model
    Wang, Qi
    Zhou, Ruohua
    Yan, Yonghong
    [J]. APPLIED SCIENCES-BASEL, 2018, 8 (03):
  • [5] NOTE ONSET DETECTION FOR THE TRANSCRIPTION OF POLYPHONIC PIANO MUSIC
    Boogaart, C. G. V. D.
    Lienhart, R.
    [J]. ICME: 2009 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOLS 1-3, 2009, : 446 - 449
  • [6] A connectionist approach to automatic transcription of polyphonic piano music
    Marolt, M
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2004, 6 (03) : 439 - 449
  • [7] Transcription of polyphonic piano music by means of memory-based classification method
    Costantini, Giovanni
    Todisco, Massimiliano
    Perfetti, Renzo
    [J]. NEURAL NETS WIRN09, 2009, 204 : 91 - 100
  • [8] On the Effect of Memory Width in Automatic Transcription Systems for Polyphonic Piano Music
    Costantini, Giovanni
    Todisco, Massimiliano
    Saggio, Giovanni
    [J]. IMCIC'11: THE 2ND INTERNATIONAL MULTI-CONFERENCE ON COMPLEXITY, INFORMATICS AND CYBERNETICS, VOL I, 2011, : 124 - 127
  • [9] An End-to-End Neural Network for Polyphonic Piano Music Transcription
    Sigtia, Siddharth
    Benetos, Emmanouil
    Dixon, Simon
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2016, 24 (05) : 927 - 939
  • [10] SVM Based Transcription System with Short-Term Memory Oriented to Polyphonic Piano Music
    Costantini, Giovanni
    Todisco, Massimiliano
    Perfetti, Renzo
    Basili, Roberto
    Casali, Daniele
    [J]. MELECON 2010: THE 15TH IEEE MEDITERRANEAN ELECTROTECHNICAL CONFERENCE, 2010, : 196 - 201