Real-Time Audio-to-Score Alignment Using Particle Filter for Coplayer Music Robots

被引:0
|
作者
Takuma Otsuka
Kazuhiro Nakadai
Toru Takahashi
Tetsuya Ogata
HiroshiG Okuno
机构
[1] Kyoto University,Graduate School of Informatics
[2] Honda Research Institute Japan,Graduate School of Information Science and Engineering
[3] Co.,undefined
[4] Ltd.,undefined
[5] Tokyo Institute of Technology,undefined
关键词
Probability Distribution; Information Technology; Tempo; Quantum Information; Temporal Fluctuation;
D O I
暂无
中图分类号
学科分类号
摘要
Our goal is to develop a coplayer music robot capable of presenting a musical expression together with humans. Although many instrument-performing robots exist, they may have difficulty playing with human performers due to the lack of the synchronization function. The robot has to follow differences in humans' performance such as temporal fluctuations to play with human performers. We classify synchronization and musical expression into two levels: (1) melody level and (2) rhythm level to cope with erroneous synchronizations. The idea is as follows: When the synchronization with the melody is reliable, respond to the pitch the robot hears, when the synchronization is uncertain, try to follow the rhythm of the music. Our method estimates the score position for the melody level and the tempo for the rhythm level. The reliability of the score position estimation is extracted from the probability distribution of the score position. The experimental results demonstrate that our method outperforms the existing score following system in 16 songs out of 20 polyphonic songs. The error in the prediction of the score position is reduced by 69% on average. The results also revealed that the switching mechanism alleviates the error in the estimation of the score position.
引用
收藏
相关论文
共 50 条
  • [1] Real-Time Audio-to-Score Alignment Using Particle Filter for Coplayer Music Robots
    Otsuka, Takuma
    Nakadai, Kazuhiro
    Takahashi, Toru
    Ogata, Tetsuya
    Okuno, Hiroshi G.
    EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING, 2011,
  • [2] Real-Time Audio-to-Score Alignment of Music Performances Containing Errors and Arbitrary Repeats and Skips
    Nakamura, Tomohiko
    Nakamura, Eita
    Sagayama, Shigeki
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2016, 24 (02) : 329 - 339
  • [3] Audio-to-Score Alignment Using Deep Automatic Music Transcription
    Simonetta, Federico
    Ntalampiras, Stavros
    Avanzini, Federico
    IEEE MMSP 2021: 2021 IEEE 23RD INTERNATIONAL WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING (MMSP), 2021,
  • [4] Real-Time Audio-to-Score Alignment of Singing Voice Based on Melody and Lyric Information
    Gong, Rong
    Cuvillier, Philippe
    Obin, Nicolas
    Cont, Arshia
    16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 3312 - 3316
  • [5] Parallel online time warping for real-time audio-to-score alignment in multi-core systems
    Pedro Alonso
    Raquel Cortina
    F. J. Rodríguez-Serrano
    P. Vera-Candeas
    M. Alonso-González
    José Ranilla
    The Journal of Supercomputing, 2017, 73 : 126 - 138
  • [6] COHERENT TIME MODELING OF SEMI-MARKOV MODELS WITH APPLICATION TO REAL-TIME AUDIO-TO-SCORE ALIGNMENT
    Cuvillier, Philippe
    Cont, Arshia
    2014 IEEE INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP), 2014,
  • [7] Parallel online time warping for real-time audio-to-score alignment in multi-core systems
    Alonso, Pedro
    Cortina, Raquel
    Rodriguez-Serrano, F. J.
    Vera-Candeas, P.
    Alonso-Gonzalez, M.
    Ranilla, Jose
    JOURNAL OF SUPERCOMPUTING, 2017, 73 (01): : 126 - 138
  • [8] A UNIFIED APPROACH TO REAL TIME AUDIO-TO-SCORE AND AUDIO-TO-AUDIO ALIGNMENT USING SEQUENTIAL MONTE CARLO INFERENCE TECHNIQUES
    Montecchio, Nicola
    Cont, Arshia
    2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 193 - 196
  • [9] An Active Learning Approach to Audio-to-Score Alignment Using Dynamic Time Warping
    Chuan, Ching-Hua
    2016 15TH IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA 2016), 2016, : 796 - 799
  • [10] Robust on-line algorithm for real-time audio-to-score alignment based on a delayed decision and anticipation framework
    Yamamoto, Ryuichi
    Sako, Shinji
    Kitamura, Tadashi
    ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, 2013, : 191 - 195