A Music Cognition-Guided Framework for Multi-pitch Estimation

被引:2
|
作者
Li, Xiaoquan [1 ]
Yan, Yijun [2 ]
Soraghan, John [1 ]
Wang, Zheng [3 ]
Ren, Jinchang [2 ]
机构
[1] Univ Strathclyde, Dept Elect & Elect Engn, Glasgow, Lanark, Scotland
[2] Robert Gordon Univ, Natl Subsea Ctr, Aberdeen AB21 0BH, Scotland
[3] Tianjin Univ, Coll Intelligence & Comp, Tianjin, Peoples R China
关键词
Music cognition; Automatic music transcription; Multi-pitch estimation; Harmonic structure detection (HSD); Polyphonic music detection; TRANSCRIPTION; NETWORK;
D O I
10.1007/s12559-022-10031-5
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
As one of the most important subtasks of automatic music transcription (AMT), multi-pitch estimation (MPE) has been studied extensively for predicting the fundamental frequencies in the frames of audio recordings during the past decade. However, how to use music perception and cognition for MPE has not yet been thoroughly investigated. Motivated by this, this demonstrates how to effectively detect the fundamental frequency and the harmonic structure of polyphonic music using a cognitive framework. Inspired by cognitive neuroscience, an integration of the constant Q transform and a state-of-the-art matrix factorization method called shift-invariant probabilistic latent component analysis (SI-PLCA) are proposed to resolve the polyphonic short-time magnitude log-spectra for multiple pitch estimation and source-specific feature extraction. The cognitions of rhythm, harmonic periodicity and instrument timbre are used to guide the analysis of characterizing contiguous notes and the relationship between fundamental frequency and harmonic frequencies for detecting the pitches from the outcomes of SI-PLCA. In the experiment, we compare the performance of proposed MPE system to a number of existing state-of-the-art approaches (seven weak learning methods and four deep learning methods) on three widely used datasets (i.e. MAPS, BACH10 and TRIOS) in terms of F-measure (F-1) values. The experimental results show that the proposed MPE method provides the best overall performance against other existing methods.
引用
下载
收藏
页码:23 / 35
页数:13
相关论文
共 50 条
  • [1] A Music Cognition–Guided Framework for Multi-pitch Estimation
    Xiaoquan Li
    Yijun Yan
    John Soraghan
    Zheng Wang
    Jinchang Ren
    Cognitive Computation, 2023, 15 : 23 - 35
  • [2] Multi-pitch estimation using harmonic music
    Christensen, Mads Graesboll
    Jakobsson, Andreas
    Jensen, Soren Holdt
    2006 FORTIETH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS AND COMPUTERS, VOLS 1-5, 2006, : 521 - +
  • [3] Multi-pitch estimation
    Christensen, Mads Graesboll
    Stoica, Petre
    Jakobsson, Andreas
    Jensen, Soren Holdt
    SIGNAL PROCESSING, 2008, 88 (04) : 972 - 983
  • [4] MULTI-PITCH ESTIMATION OF INHARMONIC SIGNALS
    Nilsson, Tommy
    Adalbjornsson, Stefan I.
    Butt, Naveed R.
    Jakobsson, Andreas
    2013 PROCEEDINGS OF THE 21ST EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2013,
  • [5] Comparing Deep Models and Evaluation Strategies for Multi-Pitch Estimation in Music Recordings
    Weiss, Christof
    Peeters, Geoffroy
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2022, 30 : 2814 - 2827
  • [6] MULTI-PITCH ESTIMATION USING SEMIDEFINITE PROGRAMMING
    Jensen, Tobias Lindstrom
    Vandenberghe, Lieven
    2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 4192 - 4196
  • [7] AN ADAPTIVE PENALTY APPROACH TO MULTI-PITCH ESTIMATION
    Kronvall, Ted
    Elvander, Filip
    Adalbjornsson, Stefan Ingi
    Jakobsson, Andreas
    2015 23RD EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2015, : 31 - 35
  • [8] Multi-pitch estimation for polyphonic musical signals
    Fernandez-Cid, P
    Casajus-Quiros, FJ
    PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-6, 1998, : 3565 - 3568
  • [9] Multi-pitch estimation with polyphony per instrument information for Western classical and electronic music
    Michael Taenzer
    EURASIP Journal on Audio, Speech, and Music Processing, 2025 (1)
  • [10] Multi-pitch estimation exploiting block sparsity
    Adalbjornsson, Stefan I.
    Jakobsson, Andreas
    Christensen, Mads G.
    SIGNAL PROCESSING, 2015, 109 : 236 - 247