An End-to-End Machine Learning System for Harmonic Analysis of Music

Cited by: 32
Authors
Ni, Yizhao [1]
McVicar, Matt [1]
Santos-Rodriguez, Raul [2]
De Bie, Tijl [1]
Affiliations
[1] Univ Bristol, Dept Engn Math, Intelligent Syst Lab, Bristol BS8 1UB, Avon, England
[2] Univ Carlos III Madrid, Signal Theory & Commun Dept, E-28903 Getafe, Spain
Funding
Engineering and Physical Sciences Research Council (EPSRC), UK
Keywords
Audio chord estimation; harmony progression analyzer (HPA); loudness-based chromagram; machine learning; meta-song evaluation; FEATURES; AUDIO;
DOI
10.1109/TASL.2012.2188516
Chinese Library Classification (CLC) number
O42 [Acoustics]
Subject classification codes
070206; 082403
Abstract
We present a new system for the harmonic analysis of popular musical audio. It is focused on chord estimation, although the proposed system additionally estimates the key sequence and bass notes. It is distinct from competing approaches in two main ways. First, it makes use of a new, improved chromagram representation of audio that takes the human perception of loudness into account. Second, it is the first system for joint estimation of chords, keys, and bass notes that is fully based on machine learning, requiring no expert knowledge to tune the parameters. This means that it will benefit from future increases in available annotated audio files, broadening its applicability to a wider range of genres. In all three evaluation scenarios, including a new one that allows evaluation on audio for which no complete ground truth annotation is available, the proposed system is shown to be faster, more memory efficient, and more accurate than the state of the art.
Pages: 1771-1783
Number of pages: 13