PERCEPTUAL MUSICAL SIMILARITY METRIC LEARNING WITH GRAPH NEURAL NETWORKS

被引:0
|
作者
Vahidi, Cyrus [1 ]
Singh, Shubhr [1 ]
Benetos, Emmanouil [1 ]
Phan, Huy [2 ]
Stowell, Dan [3 ]
Fazekas, Gyorgy [1 ]
Lagrange, Mathieu [4 ]
机构
[1] Queen Mary Univ London, Ctr Digital Mus, London, England
[2] Amazon Alexa, Cambridge, MA USA
[3] Tilburg Univ, Bijsterveldenlaan, Tilburg, Netherlands
[4] Nantes Univ, CNRS, Ecole Cent Nantes, LS2N, Nantes, France
基金
英国科研创新办公室; 英国工程与自然科学研究理事会;
关键词
auditory similarity; content-based music retrieval; graph neural networks; metric learning;
D O I
10.1109/WASPAA58266.2023.10248151
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Sound retrieval for assisted music composition depends on evaluating similarity between musical instrument sounds, which is partly influenced by playing techniques. Previous methods utilizing Euclidean nearest neighbours over acoustic features show some limitations in retrieving sounds sharing equivalent timbral properties, but potentially generated using a different instrument, playing technique, pitch or dynamic. In this paper, we present a metric learning system designed to approximate human similarity judgments between extended musical playing techniques using graph neural networks. Such structure is a natural candidate for solving similarity retrieval tasks, yet have seen little application in modelling perceptual music similarity. We optimize a Graph Convolutional Network (GCN) over acoustic features via a proxy metric learning loss to learn embeddings that reflect perceptual similarities. Specifically, we construct the graph's adjacency matrix from the acoustic data manifold with an example-wise adaptive k-nearest neighbourhood graph: Adaptive Neighbourhood Graph Neural Network (AN-GNN). Our approach achieves 96.4% retrieval accuracy compared to 38.5% with a Euclidean metric and 86.0% with a multilayer perceptron (MLP), while effectively considering retrievals from distinct playing techniques to the query example.
引用
收藏
页数:5
相关论文
共 50 条
  • [21] Learning musical structure and style with neural networks
    Hörnel, D
    Menzel, W
    COMPUTER MUSIC JOURNAL, 1998, 22 (04) : 44 - 62
  • [22] Privacy-Preserved Neural Graph Similarity Learning
    Hou, Yupeng
    Zhao, Wayne Xin
    Li, Yaliang
    Wen, Ji-Rong
    2022 IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2022, : 191 - 200
  • [23] Graph Structure Learning for Robust Graph Neural Networks
    Jin, Wei
    Ma, Yao
    Liu, Xiaorui
    Tang, Xianfeng
    Wang, Suhang
    Tang, Jiliang
    KDD '20: PROCEEDINGS OF THE 26TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2020, : 66 - 74
  • [24] Graph Neural Networks for Brain Graph Learning: A Survey
    Luo, Xuexiong
    Wu, Jia
    Yang, Jian
    Xue, Shan
    Beheshti, Amin
    Sheng, Quan Z.
    McAlpine, David
    Sowman, Paul
    Giral, Alexis
    Yu, Philip S.
    PROCEEDINGS OF THE THIRTY-THIRD INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2024, 2024, : 8170 - 8178
  • [25] Heterogeneous Graph Structure Learning for Graph Neural Networks
    Zhao, Jianan
    Wang, Xiao
    Shi, Chuan
    Hu, Binbin
    Song, Guojie
    Ye, Yanfang
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 4697 - 4705
  • [26] Learning Graph Neural Networks with Deep Graph Library
    Zheng, Da
    Wang, Minjie
    Gan, Quan
    Zhang, Zheng
    Karypis, George
    WWW'20: COMPANION PROCEEDINGS OF THE WEB CONFERENCE 2020, 2020, : 305 - 306
  • [27] Learning graph edit distance by graph neural networks
    Riba, Pau
    Fischer, Andreas
    Llados, Josep
    Fornes, Alicia
    PATTERN RECOGNITION, 2021, 120
  • [28] Shift-Tolerant Perceptual Similarity Metric
    Ghildyal, Abhijay
    Liu, Feng
    COMPUTER VISION - ECCV 2022, PT XVIII, 2022, 13678 : 91 - 107
  • [29] Perceptual Similarity Ranking of Temporal Heatmaps Using Convolutional Neural Networks
    Malik, Sana
    Kim, Sungchul
    Koh, Eunyee
    PROCEEDINGS OF THE 2018 WORKSHOP ON UNDERSTANDING SUBJECTIVE ATTRIBUTES OF DATA, WITH THE FOCUS ON EVOKED EMOTIONS (EE-USAD'18), 2018, : 25 - 31
  • [30] Learning Question Similarity with Recurrent Neural Networks
    Ye, Borui
    Feng, Guangyu
    Cheriton, David R.
    Cui, Anqi
    Li, Ming
    2017 IEEE INTERNATIONAL CONFERENCE ON BIG KNOWLEDGE (IEEE ICBK 2017), 2017, : 111 - 118