Contrastive Learning with Positive-Negative Frame Mask for Music Representation

被引:8
|
作者
Yao, Dong [1 ]
Zhao, Zhou [1 ]
Zhang, Shengyu [1 ]
Zhu, Jieming [2 ]
Zhu, Yudong [2 ]
Zhang, Rui [2 ]
He, Xiuqiang [2 ]
机构
[1] Zhejiang Univ, Hangzhou, Peoples R China
[2] Huawei Noahs Ark Lab, Hong Kong, Peoples R China
基金
浙江省自然科学基金; 中国国家自然科学基金;
关键词
Contrastive Learning; Music Representation; Representation Learning Attention;
D O I
10.1145/3485447.3512011
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Self-supervised learning, especially contrastive learning, has made an outstanding contribution to the development of many deep learning research fields. Recently, researchers in the acoustic signal processing field noticed its success and leveraged contrastive learning for better music representation. Typically, existing approaches maximize the similarity between two distorted audio segments sampled from the same music. In other words, they ensure a semantic agreement at the music level. However, those coarse-grained methods neglect some inessential or noisy elements at the frame level, which may be detrimental to the model to learn the effective representation of music. Towards this end, this paper proposes a novel Positive-nEgative frame mask for Music Representation based on the contrastive learning framework, abbreviated as PEMR. Concretely, PEMR incorporates a Positive-Negative Mask Generation module, which leverages transformer blocks to generate frame masks on Log-Mel spectrogram. We can generate self-augmented negative and positive samples by masking important components or inessential components, respectively. We devise a novel contrastive learning objective to accommodate both self-augmented positives/negatives sampled from the same music. We conduct experiments on four public datasets. The experimental results of two music-related downstream tasks, music classification and cover song identification, demonstrate the generalization ability and transferability of music representation learned by PEMR.
引用
收藏
页码:2906 / 2915
页数:10
相关论文
共 50 条
  • [1] Positive-negative equal contrastive loss for semantic segmentation
    Wang, Jing
    Li, Jiangyun
    Li, Wei
    Xuan, Lingfei
    Zhang, Tianxiang
    Wang, Wenxuan
    [J]. NEUROCOMPUTING, 2023, 535 : 13 - 24
  • [2] Theoretical Comparisons of Positive-Unlabeled Learning against Positive-Negative Learning
    Niu, Gang
    du Plessis, Marthinus C.
    Sakai, Tomoya
    Ma, Yao
    Sugiyama, Masashi
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 29 (NIPS 2016), 2016, 29
  • [3] Image Annotation based on Positive-Negative Instances Learning
    Zhang, Kai
    Hu, Jiwei
    Liu, Quan
    Lou, Ping
    [J]. NINTH INTERNATIONAL CONFERENCE ON DIGITAL IMAGE PROCESSING (ICDIP 2017), 2017, 10420
  • [4] THE POSITIVE-NEGATIVE DICHOTOMY IN SCHIZOPHRENIA
    MORTIMER, AM
    LUND, CE
    MCKENNA, PJ
    [J]. BRITISH JOURNAL OF PSYCHIATRY, 1990, 157 : 41 - 49
  • [5] Positive-Negative Stiffness Actuators
    Braun, David J.
    Chalvet, Vincent
    Dahiya, Abhinav
    [J]. IEEE TRANSACTIONS ON ROBOTICS, 2019, 35 (01) : 162 - 173
  • [6] A POSITIVE-NEGATIVE PRESSURE RESPIRATOR
    SMITHCLARKE, GT
    GALPINE, JF
    [J]. LANCET, 1955, 1 (JUN25): : 1299 - 1300
  • [7] Contrastive sentence representation learning with adaptive false negative cancellation
    Xu, Lingling
    Xie, Haoran
    Wang, Fu Lee
    Tao, Xiaohui
    Wang, Weiming
    Li, Qing
    [J]. INFORMATION FUSION, 2024, 102
  • [8] Hard Negative Sample Mining for Contrastive Representation in Reinforcement Learning
    Chen, Qihang
    Liang, Dayang
    Liu, Yunlong
    [J]. ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2022, PT II, 2022, 13281 : 277 - 288
  • [9] POSITIVE-NEGATIVE SYNDROME SCALE FOR SCHIZOPHRENICS
    FISZBEIN, A
    KAY, SR
    OPER, LA
    [J]. INTERNATIONAL JOURNAL OF NEUROSCIENCE, 1986, 31 (1-4) : 286 - 286
  • [10] Ketamine: a positive-negative anaesthetic agent
    Kakazu, C.
    Lippmann, M.
    Hsu, D.
    [J]. BRITISH JOURNAL OF ANAESTHESIA, 2016, 117 (02) : 267 - 267