END-TO-END MUSIC REMASTERING SYSTEM USING SELF-SUPERVISED AND ADVERSARIAL TRAINING

被引:1
|
作者
Koo, Junghyun [1 ]
Paik, Seungryeol [1 ]
Lee, Kyogu [1 ,2 ,3 ]
机构
[1] Seoul Natl Univ, Dept Intelligence & Informat, Mus & Audio Res Grp, Seoul, South Korea
[2] Seoul Natl Univ, AI Inst, Seoul, South Korea
[3] Seoul Natl Univ, Grad Sch AI, Seoul, South Korea
关键词
Intelligent music production; audio mastering; self-supervised learning; contrastive learning; adversarial training;
D O I
10.1109/ICASSP43922.2022.9746389
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Mastering is an essential step in music production, but it is also a challenging task that has to go through the hands of experienced audio engineers, where they adjust tone, space, and volume of a song. Remastering follows the same technical process, in which the context lies in mastering a song for the times. As these tasks have high entry barriers, we aim to lower the barriers by proposing an end-to-end music remastering system that transforms the mastering style of input audio to that of the target. The system is trained in a self-supervised manner, in which released pop songs were used for training. We also anticipated the model to generate realistic audio reflecting the reference's mastering style by applying a pre-trained encoder and a projection discriminator. We validate our results with quantitative metrics and a subjective listening test and show that the model generated samples of mastering style similar to the target.
引用
收藏
页码:4608 / 4612
页数:5
相关论文
共 50 条
  • [1] Investigating Self-supervised Pre-training for End-to-end Speech Translation
    Ha Nguyen
    Bougares, Fethi
    Tomashenko, Natalia
    Esteve, Yannick
    Besacier, Laurent
    [J]. INTERSPEECH 2020, 2020, : 1466 - 1470
  • [2] Self-supervised end-to-end graph local clustering
    Zhe Yuan
    [J]. World Wide Web, 2023, 26 : 1157 - 1179
  • [3] Self-supervised end-to-end graph local clustering
    Yuan, Zhe
    [J]. WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS, 2023, 26 (03): : 1157 - 1179
  • [4] Self-Supervised Representations Improve End-to-End Speech Translation
    Wu, Anne
    Wang, Changhan
    Pino, Juan
    Gu, Jiatao
    [J]. INTERSPEECH 2020, 2020, : 1491 - 1495
  • [5] Geometric Consistency for Self-Supervised End-to-End Visual Odometry
    Iyer, Ganesh
    Murthy, J. Krishna
    Gupta, Gunshi
    Krishna, K. Madhava
    Paull, Liam
    [J]. PROCEEDINGS 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2018, : 380 - 388
  • [6] AN EXPLORATION OF SELF-SUPERVISED PRETRAINED REPRESENTATIONS FOR END-TO-END SPEECH RECOGNITION
    Chang, Xuankai
    Maekaku, Takashi
    Guo, Pengcheng
    Shi, Jing
    Lu, Yen-Ju
    Subramanian, Aswin Shanmugam
    Wang, Tianzi
    Yang, Shu-wen
    Tsao, Yu
    Lee, Hung-yi
    Watanabe, Shinji
    [J]. 2021 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU), 2021, : 228 - 235
  • [7] ActiveStereoNet: End-to-End Self-supervised Learning for Active Stereo Systems
    Zhang, Yinda
    Khamis, Sameh
    Rhemann, Christoph
    Valentin, Julien
    Kowdle, Adarsh
    Tankovich, Vladimir
    Schoenberg, Michael
    Izadi, Shahram
    Funkhouser, Thomas
    Fanello, Sean
    [J]. COMPUTER VISION - ECCV 2018, PT VIII, 2018, 11212 : 802 - 819
  • [8] CONTINUAL SELF-SUPERVISED DOMAIN ADAPTATION FOR END-TO-END SPEAKER DIARIZATION
    Coria, Juan M.
    Bredin, Herve
    Ghannay, Sahar
    Rosset, Sophie
    [J]. 2022 IEEE SPOKEN LANGUAGE TECHNOLOGY WORKSHOP, SLT, 2022, : 626 - 632
  • [9] An End-to-End Contrastive Self-Supervised Learning Framework for Language Understanding
    Fang, Hongchao
    Xie, Pengtao
    [J]. TRANSACTIONS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, 2022, 10 : 1324 - 1340
  • [10] SELF-SUPERVISED ADVERSARIAL TRAINING
    Chen, Kejiang
    Chen, Yuefeng
    Zhou, Hang
    Mao, Xiaofeng
    Li, Yuhong
    He, Yuan
    Xue, Hui
    Zhang, Weiming
    Yu, Nenghai
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 2218 - 2222