Automatic Label Calibration for Singing Annotation Using Fully Convolutional Neural Network

Cited by: 4
Authors
Fu, Xiao [1]
Deng, Hangyu [1]
Hu, Jinglu [1]
Affiliations
[1] Waseda Univ, Grad Sch Informat & Prod & Syst, 2-7 Hibikino, Kitakyushu, Fukuoka 8080135, Japan
Keywords
music information retrieval; label calibration; singing annotation; convolutional neural network; TRANSCRIPTION; GAME; GO
DOI
10.1002/tee.23804
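Chinese Library Classification
TM [Electrical engineering]; TN [Electronic technology, communication technology]
Discipline classification codes
0808; 0809
Abstract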
Accurately labeled data is crucial for training machine learning models. For singing-related tasks in music information retrieval, accurately labeled data is scarce because annotating singing is time-consuming. Several studies create vocal datasets using a two-step annotation method that first creates coarse labels and then applies a manual calibration procedure. However, manually calibrating coarsely labeled singing data is expensive and time-consuming. To address this problem, we propose a singing-label calibration framework that automatically calibrates coarsely labeled singing data with higher accuracy. The framework comprises a data augmentation method to generate training and testing data, a data preprocessing method to handle music audio and symbolic labels, a fully convolutional neural network to estimate the difference between coarse labels and accurate labels, and a novel calibration function to correct the coarse labels. Various experiments are conducted to evaluate the framework. The results show that our model can substantially reduce the time cost and slightly increase the labeling accuracy of the manual calibration process. © 2023 Institute of Electrical Engineers of Japan. Published by Wiley Periodicals LLC.
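The idea of estimating the difference between coarse and accurate labels and then correcting the coarse labels can be illustrated with a minimal sketch. Everything named here is an assumption made for illustration and is not taken from the paper: the `Note` structure, the random-shift augmentation in `make_coarse`, and the subtractive rule in `calibrate`. The fully convolutional network itself is not modeled; in its place, the example uses oracle differences computed from the known accurate labels.

```python
import random
from dataclasses import dataclass


@dataclass(frozen=True)
class Note:
    """Hypothetical note label: boundaries in seconds, pitch as a MIDI number."""
    onset: float
    offset: float
    pitch: int


def make_coarse(notes, max_shift=0.1, seed=0):
    """Simulate coarse labels by shifting each boundary by a random amount.

    This is an assumed augmentation scheme, not the one from the paper.
    """
    rng = random.Random(seed)
    coarse = []
    for n in notes:
        onset = max(0.0, n.onset + rng.uniform(-max_shift, max_shift))
        offset = max(onset, n.offset + rng.uniform(-max_shift, max_shift))
        coarse.append(Note(onset, offset, n.pitch))
    return coarse


def calibrate(coarse, onset_diffs, offset_diffs):
    """Correct coarse boundaries by subtracting estimated (coarse - accurate) differences.

    In a real system the differences would come from a trained model.
    """
    if not (len(coarse) == len(onset_diffs) == len(offset_diffs)):
        raise ValueError("one onset and one offset difference per note is required")
    calibrated = []
    for n, d_on, d_off in zip(coarse, onset_diffs, offset_diffs):
        onset = max(0.0, n.onset - d_on)
        offset = max(onset, n.offset - d_off)  # keep offset at or after onset
        calibrated.append(Note(onset, offset, n.pitch))
    return calibrated


# Accurate labels (made-up values) and their simulated coarse versions.
accurate = [Note(0.50, 1.00, 60), Note(1.20, 1.80, 62), Note(2.00, 2.90, 64)]
coarse = make_coarse(accurate)

# Oracle differences stand in for the network's estimates.
onset_diffs = [c.onset - a.onset for c, a in zip(coarse, accurate)]
offset_diffs = [c.offset - a.offset for c, a in zip(coarse, accurate)]

calibrated = calibrate(coarse, onset_diffs, offset_diffs)
```

With oracle differences the calibrated labels match the accurate ones up to floating-point error; with a learned estimator, the residual error would depend on how well the model predicts the differences.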
Pages: 945-952
Number of pages: 8