Automatic Label Calibration for Singing Annotation Using Fully Convolutional Neural Network

被引:4
|
作者
Fu, Xiao [1 ]
Deng, Hangyu [1 ]
Hu, Jinglu [1 ]
机构
[1] Waseda Univ, Grad Sch Informat & Prod & Syst, 2-7 Hibikino, Kitakyushu, Fukuoka 8080135, Japan
关键词
music information retrieval; label calibration; singing annotation; convolutional neural network; TRANSCRIPTION; GAME; GO;
D O I
10.1002/tee.23804
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Accurately-labeled data is crucial for the training of machine learning models. For singing-related tasks in the music information retrieval field, accurately-labeled data is limited because annotating singing is time-consuming. Several studies create vocal datasets using a two-step annotation method which creates coarse labels first and then executes a manual calibration procedure. However, manually calibrating coarsely-labeled singing data is expensive and time-consuming. To address this problem, in this study we propose a singing-label calibration framework, which aims to automatically calibrate the coarsely-labeled singing data with higher accuracy. This framework contains a data augmentation method to generate training and testing data, a reasonable data preprocessing method to handle music audio and symbolic labels, a fully-convolutional neural network to estimate the difference between coarse labels and accurate labels, and a novel calibration function to correct the coarse labels. Various experiments are conducted to examine the effect of our research. The results show that our model can highly reduce the cost time and slightly increase the labeling accuracy of the manual calibration process. (C) 2023 Institute of Electrical Engineers of Japan. Published by Wiley Periodicals LLC.
引用
收藏
页码:945 / 952
页数:8
相关论文
共 50 条
  • [11] An Improved Automatic Image Annotation Approach Using Convolutional Neural Network-Slantlet Transform
    Adnan, Myasar Mundher
    Rahim, Mohd Shafry Mohd
    Khan, Amjad Rehman
    Saba, Tanzila
    Fati, Suliman Mohamed
    Bahaj, Saeed Ali
    IEEE ACCESS, 2022, 10 : 7520 - 7532
  • [12] Fully Automatic Quantitative Measurement of Equilibrium Radionuclide Angiocardiography Using a Convolutional Neural Network
    Ha, Sejin
    Seo, Seung Yeon
    Park, Byung Soo
    Han, Sangwon
    Oh, Jungsu S.
    Chae, Sun Young
    Kim, Jae Seung
    Moon, Dae Hyuk
    CLINICAL NUCLEAR MEDICINE, 2024, 49 (08) : 727 - 732
  • [13] Automatic image annotation method based on a convolutional neural network with threshold optimization
    Cao, Jianfang
    Zhao, Aidi
    Zhang, Zibang
    PLOS ONE, 2020, 15 (09):
  • [14] A Lightweight Fully Convolutional Neural Network for SAR Automatic Target Recognition
    Yu, Jimin
    Zhou, Guangyu
    Zhou, Shangbo
    Yin, Jiajun
    REMOTE SENSING, 2021, 13 (15)
  • [15] Blind inpainting using the fully convolutional neural network
    Nian Cai
    Zhenghang Su
    Zhineng Lin
    Han Wang
    Zhijing Yang
    Bingo Wing-Kuen Ling
    The Visual Computer, 2017, 33 : 249 - 261
  • [16] Blind inpainting using the fully convolutional neural network
    Cai, Nian
    Su, Zhenghang
    Lin, Zhineng
    Wang, Han
    Yang, Zhijing
    Ling, Bingo Wing-Kuen
    VISUAL COMPUTER, 2017, 33 (02): : 249 - 261
  • [17] Automatic Modulation Classification Using Contrastive Fully Convolutional Network
    Huang, Sai
    Jiang, Yizhou
    Gao, Yue
    Feng, Zhiyong
    Zhang, Ping
    IEEE WIRELESS COMMUNICATIONS LETTERS, 2019, 8 (04) : 1044 - 1047
  • [18] Transfer Learning Using Musical Instrument Audio for Improving Automatic Singing Label Calibration
    Fu, Xiao
    Rui, Xijian
    Deng, Hangyu
    Hu, Jinglu
    IEEJ TRANSACTIONS ON ELECTRICAL AND ELECTRONIC ENGINEERING, 2024, 19 (05) : 707 - 715
  • [19] Automatic Document Classification Using Convolutional Neural Network
    Sun, Xingping
    Li, Yibing
    Kang, Hongwei
    Shen, Yong
    2018 INTERNATIONAL SEMINAR ON COMPUTER SCIENCE AND ENGINEERING TECHNOLOGY (SCSET 2018), 2019, 1176
  • [20] Improving loss function for deep convolutional neural network applied in automatic image annotation
    Salar, Ali
    Ahmadi, Ali
    VISUAL COMPUTER, 2024, 40 (03): : 1617 - 1629