Singing Voice Detection Based on Convolutional Neural Networks

Authors
Huang, Hong-Ming [1]
Chen, Woei-Kae [2]
Liu, Chien-Hung [2]
You, Shingchern D. [2]
Affiliations
[1] Synology Inc, Taipei, Taiwan
[2] National Taipei University of Technology, Department of Computer Science and Information Engineering, Taipei, Taiwan
Keywords
singing voice detection; DFT; MFCC; convolutional neural networks
DOI
Not available
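Chinese Library Classification (CLC) number
TM [Electrical engineering]; TN [Electronic technology, communication technology]
Discipline classification code
0808; 0809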
Abstract
This paper investigates various convolutional neural network (CNN) structures for singing voice detection. Three types of input features are considered: MFCCs (mel-frequency cepstral coefficients), DFT (discrete Fourier transform) coefficients, and raw PCM samples. The simulation results show that DFT coefficients yield higher detection accuracy, up to 92%.
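As an illustration of what DFT-coefficient input could look like, the following is a minimal sketch that frames a PCM sample sequence and computes per-frame DFT magnitudes, giving a 2-D (frames x bins) map of the kind a CNN might consume. It is not the authors' pipeline: the function names, the frame length of 64, the hop of 32, and the lack of a window function are all assumptions made for illustration, since the record above does not state these details.

```python
import cmath
import math


def frame_signal(samples, frame_len, hop):
    """Split a PCM sample sequence into overlapping frames (incomplete tail is dropped)."""
    return [samples[i:i + frame_len]
            for i in range(0, len(samples) - frame_len + 1, hop)]


def dft_magnitudes(frame):
    """Magnitudes of the non-redundant DFT bins (0 .. N/2) of one real-valued frame."""
    n = len(frame)
    mags = []
    for k in range(n // 2 + 1):
        # Direct O(N^2) DFT sum; an FFT would be used in practice.
        acc = sum(x * cmath.exp(-2j * math.pi * k * t / n)
                  for t, x in enumerate(frame))
        mags.append(abs(acc))
    return mags


def dft_feature_map(samples, frame_len=64, hop=32):
    """Stack per-frame DFT magnitudes into a 2-D (frames x bins) feature map.

    frame_len and hop are hypothetical values, not taken from the paper.
    """
    return [dft_magnitudes(f) for f in frame_signal(samples, frame_len, hop)]
```

For a real-valued frame of N samples only bins 0 through N/2 are kept, because the remaining bins mirror them. A classifier would then label each frame, or each group of frames, as singing or non-singing.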
Pages: 223-226
Number of pages: 4