Singing Voice Detection Based on Convolutional Neural Networks

被引:0
|
作者
Huang, Hong-Ming [1 ]
Chen, Woei-Kae [2 ]
Liu, Chien-Hung [2 ]
You, Shingchern D. [2 ]
机构
[1] Synology Inc, Taipei, Taiwan
[2] Natl Taipei Univ Technol, Dept Comp Sci & Info Engn, Taipei, Taiwan
关键词
singing voice detection; DFT; MFCC; convolutional neural networks;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This paper investigates various structures of convolutional neural networks (CNN) for singing voice detection. The input features are MFCC (mel-frequency cepstrum coefficients), DFT (discrete Fourier transform) coefficients, and raw PCM samples. The simulation results show that DFT coefficients yields higher detection accuracy, up to 92%.
引用
收藏
页码:223 / 226
页数:4
相关论文
共 50 条
  • [1] Exploring Channel Properties to Improve Singing Voice Detection with Convolutional Neural Networks
    Gui, Wenming
    Li, Yukun
    Zang, Xian
    Zhang, Jinglan
    [J]. APPLIED SCIENCES-BASEL, 2021, 11 (24):
  • [2] Joint Detection and Classification of Singing Voice Melody Using Convolutional Recurrent Neural Networks
    Kum, Sangeun
    Nam, Juhan
    [J]. APPLIED SCIENCES-BASEL, 2019, 9 (07):
  • [3] SINGING VOICE DETECTION WITH DEEP RECURRENT NEURAL NETWORKS
    Leglaive, Simon
    Hennequin, Romain
    Badeau, Roland
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 121 - 125
  • [4] Convolutional Neural Networks for Pathological Voice Detection
    Wu, Huiyi
    Soraghan, John
    Lowit, Anja
    Di Caterina, Gaetano
    [J]. 2018 40TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY (EMBC), 2018, : 4784 - 4787
  • [5] FAST AND HIGH-QUALITY SINGING VOICE SYNTHESIS SYSTEM BASED ON CONVOLUTIONAL NEURAL NETWORKS
    Nakamura, Kazuhiro
    Takaki, Shinji
    Hashimoto, Kei
    Oura, Keiichiro
    Nankaku, Yoshihiko
    Tokuda, Keiichi
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 7239 - 7243
  • [6] Singing voice synthesis based on deep neural networks
    Nishimura, Masanari
    Hashimoto, Kei
    Oura, Keiichiro
    Nankaku, Yoshihiko
    Tokuda, Keiichi
    [J]. 17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 2478 - 2482
  • [7] Comparative study of singing voice detection based on deep neural networks and ensemble learning
    You, Shingchern D.
    Liu, Chien-Hung
    Chen, Woei-Kae
    [J]. HUMAN-CENTRIC COMPUTING AND INFORMATION SCIENCES, 2018, 8
  • [8] Multi-Band Multi-Resolution Fully Convolutional Neural Networks for Singing Voice Separation
    Grais, Emad M.
    Zhao, Fei
    Plumbley, Mark D.
    [J]. 28TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2020), 2021, : 261 - 265
  • [9] Detection of landslide based on convolutional neural networks
    Zhang, Heng
    Chen, Xiaohu
    Song, Zhizhong
    Zhan, Weijie
    Lei, Huiguang
    [J]. 2022 8TH INTERNATIONAL CONFERENCE ON HYDRAULIC AND CIVIL ENGINEERING: DEEP SPACE INTELLIGENT DEVELOPMENT AND UTILIZATION FORUM, ICHCE, 2022, : 736 - 739
  • [10] Resistor Detection Based on Convolutional Neural Networks
    Liu, Chun
    Shi, Yudeng
    [J]. 2017 IEEE 3RD INFORMATION TECHNOLOGY AND MECHATRONICS ENGINEERING CONFERENCE (ITOEC), 2017, : 91 - 94