A COMPARATIVE STUDY OF ROBUSTNESS OF DEEP LEARNING APPROACHES FOR VAD

被引:0
|
作者
Tong, Sibo [1 ]
Gu, Hao [1 ]
Yu, Kai [1 ]
机构
[1] Shanghai Jiao Tong Univ, Dept Comp Sci & Engn, Shanghai Educ Commiss Intelligent Interact & Cogn, Key Lab, Shanghai, Peoples R China
关键词
VAD; Deep learning; Robustness; VOICE ACTIVITY DETECTION;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Voice activity detection (VAD) is an important step for real-world automatic speech recognition (ASR) systems. Deep learning approaches, such as DNN, RNN or CNN, have been widely used in model-based VAD. Although they have achieved success in practice, they are developed on different VAD tasks separately. Whilst VAD performance under noisy conditions, especially with unseen noise or very low SNR, are of great interest, there has no robustness comparison of different deep learning approaches so far. In this paper, to learn the robustness property, VAD models based on DNN, LSTM and CNN are thoroughly compared at both frame and segment level under various noisy conditions on Aurora 4, a commonly used speech corpus with rich noises. To improve the robustness of deep learning based VAD models, a new noise-aware training (NAT) approach is also proposed. Experiments show that LSTM-based VAD is most robust but the performance degrades dramatically in the conditions with unseen noise or diverse SNR. By incorporating NAT, significant performance gains can be obtained in these conditions.
引用
收藏
页码:5695 / 5699
页数:5
相关论文
共 50 条
  • [1] The Robustness Study of Multiple Kernel Learning Approaches for VAD
    Zhang, Jie
    Wang, Mantao
    Tang, Haitao
    Huang, Qiang
    Pu, Haibo
    Luo, Lixin
    Zhou, Zhihao
    [J]. PROCEEDINGS OF THE 2018 8TH INTERNATIONAL CONFERENCE ON MANAGEMENT, EDUCATION AND INFORMATION (MEICI 2018), 2018, 163 : 757 - 763
  • [2] Authorship attribution in twitter: a comparative study of machine learning and deep learning approaches
    Rebeh Imane Ammar Aouchiche
    Fatima Boumahdi
    Mohamed Abdelkarim Remmide
    Amina Madani
    [J]. International Journal of Information Technology, 2024, 16 (5) : 3303 - 3310
  • [3] A Comparative Study of Deep Learning Approaches to Rooftop Detection in Aerial Images
    Cai, Yuwei
    He, Hongjie
    Yang, Ke
    Fatholahi, Sarah Narges
    Ma, Lingfei
    Xu, Linlin
    Li, Jonathan
    [J]. CANADIAN JOURNAL OF REMOTE SENSING, 2021, 47 (03) : 413 - 431
  • [4] A Comparative Study of Conventional and Deep Learning Approaches for Demosaicing Mastcam Images
    Kwan, Chiman
    Chou, Bryan
    [J]. SIGNAL PROCESSING, SENSOR/INFORMATION FUSION, AND TARGET RECOGNITION XXVIII, 2019, 11018
  • [5] Comparative Study between Traditional Machine Learning and Deep Learning Approaches for Text Classification
    Kamath, Cannannore Nidhi
    Bukhari, Syed Saqib
    Dengel, Andreas
    [J]. PROCEEDINGS OF THE ACM SYMPOSIUM ON DOCUMENT ENGINEERING (DOCENG 2018), 2018,
  • [6] A comparative evaluation of deep learning approaches for ophthalmology
    de Souza, Waldir Rodrigues, Jr.
    Linde, Glenn
    Hong, Sheng Chiong
    Chalakkal, Renoh
    [J]. CLINICAL AND EXPERIMENTAL OPHTHALMOLOGY, 2023, 51 (09): : 1011 - 1011
  • [7] Deep learning for cyber security intrusion detection: Approaches, datasets, and comparative study
    Ferrag, Mohamed Amine
    Maglaras, Leandros
    Moschoyiannis, Sotiris
    Janicke, Helge
    [J]. JOURNAL OF INFORMATION SECURITY AND APPLICATIONS, 2020, 50
  • [8] Electric Vehicle Charging Load Forecasting: A Comparative Study of Deep Learning Approaches
    Zhu, Juncheng
    Yang, Zhile
    Mourshed, Monjur
    Guo, Yuanjun
    Zhou, Yimin
    Chang, Yan
    Wei, Yanjie
    Feng, Shengzhong
    [J]. ENERGIES, 2019, 12 (14)
  • [9] The Comparative Study of Deep Learning Neural Network Approaches for Breast Cancer Diagnosis
    Nasir, Haslinah Mohd
    Brahin, Noor Mohd Ariff
    Zainuddin, Suraya
    Mispan, Mohd Syafiq
    Isa, Ida Syafiza Md
    Sha'abani, Mohd Nurul Al Hafiz
    [J]. INTERNATIONAL JOURNAL OF ONLINE AND BIOMEDICAL ENGINEERING, 2023, 19 (06) : 127 - 140
  • [10] Deep vs. Shallow: A Comparative Study of Machine Learning and Deep Learning Approaches for Fake Health News Detection
    Mahara, Tripti
    Josephine, V. L. Helen
    Srinivasan, Rashmi
    Prakash, Poorvi
    Algarni, Abeer D. D.
    Verma, Om Prakash
    [J]. IEEE ACCESS, 2023, 11 : 79330 - 79340