A COMPARATIVE STUDY OF ROBUSTNESS OF DEEP LEARNING APPROACHES FOR VAD

被引:0
|
作者
Tong, Sibo [1 ]
Gu, Hao [1 ]
Yu, Kai [1 ]
机构
[1] Shanghai Jiao Tong Univ, Dept Comp Sci & Engn, Shanghai Educ Commiss Intelligent Interact & Cogn, Key Lab, Shanghai, Peoples R China
关键词
VAD; Deep learning; Robustness; VOICE ACTIVITY DETECTION;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Voice activity detection (VAD) is an important step for real-world automatic speech recognition (ASR) systems. Deep learning approaches, such as DNN, RNN or CNN, have been widely used in model-based VAD. Although they have achieved success in practice, they are developed on different VAD tasks separately. Whilst VAD performance under noisy conditions, especially with unseen noise or very low SNR, are of great interest, there has no robustness comparison of different deep learning approaches so far. In this paper, to learn the robustness property, VAD models based on DNN, LSTM and CNN are thoroughly compared at both frame and segment level under various noisy conditions on Aurora 4, a commonly used speech corpus with rich noises. To improve the robustness of deep learning based VAD models, a new noise-aware training (NAT) approach is also proposed. Experiments show that LSTM-based VAD is most robust but the performance degrades dramatically in the conditions with unseen noise or diverse SNR. By incorporating NAT, significant performance gains can be obtained in these conditions.
引用
收藏
页码:5695 / 5699
页数:5
相关论文
共 50 条
  • [21] A Comparative Analysis of Deep Learning based Vehicle Detection Approaches
    Singhal, Nikita
    Prasad, Lalji
    [J]. INTERNATIONAL JOURNAL OF NEXT-GENERATION COMPUTING, 2023, 14 (02): : 485 - 501
  • [22] Data preprocessing and feature selection techniques in gait recognition: A comparative study of machine learning and deep learning approaches
    Parashar, Anubha
    Parashar, Apoorva
    Ding, Weiping
    Shabaz, Mohammad
    Rida, Imad
    [J]. PATTERN RECOGNITION LETTERS, 2023, 172 : 65 - 73
  • [23] A Comparative Review of Sentimental Analysis Using Machine Learning and Deep Learning Approaches
    Nagelli, Archana
    Saleena, B.
    [J]. JOURNAL OF INFORMATION & KNOWLEDGE MANAGEMENT, 2023, 22 (03)
  • [24] A Comparative Study of Deep Learning-based Depth Estimation Approaches: Application to Smart Mobility
    Mauri, Antoine
    Khemmar, Redouane
    Decoux, Benoit
    Benmoumen, Tahar
    Haddad, Madjid
    Boutteau, Remi
    [J]. 2021 8TH INTERNATIONAL CONFERENCE ON SMART COMPUTING AND COMMUNICATIONS (ICSCC), 2021, : 80 - 84
  • [25] A Comparative Study of Fault Diagnosis for Train Door System: Traditional versus Deep Learning Approaches
    Ham, Seokju
    Han, Seok-Youn
    Kim, Seokgoo
    Park, Hyung Jun
    Park, Kee-Jun
    Choi, Joo-Ho
    [J]. SENSORS, 2019, 19 (23)
  • [26] A Comparative Study of Speaker Role Identification in Air Traffic Communication Using Deep Learning Approaches
    Guo, Dongyue
    Zhang, Jianwei
    Yang, Bo
    Lin, Yi
    [J]. ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2023, 22 (04)
  • [27] Vocation, motivation and approaches to learning: a comparative study
    Luis Arquero, Jose
    Fernandez-Polvillo, Carmen
    Hassall, Trevor
    Joyce, John
    [J]. EDUCATION AND TRAINING, 2015, 57 (01): : 13 - 30
  • [28] Cybersecurity Deep: Approaches, Attacks Dataset, and Comparative Study
    Barik, Kousik
    Misra, Sanjay
    Konar, Karabi
    Fernandez-Sanz, Luis
    Murat, Koyuncu
    [J]. APPLIED ARTIFICIAL INTELLIGENCE, 2022, 36 (01)
  • [29] Learning Design Approaches for Designing Learning Environments : A comparative study
    Battou, Amal
    Baz, Omar
    Mammass, Driss
    [J]. PROCEEDINGS OF 2016 5TH INTERNATIONAL CONFERENCE ON MULTIMEDIA COMPUTING AND SYSTEMS (ICMCS), 2016, : 206 - 211
  • [30] Colorectal Polyp Detection and Comparative Evaluation Based on Deep Learning Approaches
    Chen, Yao-Tien
    Ahmad, Nisar
    [J]. IEEE ACCESS, 2023, 11 : 135074 - 135089