A Universal VAD Based on Jointly Trained Deep Neural Networks

被引:0
|
作者
Wang, Qing [1 ]
Du, Jun [1 ]
Bao, Xiao [1 ]
Wang, Zi-Rui [1 ]
Dai, Li-Rong [1 ]
Lee, Chin-Hui [2 ]
机构
[1] Univ Sci & Technol China, Hefei, Anhui, Peoples R China
[2] Georgia Inst Technol, Atlanta, GA 30332 USA
关键词
voice activity detection; deep neural network; feature mapping; joint training; VOICE ACTIVITY DETECTION; SPEECH RECOGNITION; NOISE;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this paper, we propose a joint training approach to voice activity detection (VAD) to address the issue of performance degradation due to unseen noise conditions. Two key techniques are integrated into this deep neural network (DNN) based VAD framework. First, a regression DNN is trained to map the noisy to clean speech features similar to DNN-based speech enhancement. Second, the VAD part to discriminate speech against noise backgrounds is also a DNN trained with a large amount of diversified noisy data synthesized by a wide range of additive noise types. By stacking the classification DNN on top of the enhancement DNN, this integrated DNN can be jointly trained to perform VAD. The feature mapping DNN serves as a noise normalization module aiming at explicitly generating the "clean" features which are easier to be correctly recognized by the following classification DNN. Our experiment results demonstrate the proposed noise-universal DNN-based VAD algorithm achieves a good generalization capacity to unseen noises, and the jointly trained DNNs consistently and significantly outperform the conventional classification-based DNN for all the noise types and signal-to-noise levels tested.
引用
收藏
页码:2282 / 2286
页数:5
相关论文
共 50 条
  • [21] Teaming Up Pre-Trained Deep Neural Networks
    Deabes, Wael
    Abdel-Hakim, Alaa E.
    2018 INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND INFORMATION SECURITY (ICSPIS), 2018, : 73 - 76
  • [22] Universal Approximation Property of Hamiltonian Deep Neural Networks
    Zakwan, Muhammad
    d'Angelo, Massimiliano
    Ferrari-Trecate, Giancarlo
    IEEE CONTROL SYSTEMS LETTERS, 2023, 7 : 2689 - 2694
  • [23] Generalizing universal adversarial perturbations for deep neural networks
    Yanghao Zhang
    Wenjie Ruan
    Fu Wang
    Xiaowei Huang
    Machine Learning, 2023, 112 : 1597 - 1626
  • [24] Jaynes machine: The universal microstructure of deep neural networks
    Venkatasubramanian, Venkat
    Sanjeevrajan, N.
    Khandekar, Manasi
    Sivaram, Abhishek
    Szczepanski, Collin
    COMPUTERS & CHEMICAL ENGINEERING, 2025, 192
  • [25] DeepCABAC: A Universal Compression Algorithm for Deep Neural Networks
    Wiedemann, Simon
    Kirchhoffer, Heiner
    Matlage, Stefan
    Haase, Paul
    Marban, Arturo
    Marinc, Talmaj
    Neumann, David
    Nguyen, Tung
    Schwarz, Heiko
    Wiegand, Thomas
    Marpe, Detlev
    Samek, Wojciech
    IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2020, 14 (04) : 700 - 714
  • [26] Generalizing universal adversarial perturbations for deep neural networks
    Zhang, Yanghao
    Ruan, Wenjie
    Wang, Fu
    Huang, Xiaowei
    MACHINE LEARNING, 2023, 112 (05) : 1597 - 1626
  • [27] Cocktail Universal Adversarial Attack on Deep Neural Networks
    Li, Shaoxin
    Li, Xiaofeng
    Che, Xin
    Li, Xintong
    Zhang, Yong
    Chu, Lingyang
    COMPUTER VISION - ECCV 2024, PT LXV, 2025, 15123 : 396 - 412
  • [28] Universal and Succinct Source Coding of Deep Neural Networks
    Basu S.
    Varshney L.R.
    IEEE Journal on Selected Areas in Information Theory, 2022, 3 (04): : 732 - 745
  • [29] A new genetic approach to universal rule generation from trained neural networks
    Fukumi, M
    Mitsukura, Y
    Akamatsu, N
    IEEE 2000 TENCON PROCEEDINGS, VOLS I-III: INTELLIGENT SYSTEMS AND TECHNOLOGIES FOR THE NEW MILLENNIUM, 2000, : 1 - 6
  • [30] Following the Leader using a Tracking System based on Pre-trained Deep Neural Networks
    Mutz, Filipe
    Cardoso, Vinicius
    Teixeira, Thomas
    Jesus, Luan F. R.
    Golcalves, Michael A.
    Guidolini, Ranik
    Oliveira, Josias
    Badue, Claudine
    De Souza, Alberto F.
    2017 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2017, : 4332 - 4339