AUC OPTIMIZATION FOR DEEP LEARNING BASED VOICE ACTIVITY DETECTION

Cited by: 0
Authors
Fan, Zi-Chen [1]
Bai, Zhongxin
Zhang, Xiao-Lei
Rahardja, Susanto
Chen, Jingdong
Affiliations
[1] Northwestern Polytech Univ, Ctr Intelligent Acoust & Immers Commun, Xian, Shaanxi, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
AUC; deep neural networks; voice activity detection;
DOI
Not available
Chinese Library Classification number
O42 [Acoustics];
Discipline classification code
070206 ; 082403 ;
Abstract
Voice activity detection (VAD) based on deep neural networks (DNNs) has demonstrated good performance in adverse acoustic environments. Current DNN-based VAD optimizes a surrogate function, e.g., minimum cross-entropy or minimum squared error, at a given decision threshold. However, VAD usually works on the fly with a dynamic decision threshold, and the receiver operating characteristic (ROC) curve is a global evaluation metric that reflects VAD performance at all possible decision thresholds. In this paper, we propose to optimize the area under the ROC curve (AUC) with a DNN, which maximizes VAD performance in terms of the ROC curve. Experimental results show that optimizing AUC with a DNN yields higher performance than the common method of minimizing the squared error with a DNN.
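The abstract does not say which differentiable surrogate the authors use for AUC, so the following is only a minimal sketch of the general idea, under stated assumptions. It computes the empirical AUC as the fraction of (speech, non-speech) frame pairs that are ranked correctly, alongside a pairwise squared-hinge loss that could stand in for 1 - AUC during training. The function names, the `margin` parameter, and the toy scores are all hypothetical and are not taken from the paper.

```python
import numpy as np


def empirical_auc(scores, labels):
    """Fraction of (speech, non-speech) pairs ranked correctly; ties count 0.5."""
    pos = scores[labels == 1]            # scores of speech frames
    neg = scores[labels == 0]            # scores of non-speech frames
    diff = pos[:, None] - neg[None, :]   # all pairwise score differences
    return float(np.mean((diff > 0) + 0.5 * (diff == 0)))


def pairwise_auc_loss(scores, labels, margin=1.0):
    """Squared-hinge surrogate for 1 - AUC (an assumed form, not the paper's)."""
    pos = scores[labels == 1]
    neg = scores[labels == 0]
    diff = pos[:, None] - neg[None, :]
    # Penalize every pair whose speech score does not exceed
    # the non-speech score by at least `margin`.
    return float(np.mean(np.maximum(0.0, margin - diff) ** 2))


# Toy frame-level scores (hypothetical): 1 = speech, 0 = non-speech.
scores = np.array([0.9, 0.8, 0.3, 0.6, 0.2, 0.1])
labels = np.array([1, 1, 1, 0, 0, 0])

auc = empirical_auc(scores, labels)      # 8 of the 9 pairs are ordered correctly
loss = pairwise_auc_loss(scores, labels)
print(auc, loss)
```

Unlike a per-frame squared error, this loss depends only on the relative ordering of speech and non-speech scores, so it does not tie training to one decision threshold.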
Pages: 6760-6764
Page count: 5