A Hierarchical Framework Approach for Voice Activity Detection and Speech Enhancement

被引:6
|
作者
Zhang, Yan [1 ,2 ]
Tang, Zhen-min [1 ]
Li, Yan-ping [3 ]
Luo, Yang [2 ]
机构
[1] Nanjing Univ Sci & Technol NUST, Coll Comp Sci & Technol, Nanjing 210094, Jiangsu, Peoples R China
[2] Jinling Inst Technol, Coll Informat Technol, Nanjing 211169, Jiangsu, Peoples R China
[3] Nanjing Univ Posts & Telecommun, Coll Telecommun & Informat Engn, Nanjing 210046, Jiangsu, Peoples R China
来源
关键词
D O I
10.1155/2014/723643
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Accurate and effective voice activity detection (VAD) is a fundamental step for robust speech or speaker recognition. In this study, we proposed a hierarchical framework approach for VAD and speech enhancement. The modified Wiener filter (MWF) approach is utilized for noise reduction in the speech enhancement block. For the feature selection and voting block, several discriminating features were employed in a voting paradigm for the consideration of reliability and discriminative power. Effectiveness of the proposed approach is compared and evaluated to other VAD techniques by using two well-known databases, namely, TIMIT database and NOISEX-92 database. Experimental results show that the proposed method performs well under a variety of noisy conditions.
引用
收藏
页数:8
相关论文
共 50 条
  • [1] A unified approach to speech enhancement and voice activity detection
    Kasap, Ceyhan
    Arslan, Mustafa Levent
    [J]. TURKISH JOURNAL OF ELECTRICAL ENGINEERING AND COMPUTER SCIENCES, 2013, 21 (02) : 527 - 547
  • [2] Voice Activity Detection for Speech Enhancement Applications
    Verteletskaya, E.
    Sakhnov, K.
    [J]. ACTA POLYTECHNICA, 2010, 50 (04) : 100 - 105
  • [3] Gaussian Process Regression for Voice Activity Detection and Speech Enhancement
    Park, Sunho
    Choi, Seungjin
    [J]. 2008 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-8, 2008, : 2879 - 2882
  • [4] Enhancement of speech dynamics for voice activity detection using DNN
    Dwijayanti, Suci
    Yamamori, Kei
    Miyoshi, Masato
    [J]. EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2018,
  • [5] Enhancement of speech dynamics for voice activity detection using DNN
    Suci Dwijayanti
    Kei Yamamori
    Masato Miyoshi
    [J]. EURASIP Journal on Audio, Speech, and Music Processing, 2018
  • [6] A SPEECH ENHANCEMENT SYSTEM FOR AUTOMOTIVE SPEECH RECOGNITION WITH A HYBRID VOICE ACTIVITY DETECTION METHOD
    Wang, Haikun
    Ye, Zhongfu
    Chen, Jingdong
    [J]. 2018 16TH INTERNATIONAL WORKSHOP ON ACOUSTIC SIGNAL ENHANCEMENT (IWAENC), 2018, : 456 - 460
  • [7] An improved voice activity detection algorithm employing speech enhancement preprocessing
    Lee, YC
    Ahn, SS
    [J]. IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2001, E84A (06): : 1401 - 1405
  • [8] An improved voice activity detection algorithm employing speech enhancement preprocessing
    Lee, Y.-C.
    Ahn, S.-S.
    [J]. IEICE Transactions on Fundamentals of Electronics, Communications and Computer Sciences, 2001, E84-A (06) : 1401 - 1405
  • [9] Speech recognition enhancement with statistical model-based voice activity detection
    Jarc, Bojan
    Babič, Rudolf
    [J]. Elektrotehniski Vestnik/Electrotechnical Review, 2002, 69 (01): : 75 - 81
  • [10] Joint Training ResCNN-based Voice Activity Detection with Speech Enhancement
    Xu, Tianjiao
    Zhang, Hui
    Zhang, Xueliang
    [J]. 2019 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2019, : 1157 - 1162