A unified approach to speech enhancement and voice activity detection

被引:6
|
作者
Kasap, Ceyhan [1 ]
Arslan, Mustafa Levent [1 ]
机构
[1] Bogazici Univ, Dept Elect & Elect Engn, Istanbul, Turkey
关键词
Speech enhancement; voice activity detection; noise suppression; modified Wiener filtering; NOISE; SUPPRESSION;
D O I
10.3906/elk-1107-30
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, a unified system for voice activity detection (VAD) and speech enhancement is proposed. In the proposed system, there is mutual exchange of information between VAD and speech enhancement blocks. A new and robust VAD algorithm is implemented for the VAD block of the unified system. The newly proposed VAD algorithm uses a periodicity measure and an energy measure obtained from spectral energy distribution and spectral energy difference of the input speech data. For the speech enhancement block, the modified Wiener filtering (MWF) approach is utilized. It has been shown that the utilization of information exchange between the VAD and MWF algorithms in the unified system increases the performance of both algorithms and the proposed unified system improves the robustness of a speech recognition system significantly. Both of the enhanced algorithms are noniterative. Therefore, the proposed unified system is computationally attractive for real-time applications.
引用
收藏
页码:527 / 547
页数:21
相关论文
共 50 条
  • [1] A Hierarchical Framework Approach for Voice Activity Detection and Speech Enhancement
    Zhang, Yan
    Tang, Zhen-min
    Li, Yan-ping
    Luo, Yang
    [J]. SCIENTIFIC WORLD JOURNAL, 2014,
  • [2] Voice Activity Detection for Speech Enhancement Applications
    Verteletskaya, E.
    Sakhnov, K.
    [J]. ACTA POLYTECHNICA, 2010, 50 (04) : 100 - 105
  • [3] Gaussian Process Regression for Voice Activity Detection and Speech Enhancement
    Park, Sunho
    Choi, Seungjin
    [J]. 2008 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-8, 2008, : 2879 - 2882
  • [4] Enhancement of speech dynamics for voice activity detection using DNN
    Dwijayanti, Suci
    Yamamori, Kei
    Miyoshi, Masato
    [J]. EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2018,
  • [5] Enhancement of speech dynamics for voice activity detection using DNN
    Suci Dwijayanti
    Kei Yamamori
    Masato Miyoshi
    [J]. EURASIP Journal on Audio, Speech, and Music Processing, 2018
  • [6] A SPEECH ENHANCEMENT SYSTEM FOR AUTOMOTIVE SPEECH RECOGNITION WITH A HYBRID VOICE ACTIVITY DETECTION METHOD
    Wang, Haikun
    Ye, Zhongfu
    Chen, Jingdong
    [J]. 2018 16TH INTERNATIONAL WORKSHOP ON ACOUSTIC SIGNAL ENHANCEMENT (IWAENC), 2018, : 456 - 460
  • [7] An improved voice activity detection algorithm employing speech enhancement preprocessing
    Lee, YC
    Ahn, SS
    [J]. IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2001, E84A (06): : 1401 - 1405
  • [8] An improved voice activity detection algorithm employing speech enhancement preprocessing
    Lee, Y.-C.
    Ahn, S.-S.
    [J]. IEICE Transactions on Fundamentals of Electronics, Communications and Computer Sciences, 2001, E84-A (06) : 1401 - 1405
  • [9] Speech recognition enhancement with statistical model-based voice activity detection
    Jarc, Bojan
    Babič, Rudolf
    [J]. Elektrotehniski Vestnik/Electrotechnical Review, 2002, 69 (01): : 75 - 81
  • [10] Joint Training ResCNN-based Voice Activity Detection with Speech Enhancement
    Xu, Tianjiao
    Zhang, Hui
    Zhang, Xueliang
    [J]. 2019 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2019, : 1157 - 1162