Voice activity detection with array signal processing in the wavelet domain

被引:0
|
作者
Hioka, Y [1 ]
Hamada, N [1 ]
机构
[1] Keio Univ, Fac Sci & Technol, Dept Syst Design Engn, Yokohama, Kanagawa 2238522, Japan
关键词
voice activity detection; microphone array; wavelet packet analysis; eigenspace analysis; speech features;
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
In speech enhancement with adaptive microphone array, the voice activity detection (VAD) is indispensable for the adaptation control. Even though many VAD methods have been proposed as a pre-processor for speech recognition and compression, they can hardly discriminate nonstationary interferences which frequently exist in real environment. In this research., we propose a novel VAD method with array signal processing in the wavelet domain. In that domain we can integrate the temporal, spectral and spatial information to achieve robust voice activity discriminability for a nonstationary interference arriving from close direction of speech. The signals acquired by microphone array are at first decomposed into appropriate subbands using wavelet packet to extract its temporal and spectral features. Then directionality check and direction estimation on each subbands are executed to do VAD with respect to the spatial information. Computer simulation results for sound data demonstrate that the proposed method keeps its discriminability even for the interference arriving from close direction of speech.
引用
收藏
页码:2802 / 2811
页数:10
相关论文
共 50 条
  • [1] Voice activity detection using microphone array
    Cho, Jaeyoun
    Krishnamurthy, Ashok
    Proceedings of the AES International Conference, 2007,
  • [2] Source detection and localization in array signal processing
    Bouri, Mohamed
    2006 First International Symposium on Environment Identities and Mediterranean Area, Vols 1 and 2, 2006, : 101 - 106
  • [3] A robust voice activity detection based on wavelet transform
    Aghajani, Kh.
    Manzuri, M. T.
    Karami, M.
    Tayebi, H.
    2008 SECOND INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING, 2008, : 37 - +
  • [4] Voice activity detection based on using wavelet packet
    Eshaghi, Mohadese
    Mollaei, M. R. Karami
    DIGITAL SIGNAL PROCESSING, 2010, 20 (04) : 1102 - 1115
  • [5] Wavelet Hardware Processing Unit for Transient Signal Detection
    Macchi Konrad, Juan Marcos
    De Pasquale, Lorenzo
    Angel Banchieri, Miguel
    Reggiani, Guillermo
    Cayssials, Ricardo
    Ferro, Edgardo
    2014 IX SOUTHERN CONFERENCE ON PROGRAMMABLE LOGIC (SPL 2014), 2014,
  • [6] Signal Processing Domain Application Mapping On The Brick Reconfigurable Array
    Eusse Giraldo, Juan Fernando
    Jacobi, Ricardo Pezzuol
    2009 INTERNATIONAL CONFERENCE ON RECONFIGURABLE COMPUTING AND FPGAS, 2009, : 356 - +
  • [7] Subband-domain signal processing for radar array systems
    Rabinkin, DV
    Pulsone, NB
    ADVANCED SIGNAL PROCESSING ALGORITHMS, ARCHITECTURES,AND IMPLEMENTATIONS IX, 1999, 3807 : 174 - 187
  • [8] Robust voice-activity detection based on the wavelet transform
    Stegmann, J
    Schroder, G
    1997 IEEE WORKSHOP ON SPEECH CODING FOR TELECOMMUNICATIONS, PROCEEDINGS: BACK TO BASICS: ATTACKING FUNDAMENTAL PROBLEMS IN SPEECH CODING, 1997, : 99 - 100
  • [9] Using the bootstrap for robust detection in array signal processing
    Pelin, Per
    Viberg, Mats
    Conference Record of the Asilomar Conference on Signals, Systems and Computers, 1999, 1 : 20 - 24
  • [10] A new algorithm for voice activity detection based on wavelet transform
    Jiang, SJ
    Guo, HT
    Yin, FL
    PROCEEDINGS OF THE 2004 INTERNATIONAL SYMPOSIUM ON INTELLIGENT MULTIMEDIA, VIDEO AND SPEECH PROCESSING, 2004, : 222 - 225