Blind dereverberation of single channel speech signal based on harmonic structure

Cited by: 0
Authors
Nakatani, T [1]
Miyoshi, M [1]
Affiliations
[1] NTT Corp, NTT Commun Sci Labs, Speech Open Lab, Kyoto 6190237, Japan
Keywords
DOI
Not available
Chinese Library Classification (CLC) number
O42 [Acoustics];
Discipline classification code
070206; 082403;
Abstract
This paper presents a new method for dereverberation of speech signals with a single microphone. For applications such as speech recognition, reverberant speech causes serious problems when a distant microphone is used in recording. The problem is especially severe when the reverberation time exceeds 0.5 s. We propose a method that uses the fundamental frequency (F0) of the target speech as the primary feature for dereverberation. The method first estimates F0 and the harmonic structure of the speech signal and then obtains a dereverberation operator. This operator transforms the reverberant signal into its direct signal by an inverse filtering operation. Dereverberation is achieved without prior knowledge of either the room acoustics or the target speech. Experimental results showed that the dereverberation operator estimated from 5240 Japanese word utterances could effectively reduce the reverberation when the reverberation time is longer than 0.1 s.
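As a rough illustration of the first step named in the abstract (estimating F0 and the harmonic structure), the sketch below estimates F0 from the autocorrelation peak of a single voiced frame and lists the harmonic frequencies it implies. This is not the authors' algorithm: the autocorrelation estimator, the function names `estimate_f0_autocorr` and `harmonic_frequencies`, the search range of 60 to 400 Hz, and the synthetic test signal are all assumptions made for illustration. The dereverberation operator itself is not reproduced here, since the abstract does not specify how it is computed.

```python
import numpy as np


def estimate_f0_autocorr(frame, sample_rate, f0_min=60.0, f0_max=400.0):
    """Estimate F0 (Hz) of one voiced frame from its autocorrelation peak.

    Hypothetical helper: a generic textbook estimator, not the paper's method.
    """
    frame = frame - np.mean(frame)
    # Full autocorrelation; keep only the non-negative lags.
    acf = np.correlate(frame, frame, mode="full")[len(frame) - 1:]
    # Restrict the search to lags that correspond to plausible pitch values.
    lag_min = int(sample_rate / f0_max)
    lag_max = int(sample_rate / f0_min)
    lag = lag_min + int(np.argmax(acf[lag_min:lag_max + 1]))
    return sample_rate / lag


def harmonic_frequencies(f0, sample_rate):
    """Return the harmonic frequencies k * f0 up to the Nyquist frequency."""
    n_harmonics = int((sample_rate / 2) // f0)
    return f0 * np.arange(1, n_harmonics + 1)


# Synthetic voiced frame (assumed data): 40 ms at 16 kHz, three harmonics of 200 Hz.
sample_rate = 16000
t = np.arange(int(0.04 * sample_rate)) / sample_rate
true_f0 = 200.0
frame = sum(np.sin(2 * np.pi * k * true_f0 * t) / k for k in (1, 2, 3))

f0_hat = estimate_f0_autocorr(frame, sample_rate)
harmonics = harmonic_frequencies(f0_hat, sample_rate)
```

On real reverberant speech an estimator like this would need voicing detection and smoothing across frames, because reverberation smears the harmonic structure that the F0 estimate relies on.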
Pages: 92-95
Number of pages: 4