Blind dereverberation of single channel speech signal based on harmonic structure

Citations: 0
Authors
Nakatani, T [1 ]
Miyoshi, M [1 ]
Affiliations
[1] NTT Corp, NTT Commun Sci Labs, Speech Open Lab, Kyoto 6190237, Japan
Keywords
DOI
Not available
Chinese Library Classification
O42 [Acoustics];
Discipline classification codes
070206 ; 082403 ;
Abstract
This paper presents a new method for dereverberation of speech signals with a single microphone. For applications such as speech recognition, reverberant speech causes serious problems when a distant microphone is used in recording. The problem is especially severe when the reverberation time exceeds 0.5 s. We propose a method that uses the fundamental frequency (F0) of the target speech as the primary feature for dereverberation. The method first estimates F0 and the harmonic structure of the speech signal and then obtains a dereverberation operator. This operator transforms the reverberant signal into its direct signal by means of an inverse filtering operation. Dereverberation is achieved with prior knowledge of neither the room acoustics nor the target speech. Experimental results showed that the dereverberation operator estimated from 5240 Japanese word utterances could effectively reduce the reverberation when the reverberation time is longer than 0.1 s.
Pages: 92-95
Number of pages: 4