Blind dereverberation of single channel speech signal based on harmonic structure

Cited by: 0

Authors:

Nakatani, T [1]

Miyoshi, M [1]

Affiliation:

[1] NTT Corp, NTT Commun Sci Labs, Speech Open Lab, Kyoto 6190237, Japan

Source:

2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING I | 2003

Keywords:

DOI:

Not available

Chinese Library Classification (CLC) number:

O42 [Acoustics];

Discipline classification code:

070206 ; 082403 ;

Abstract:

This paper presents a new method for dereverberation of speech signals with a single microphone. For applications such as speech recognition, reverberant speech causes serious problems when a distant microphone is used in recording. This is especially severe when the reverberation time exceeds 0.5 seconds. We propose a method which uses the fundamental frequency (F0) of target speech as the primary feature for dereverberation. This method initially estimates F0 and the harmonic structure of the speech signal and then obtains a dereverberation operator. This operator transforms the reverberant signal to its direct signal based on an inverse filtering operation. Dereverberation is achieved with prior knowledge of neither room acoustics nor the target speech. Experimental results showed that the dereverberation operator estimated from 5240 Japanese word utterances could effectively reduce the reverberation when the reverberation time is longer than 0.1 seconds.
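
The first step named in the abstract, estimating F0 and the harmonic structure, can be illustrated with a minimal sketch. It does not reproduce the paper's estimator or its dereverberation operator: a generic autocorrelation peak picker stands in for the F0 step. The function names `estimate_f0` and `harmonic_frequencies`, the 80 to 400 Hz search range, and the synthetic 200 Hz test frame are all assumptions made for illustration.

```python
import math


def estimate_f0(frame, fs, f0_min=80.0, f0_max=400.0):
    """Estimate F0 in Hz from one frame via the biased autocorrelation peak.

    Generic textbook technique, assumed here for illustration only; it is
    not claimed to be the estimator used in the paper.
    """
    n = len(frame)
    mean = sum(frame) / n
    x = [v - mean for v in frame]      # remove the DC offset
    lag_min = int(fs / f0_max)         # shortest candidate period in samples
    lag_max = int(fs / f0_min)         # longest candidate period in samples
    best_lag, best_score = lag_min, float("-inf")
    for lag in range(lag_min, lag_max + 1):
        # Biased autocorrelation: fewer terms at larger lags, which favors
        # the fundamental period over its integer multiples.
        score = sum(x[i] * x[i + lag] for i in range(n - lag))
        if score > best_score:
            best_lag, best_score = lag, score
    return fs / best_lag


def harmonic_frequencies(f0, fs):
    """Return the harmonic frequencies k * f0 that lie below Nyquist."""
    freqs = []
    k = 1
    while k * f0 < fs / 2:
        freqs.append(k * f0)
        k += 1
    return freqs


# Synthetic voiced frame (hypothetical test signal): 200 Hz fundamental
# plus two weaker harmonics, sampled at 8 kHz.
fs = 8000
true_f0 = 200.0
frame = [
    math.sin(2 * math.pi * true_f0 * t / fs)
    + 0.5 * math.sin(2 * math.pi * 2 * true_f0 * t / fs)
    + 0.25 * math.sin(2 * math.pi * 3 * true_f0 * t / fs)
    for t in range(1024)
]

f0 = estimate_f0(frame, fs)
harmonics = harmonic_frequencies(f0, fs)
print(f0, harmonics[:3])
```

On real reverberant speech the frame would come from a recording rather than a synthetic signal, and the estimated harmonics would only be the input to the operator estimation described in the abstract, which this sketch leaves out.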

Pages: 92-95

Page count: 4
