Signal modification for robust speech coding

Cited by: 7

Authors:

Kim, NS [1]

Chang, JH

Affiliations:

[1] Seoul Natl Univ, Sch Elect Engn, Seoul 151742, South Korea

[2] Seoul Natl Univ, INMC, Seoul 151742, South Korea

Source:

IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING | 2004 / Vol. 12 / Issue 01

Keywords:

low-bit-rate speech coding; signal modification;

DOI:

10.1109/TSA.2003.819946

Chinese Library Classification (CLC) number:

O42 [Acoustics];

Subject classification codes:

070206 ; 082403 ;

Abstract:

The performance of a low-bit-rate speech coder usually degrades seriously in the presence of interfering signals such as background noise, acoustic echo, co-talkers' speech, and other unwanted signals. This degradation comes from the mismatch between the input signal and the assumed speech production model on which the design of the given speech coder is based. In this paper, we present an approach to modify the input signal such that it can be coded more effectively within the generalized analysis-by-synthesis framework. Signal modification in the presented approach is performed according to a criterion that makes a compromise between the modification error and the coder quantization error. The coder-decoder (CODEC) characteristic is described in terms of a transfer matrix, and an on-line method using the recursive least squares (RLS) technique is proposed to estimate it. Since each part of the speech signal is affected differently by the modification, we also devise an adaptive method based on the signal-to-quantization noise ratio (SQNR). In contrast to conventional modification techniques, our approach can be implemented as a simple front-end for any analysis-by-synthesis coder.
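The abstract does not give the paper's equations, so the following is only a generic textbook sketch of the two tools it names: an RLS update that tracks a matrix T such that y ≈ T x, and an SQNR measure in dB. The function names `rls_estimate` and `sqnr_db`, the forgetting factor `lam`, and the initialization constant `delta` are assumptions made for illustration, and this is not the authors' formulation.

```python
import numpy as np

def rls_estimate(xs, ys, lam=0.99, delta=1e3):
    """Generic RLS estimate of a matrix T with ys[n] ~= T @ xs[n].

    lam is a forgetting factor and delta scales the initial inverse
    correlation matrix; both values are illustrative assumptions.
    """
    xs = np.asarray(xs, dtype=float)
    ys = np.asarray(ys, dtype=float)
    k, m = xs.shape[1], ys.shape[1]
    T = np.zeros((m, k))      # current transfer-matrix estimate
    P = delta * np.eye(k)     # inverse input-correlation estimate
    for x, y in zip(xs, ys):
        Px = P @ x
        gain = Px / (lam + x @ Px)          # RLS gain vector
        err = y - T @ x                     # a priori prediction error
        T = T + np.outer(err, gain)         # update every row of T
        P = (P - np.outer(gain, Px)) / lam  # update inverse correlation
    return T

def sqnr_db(original, decoded):
    """SQNR in dB: signal energy over quantization-error energy.

    Assumes the decoded signal differs from the original (nonzero error).
    """
    original = np.asarray(original, dtype=float)
    noise = original - np.asarray(decoded, dtype=float)
    return 10.0 * np.log10(np.sum(original ** 2) / np.sum(noise ** 2))
```

In a setting like the one described, `xs` would hold frames of the coder input and `ys` the corresponding decoded frames, with `sqnr_db` computed per frame; how the paper actually combines the two is not stated in the abstract.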


Pages: 9-18

Page count: 10

