Signal modification for robust speech coding

被引:7
|
作者
Kim, NS [1 ]
Chang, JH
机构
[1] Seoul Natl Univ, Sch Elect Engn, Seoul 151742, South Korea
[2] Seoul Natl Univ, INMC, Seoul 151742, South Korea
来源
关键词
low-bit-rate speech coding; signal modification;
D O I
10.1109/TSA.2003.819946
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Usually, the performance of a low-bit-rate speech coder degrades seriously in the presence of various interfering signals such as the background noise, acoustic echo, co-talkers' speech and other unwanted signals. This comes from the mismatch between the input signal and the assumed speech production model on which the design of the given speech coder is based. In this paper, we present an approach to modify the input signal such that it can be coded more effectively within the generalized analysis-by-synthesis framework. Signal modification in the presented approach is performed according to a criterion which makes a compromise between the modification and coder quantization errors. The coder-decoder (CODEC) characteristic is described in terms of a transfer matrix, and an on-line method using the recursive least square (RLS) technique is proposed to estimate it. Since each part of the speech signal is differently affected by the modification, we also devise an adaptive method based on the signal-to-quantization noise ratio (SQNR). In contrast to the conventional modification techniques, our approach can be implemented as a simple front-end for any analysis-by-synthesis type coders.
引用
收藏
页码:9 / 18
页数:10
相关论文
共 50 条
  • [31] Speech signal modification to increase intelligibility in noisy environments
    Yoo, Sungyub D.
    Boston, J. Robert
    El-Jaroudi, Amro
    Li, Ching-Chung
    Durrant, John D.
    Kovacyk, Kristie
    Shaiman, Susan
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2007, 122 (02): : 1138 - 1149
  • [32] Robust and complex approach of pathological speech signal analysis
    Mekyska, Jiri
    Janousova, Eva
    Gomez-Vilda, Pedro
    Smekal, Zdenek
    Rektorova, Irena
    Eliasova, Ilona
    Kostalova, Milena
    Mrackova, Martina
    Alonso-Hernandez, Jesus B.
    Faundez-Zanuy, Marcos
    Lopez-de-Ipina, Karmele
    NEUROCOMPUTING, 2015, 167 : 94 - 111
  • [33] SIGNAL BOOSTING FOR ROBUST DATA FUSION IN SPEECH RETRIEVAL
    Wu, Dan
    He, Daqing
    INTERNATIONAL JOURNAL OF INNOVATIVE COMPUTING INFORMATION AND CONTROL, 2010, 6 (3B): : 1525 - 1536
  • [34] Robust estimate for linear prediction parameters of speech signal
    Jiang, Taihui
    Yao, Tianren
    1996, (24):
  • [35] ROBUST LOWRATE SPEECH CODING BASED ON CLONED NETWORKS AND WAVENET
    Lim, Felicia S. C.
    Kleijn, W. Bastiaan
    Chinen, Michael
    Skoglund, Jan
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 6769 - 6773
  • [36] Robust audio and speech coding for mobile and IP network applications
    Ma, Hongfei
    Hao, Xiaofeng
    Li, Qian
    Song, Shaopeng
    2008 INTERNATIONAL CONFERENCE ON AUDIO, LANGUAGE AND IMAGE PROCESSING, VOLS 1 AND 2, PROCEEDINGS, 2008, : 262 - 266
  • [37] On the Use of Discrete Wavelet Transform for Robust Scalable Speech Coding
    Ogunfunmi, Tokunbo
    Seto, Koji
    2016 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2016, : 766 - 769
  • [38] Assessment of signal subspace based speech enhancement for noise robust speech recognition
    Hermus, K
    Wambacq, P
    2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING, 2004, : 945 - 948
  • [39] Speech coding for energy-efficient digital signal processing
    Wassner, J
    Kaeslin, H
    Felber, N
    Fichtner, W
    PROCEEDINGS OF THE 43RD IEEE MIDWEST SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOLS I-III, 2000, : 580 - 583
  • [40] Vector predictive coding algorithm for unstable speech signal sequences
    Qian Zhengxiang
    Yang Luyi
    Chen Wei
    Ding Penghui
    SIGNAL ANALYSIS, MEASUREMENT THEORY, PHOTO-ELECTRONIC TECHNOLOGY, AND ARTIFICIAL INTELLIGENCE, PTS 1 AND 2, 2006, 6357