Scalable speech coding based on the wavelet transform

被引:0
|
作者
Stegmann, Joachim [1 ]
机构
[1] T-Nova Deutsche Telekom I., Berkom, Am Kavalleriesand 3, D-64295 Darmstadt, Germany
来源
关键词
Algorithms - Bit error rate - Cosine transforms - Fast Fourier transforms - Frequency domain analysis - Signal encoding - Signal filtering and prediction - Wavelet transforms;
D O I
暂无
中图分类号
学科分类号
摘要
This paper describes a scalable speech coder for operation at bit rates between 4 and 32 kbit/s. The bit rate can be changed with a minimum step size of one bit per frame. The algorithm is based on a predictive coding scheme using adaptive transform coding (ATC) of the residual signal based on a 20-ms frame. The shape of the excitation signal is coded open-loop while gain quantisation and long-term prediction are performed in a closed-loop way. Two different transforms, the discrete cosine transform (DCT) and the discrete wavelet transform (DWT) are investigated and compared with each other. It is shown that the DWT is superior to the DCT at low bit rates. The algorithm has been tested at various bit rates and has been compared with standardised speech coders using ITU-T Rec. P.861 (PSQM) as an objective quality measure. It turns out that the speech quality of the proposed coder is roughly equivalent to ITU-T G.729 at 12 kbit/s and ITU-T G.728 at 16 kbit/s. Even at 4 kbit/s the algorithm produces good-quality output speech.
引用
收藏
页码:321 / 330
相关论文
共 50 条
  • [31] Joint speech/audio coding based scalable perceptual audio coding
    Gao, Li
    Hu, Ruimin
    Yang, Yuhong
    [J]. 2014 IEEE/ACIS 13TH INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION SCIENCE (ICIS), 2014, : 419 - 424
  • [32] Medical image coding based on wavelet transform and distributed arithmetic coding
    Li Wenna
    Gao Yang
    Yi Yufeng
    Gao Liqun
    [J]. 2011 CHINESE CONTROL AND DECISION CONFERENCE, VOLS 1-6, 2011, : 4159 - 4162
  • [33] Application of the wavelet transform to the low-bit-rate speech coding system
    Moriai, S
    Hanazaki, I
    [J]. ELECTRICAL ENGINEERING IN JAPAN, 2004, 148 (03) : 62 - 71
  • [34] BCH Coding Watermarking Based on Discrete Wavelet Transform
    Lu, Jinyu
    Qu, Tao
    [J]. PROCEEDINGS OF THE 6TH INTERNATIONAL CONFERENCE ON ELECTRONIC, MECHANICAL, INFORMATION AND MANAGEMENT SOCIETY (EMIM), 2016, 40 : 460 - 465
  • [35] Combined coding of audio and speech signals using LPC and the discrete wavelet transform
    Mason, M
    Boland, S
    Sridharan, S
    Deriche, M
    [J]. IEEE TENCON'97 - IEEE REGIONAL 10 ANNUAL CONFERENCE, PROCEEDINGS, VOLS 1 AND 2: SPEECH AND IMAGE TECHNOLOGIES FOR COMPUTING AND TELECOMMUNICATIONS, 1997, : 747 - 750
  • [36] ECG compression based on wavelet transform and Golomb coding
    Chen, JL
    Ma, J
    Zhang, Y
    Shi, X
    [J]. ELECTRONICS LETTERS, 2006, 42 (06) : 322 - 324
  • [37] Audio coding based on improved wavelet packet transform
    Zhang, Liang-Zhi
    Zheng, Ying-Wen
    [J]. Shanghai Jiaotong Daxue Xuebao/Journal of Shanghai Jiaotong University, 2002, 36 (SUPPL.): : 55 - 58
  • [38] An ROI image coding based on switching wavelet transform
    Fukuma, S
    Ikuta, S
    Ito, M
    Nishimura, S
    Nawate, M
    [J]. PROCEEDINGS OF THE 2003 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOL II: COMMUNICATIONS-MULTIMEDIA SYSTEMS & APPLICATIONS, 2003, : 420 - 423
  • [39] Wavelet transform based fast fractal video coding
    Zhao, J
    Yu, SL
    [J]. INTERNATIONAL SYMPOSIUM ON MULTISPECTRAL IMAGE PROCESSING, 1998, 3545 : 420 - 423
  • [40] IMAGE CODING OF CONSTRUCTION AND DECOMPOSITION BASED ON WAVELET TRANSFORM
    Ye, Ruyi
    [J]. INTERNATIONAL SYMPOSIUM ON COMPUTER SCIENCE & TECHNOLOGY: PROCEEDINGS, 2012, : 153 - 156