Low bit-rate voice compression based on frequency domain interpolative techniques

Cited by: 3
Authors
Bhaskar, U [1]
Swaminathan, K [1]
Affiliations
[1] Hughes Network Syst, Germantown, MD 20876 USA
Keywords
frequency domain interpolation (FDI); linear prediction; prototype waveform interpolation; voice coding
DOI
10.1109/TSA.2005.857803
Chinese Library Classification (CLC) number
O42 [Acoustics]
Discipline classification code
070206; 082403
Abstract
This paper presents an approach, referred to as frequency domain interpolation (FDI), for achieving high-quality speech at low bit-rates (4 kb/s and below) within reasonable complexity and delay. FDI methods, like prototype waveform interpolation (PWI) methods, derive a prototype waveform (PW) at regular intervals of time. Unlike PWI, however, FDI does not separate the PW into a slowly evolving waveform (SEW) and a rapidly evolving waveform (REW) component. Instead, the PW is encoded after gain normalization in magnitude-phase form. The magnitude is modeled as a sum of mean and deviation values in multiple frequency bands, and this model is quantized using switched backward adaptive VQ techniques. The phase information is represented as a composite vector of PW correlations in multiple frequency bands and an overall voicing measure. This information is quantized using a VQ at the encoder. At the decoder, a phase model is employed that uses the received phase (and magnitude) information to reproduce PWs with the correct periodicity and evolutionary characteristics. Speech is synthesized by interpolating the reconstructed PWs after gain adjustment and filtering the result using the short-term predictor and a postfilter. The designs of a 4-kb/s and a 2.4-kb/s FDI codec are presented in this paper, and their performance is characterized in terms of delay, complexity, and subjective voice quality. The results confirm that FDI techniques have the potential for delivering high-quality speech at low bit-rates in a cost-effective manner.
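The magnitude model described in the abstract (gain normalization, followed by a mean-plus-deviation decomposition per frequency band) can be illustrated with a minimal sketch. This is not the authors' codec: the function names, the RMS-based gain, and the band edges are all assumptions made for illustration, and the quantization stage (switched backward adaptive VQ) is omitted entirely.

```python
import math


def gain_normalize(magnitudes):
    """Scale harmonic magnitudes to unit RMS and return (normalized, gain).

    Using RMS as the gain is an assumption; the abstract only says the
    PW is gain-normalized before encoding.
    """
    gain = math.sqrt(sum(m * m for m in magnitudes) / len(magnitudes))
    if gain == 0.0:
        return list(magnitudes), 0.0
    return [m / gain for m in magnitudes], gain


def band_mean_deviation(magnitudes, band_edges):
    """Model magnitudes as a per-band mean plus per-harmonic deviations.

    band_edges is a hypothetical list of harmonic indices that starts at 0
    and ends at len(magnitudes); each adjacent pair delimits one band.
    """
    means, deviations = [], []
    for lo, hi in zip(band_edges[:-1], band_edges[1:]):
        band = magnitudes[lo:hi]
        mean = sum(band) / len(band)
        means.append(mean)
        deviations.extend(m - mean for m in band)
    return means, deviations


def reconstruct(means, deviations, band_edges):
    """Invert band_mean_deviation: magnitude = band mean + deviation."""
    out = []
    for mean, lo, hi in zip(means, band_edges[:-1], band_edges[1:]):
        out.extend(mean + d for d in deviations[lo:hi])
    return out


if __name__ == "__main__":
    pw_magnitudes = [4.0, 2.0, 2.0, 1.0, 1.0, 0.0]  # made-up harmonic magnitudes
    edges = [0, 2, 4, 6]                            # made-up three-band layout
    normalized, gain = gain_normalize(pw_magnitudes)
    means, deviations = band_mean_deviation(normalized, edges)
    restored = [gain * m for m in reconstruct(means, deviations, edges)]
    print(means, deviations, restored)
```

In a real coder the means and deviations would be quantized separately, which is where this split pays off: the band means carry the coarse spectral shape, while the deviations carry the finer structure within each band.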
Pages: 558-576
Page count: 19
Related papers
50 in total
  • [1] Enhanced waveform interpolative coding at low bit-rate
    Gottesman, O
    Gersho, A
    [J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2001, 9 (08) : 786 - 798
  • [2] HVS-Based Low Bit-Rate Image Compression
    Wang, Yuer
    Zhu, Zhongjie
    Chen, Weidong
    [J]. SENSORS, MECHATRONICS AND AUTOMATION, 2014, 511-512 : 441 - 446
  • [3] LOW BIT-RATE COMPRESSION OF OMNIDIRECTIONAL IMAGES
    Tosic, Ivana
    Frossard, Pascal
    [J]. PCS: 2009 PICTURE CODING SYMPOSIUM, 2009, : 53 - 56
  • [4] Low bit-rate compression of facial images
    Elad, Michael
    Goldenberg, Roman
    Kimmel, Ron
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2007, 16 (09) : 2379 - 2383
  • [5] LOW BIT-RATE IMAGE COMPRESSION SCHEMES BASED ON VECTOR QUANTIZATION
    Hu, Yu-Chen
    [J]. INTERNATIONAL JOURNAL OF IMAGE AND GRAPHICS, 2005, 5 (04) : 745 - 764
  • [6] JPEG-based image compression for low bit-rate coding
    Gandhi, PP
    [J]. STILL-IMAGE COMPRESSION II, 1996, 2669 : 82 - 94
  • [7] Low Bit-rate Subpixel-based Color Image Compression
    Fang, L.
    Cheung, N. -M.
    Au, O. C.
    Li, H.
    Tang, K.
    [J]. 2013 DATA COMPRESSION CONFERENCE (DCC), 2013, : 489 - 489
  • [8] Low bit-rate efficient compression for seismic data
    Averbuch, AZ
    Meyer, F
    Strömberg, JO
    Coifman, R
    Vassiliou, A
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2001, 10 (12) : 1801 - 1814
  • [9] SAR image compression at very low bit-rate
    Zhai, JF
    Wang, ZS
    [J]. 2004 7TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS, VOLS 1-3, 2004, : 2167 - 2170
  • [10] A bit-rate control method for low bit-rate video coding based on high bit-rate period emphasis
    Nishio, K
    Tanaka, A
    Yamamoto, N
    Kanada, Y
    Yamamoto, R
    [J]. IEEE SOUTHEASTCON '99, PROCEEDINGS, 1999, : 92 - 97