Voicing-specific LPC quantization for variable-rate speech coding

被引:13
|
作者
Hagen, R [1 ]
Paksoy, E [1 ]
Gersho, A [1 ]
机构
[1] Chalmers Univ Technol, Dept Informat Theory, S-41296 Gothenburg, Sweden
来源
基金
美国国家科学基金会;
关键词
CELP; LPAS; LPC quantization; spectral quantization; speech coding; variable-rate speech coding vector quantization;
D O I
10.1109/89.784101
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Phonetic classification of speech frames allows distinctive quantization and bit allocation schemes suited to the particular class. Separate quantization of the linear predictive coding (LPC) parameters for voiced and unvoiced speech frames is shown to offer useful gains for representing the synthesis filter commonly used in code-excited linear prediction (CELP) and other coders. Subjective test results are reported that determine the required bit rate and accuracy in the two classes of voiced and unvoiced LPC spectra for CELP coding with phonetic classification, It was found, in this context, that unvoiced spectra need 9 b/frame or more whereas voiced spectra need 25 b/frame or more with the quantization schemes used. New spectral distortion criteria needed to assure transparent LPC spectral quantization for each voicing class in CELP coders are presented. Similar subjective test results for speech synthesized from the true residual signal are also presented, leading to some interesting observations on the role of the analysis-by-synthesis structure of CELP, Objective performance assessments based on the spectral distortion measure are also presented. The theoretical distortion-rate function for the spectral distortion measure is estimated for voiced and unvoiced LPC parameters and compared with experimental results obtained with unstructured vector quantization (VQ). These results show a saving of at least 2 b/frame for unvoiced spectra compared to voiced spectra to achieve the same spectral distortion performance.
引用
收藏
页码:485 / 494
页数:10
相关论文
共 50 条
  • [1] Variable-Rate Finite-State Vector Quantization and Applications to Speech and Image Coding
    Hussain, Yunus
    Farvardin, Nariman
    [J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1993, 1 (01): : 25 - 38
  • [2] LPC SPEECH CODING BASED ON VARIABLE-LENGTH SEGMENT QUANTIZATION
    SHIRAKI, Y
    HONDA, M
    [J]. IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1988, 36 (09): : 1437 - 1444
  • [3] FINITE STATE CELP FOR VARIABLE-RATE SPEECH CODING
    VASEGHI, SV
    [J]. IEE PROCEEDINGS-I COMMUNICATIONS SPEECH AND VISION, 1991, 138 (06): : 603 - 610
  • [4] VARIABLE-RATE VECTOR QUANTIZATION FOR SPEECH, IMAGE, AND VIDEO COMPRESSION
    LOOKABAUGH, T
    RISKIN, EA
    CHOU, PA
    GRAY, RM
    [J]. IEEE TRANSACTIONS ON COMMUNICATIONS, 1993, 41 (01) : 186 - 199
  • [5] VARIABLE-RATE SPEECH CODING FOR ASYNCHRONOUS TRANSFER MODE
    NAKADA, H
    SATO, KI
    [J]. IEEE TRANSACTIONS ON COMMUNICATIONS, 1990, 38 (03) : 277 - 284
  • [6] A variable-rate harmonic speech coder with efficient spectral quantization
    Yu, EWM
    Chan, CF
    [J]. ISCAS '99: PROCEEDINGS OF THE 1999 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOL 3: ANALOG AND DIGITAL SIGNAL PROCESSING, 1999, : 114 - +
  • [7] CELP CODING AT VARIABLE-RATE
    CELLARIO, L
    SERENO, D
    [J]. EUROPEAN TRANSACTIONS ON TELECOMMUNICATIONS, 1994, 5 (05): : 603 - 613
  • [8] DIGITAL SPEECH INTERPOLATION FOR VARIABLE-RATE CODERS WITH APPLICATION TO SUBBAND CODING
    KOU, KY
    ONEAL, JB
    NILSSON, AA
    [J]. IEEE TRANSACTIONS ON COMMUNICATIONS, 1985, 33 (10) : 1100 - 1108
  • [9] A VARIABLE-RATE IMAGE-CODING SCHEME WITH VECTOR QUANTIZATION AND CLUSTERING INTERPOLATION
    HO, YS
    GERSHO, A
    [J]. DALLAS GLOBECOM 89, VOLS 1-3: COMMUNICATIONS TECHNOLOGY FOR THE 1990S AND BEYOND, 1989, : 898 - 902
  • [10] Adaptive variable-rate motion vector quantization
    Hwang, Wen-Jyi
    Ou, Chien-Min
    Wang, Chun-Wei
    [J]. JOURNAL OF THE CHINESE INSTITUTE OF ENGINEERS, 2006, 29 (03) : 377 - 382