A MIXED EXCITATION LPC VOCODER MODEL FOR LOW BIT-RATE SPEECH CODING

Cited by: 155
Authors
MCCREE, AV [1]
BARNWELL, TP [1]
Affiliation
[1] GEORGIA INST TECHNOL, SCH ELECT ENGN, ATLANTA, GA 30332
DOI
10.1109/89.397089
Chinese Library Classification number
O42 [Acoustics]
Discipline classification code
070206; 082403
Abstract
Traditional pitch-excited linear predictive coding (LPC) vocoders use a fully parametric model to efficiently encode the important information in human speech. These vocoders can produce intelligible speech at low data rates (800-2400 b/s), but they often sound synthetic and generate annoying artifacts such as buzzes, thumps, and tonal noises. These problems increase dramatically if acoustic background noise is present at the speech input. This paper presents a new mixed excitation LPC vocoder model that preserves the low bit rate of a fully parametric model but adds more free parameters to the excitation signal so that the synthesizer can mimic more characteristics of natural human speech. The new model also eliminates the traditional requirement for a binary voicing decision so that the vocoder performs well even in the presence of acoustic background noise. A 2400-b/s LPC vocoder based on this model has been developed and implemented in simulations and in a real-time system. Formal subjective testing of this coder confirms that it produces natural-sounding speech even in a difficult noise environment. In fact, diagnostic acceptability measure (DAM) test scores show that the performance of the 2400-b/s mixed excitation LPC vocoder is close to that of the government standard 4800-b/s CELP coder.
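The abstract does not spell out how the excitation is built, so the following is only a minimal illustrative sketch of the general idea of replacing a binary voiced/unvoiced switch with a continuous blend of pulses and noise, then shaping the result with an all-pole LPC synthesis filter. The function names, the single `voicing` weight in [0, 1], and the filter convention A(z) = 1 + sum of a[k] z^-(k+1) are all assumptions made for illustration; they are not taken from the paper.

```python
import random


def mixed_excitation(n_samples, pitch_period, voicing, seed=0):
    """Blend a periodic pulse train with white noise (hypothetical helper).

    voicing = 1.0 gives pure pulses, 0.0 gives pure noise, and values in
    between give a mix instead of a hard voiced/unvoiced decision.
    """
    rng = random.Random(seed)
    pulse_gain = voicing
    noise_gain = 1.0 - voicing
    out = []
    for n in range(n_samples):
        pulse = 1.0 if n % pitch_period == 0 else 0.0
        noise = rng.uniform(-1.0, 1.0)
        out.append(pulse_gain * pulse + noise_gain * noise)
    return out


def lpc_synthesize(excitation, a):
    """All-pole synthesis filter 1/A(z) (assumed sign convention).

    Computes s[n] = e[n] - sum_k a[k] * s[n - 1 - k].
    """
    s = []
    for n, e in enumerate(excitation):
        acc = e
        for k, ak in enumerate(a):
            if n - 1 - k >= 0:
                acc -= ak * s[n - 1 - k]
        s.append(acc)
    return s


# Example: a half-voiced frame passed through a one-pole filter.
frame = lpc_synthesize(mixed_excitation(16, 4, 0.5), [-0.5])
```

With `voicing` set to 1.0 or 0.0 this sketch reduces to the classic two-state pitch-excited model; intermediate values are what a soft voicing decision would allow.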
Pages: 242-250
Number of pages: 9