A MIXED EXCITATION LPC VOCODER MODEL FOR LOW BIT-RATE SPEECH CODING

Cited by: 155
Authors
MCCREE, AV [1]
BARNWELL, TP [1]
Affiliation
[1] GEORGIA INST TECHNOL, SCH ELECT ENGN, ATLANTA, GA 30332
Source
Keywords
DOI
10.1109/89.397089
CLC number
O42 [Acoustics]
Discipline classification codes
070206; 082403
Abstract
Traditional pitch-excited linear predictive coding (LPC) vocoders use a fully parametric model to efficiently encode the important information in human speech. These vocoders can produce intelligible speech at low data rates (800-2400 b/s), but they often sound synthetic and generate annoying artifacts such as buzzes, thumps, and tonal noises. These problems increase dramatically if acoustic background noise is present at the speech input. This paper presents a new mixed excitation LPC vocoder model that preserves the low bit rate of a fully parametric model but adds more free parameters to the excitation signal so that the synthesizer can mimic more characteristics of natural human speech. The new model also eliminates the traditional requirement for a binary voicing decision so that the vocoder performs well even in the presence of acoustic background noise. A 2400-b/s LPC vocoder based on this model has been developed and implemented in simulations and in a real-time system. Formal subjective testing of this coder confirms that it produces natural-sounding speech even in a difficult noise environment. In fact, diagnostic acceptability measure (DAM) test scores show that the performance of the 2400-b/s mixed excitation LPC vocoder is close to that of the government standard 4800-b/s CELP coder.
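The contrast the abstract draws, between a binary voiced/unvoiced decision and an excitation that mixes both components, can be illustrated with a minimal sketch. This is not the authors' model: the single full-band `voicing` weight, the function names, and the filter coefficient convention are all assumptions made for illustration, and the paper's actual excitation parameters are not given in this record.

```python
import math
import random


def mixed_excitation(n_samples, pitch_period, voicing, seed=0):
    """Blend a periodic pulse train with white noise.

    `voicing` is a hypothetical mixing weight in [0, 1]: 1.0 yields pure
    pulses, 0.0 yields pure noise, and values in between replace the
    hard voiced/unvoiced switch of a classic pitch-excited LPC vocoder.
    """
    rng = random.Random(seed)
    # Scale pulses so the pulse train has unit average power, like the noise.
    pulse_gain = math.sqrt(pitch_period)
    out = []
    for n in range(n_samples):
        pulse = pulse_gain if n % pitch_period == 0 else 0.0
        noise = rng.gauss(0.0, 1.0)
        out.append(voicing * pulse + (1.0 - voicing) * noise)
    return out


def lpc_synthesize(excitation, a):
    """All-pole synthesis filter 1 / A(z), with A(z) = 1 + sum_k a[k-1] z^-k.

    Implements s[n] = e[n] - sum_k a[k-1] * s[n-k] with zero initial state.
    """
    out = []
    for n, e in enumerate(excitation):
        acc = e
        for k, ak in enumerate(a):
            if n - 1 - k >= 0:
                acc -= ak * out[n - 1 - k]
        out.append(acc)
    return out


# Example: a partly voiced excitation driven through a one-pole filter.
speech = lpc_synthesize(mixed_excitation(160, 40, 0.7), [-0.9])
```

Setting `voicing` to exactly 0.0 or 1.0 recovers the two states of a binary voicing decision; intermediate values are what a mixed excitation adds.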
Pages: 242-250
Page count: 9
Related papers
50 in total
  • [31] LOW BIT-RATE CODING OF MOVING IMAGES
    HASKELL, BG
    PEARSON, D
    YAMAMOTO, H
    [J]. IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 1987, 5 (07) : 1065 - 1067
  • [32] Quad-band excitation for low bit rate speech coding
    Chiu, KM
    Ching, PC
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1996, 99 (04) : 2365 - 2369
  • [33] An unified unit-selection framework for ultra low bit-rate speech coding
    Ramasubramanian, V.
    Harish, D.
    [J]. INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006 : 213 - 216
  • [34] An unified unit-selection framework for ultra low bit-rate speech coding
    Ramasubramanian, V.
    Harish, D.
    [J]. INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006 : 217 - 220
  • [35] Linear inter-frame dependencies for very low bit-rate speech coding
    López-Soler, JM
    Sánchez, V
    de la Torre, A
    Rubio-Ayuso, AJ
    [J]. SPEECH COMMUNICATION, 2001, 34 (04) : 333 - 349
  • [36] Speech excitation modelling for low bit speech coding
    Sun, XQ
    Cheetham, BMG
    [J]. 1997 IEEE WORKSHOP ON SPEECH CODING FOR TELECOMMUNICATIONS, PROCEEDINGS: BACK TO BASICS: ATTACKING FUNDAMENTAL PROBLEMS IN SPEECH CODING, 1997 : 9 - 10
  • [37] An optimal unit-selection algorithm for ultra low bit-rate speech coding
    Ramasubramanian, V.
    Harish, D.
    [J]. 2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PTS 1-3, 2007, : 541 - +
  • [38] VARIABLE BIT-RATE CELP CODING OF SPEECH WITH PHONETIC CLASSIFICATION
    PAKSOY, E
    SRINIVASAN, K
    GERSHO, A
    [J]. EUROPEAN TRANSACTIONS ON TELECOMMUNICATIONS, 1994, 5 (05) : 591 - 601
  • [39] SPEECH CLASSIFICATION EMBEDDED IN ADAPTIVE CODEBOOK SEARCH FOR LOW BIT-RATE CELP CODING
    KUO, CC
    JEAN, FR
    WANG, HC
    [J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1995, 3 (01) : 94 - 98
  • [40] Enhanced waveform interpolative coding at low bit-rate
    Gottesman, O
    Gersho, A
    [J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2001, 9 (08) : 786 - 798