Low-rate multimode multiband spectral coding of speech

被引:2
|
作者
Das A. [1 ]
Gersho A. [2 ]
机构
[1] Qualcomm, Inc., San Diego, CA 92121, 6455, Lusk Boulevard
[2] Department of Electrical and Computer Engineering, University of California, Santa Barbara
关键词
sinusoidal coding; vector quantization; EMBE coder; satellite communications; variable dimension VQ;
D O I
10.1007/BF02108647
中图分类号
学科分类号
摘要
At bit rates of 4 kbps and below, conventional time-domain algorithms such as CELP fail to retain high voice quality and robust performance against background noise as their waveform-matching ability is curtailed by the severely limited codebook space. Spectral coding, on the other hand, offers an effective parametric model, amenable for low-rate implementation. Instead of performing waveform matching, spectral coders preserve only the perceptually important spectral attributes of the speech signal. Spectral coding algorithms encompass a broad family of emerging low-rate speech coding techniques, the common goal being the representation of the short-term spectrum of input speech with a limited set of spectral parameters and the synthesis of the output speech with a set of sinusoids. Pitch, frequency-domain voicing information, and a varying number of spectral magnitudes are the usual parameters of spectral coders. In this paper, we present the enhanced multiband excitation (EMBE) coder as an illustration of this new generation of low-rate spectral coders. The distinguishing features of EMBE are: (a) signal-adaptive multimode spectral modeling and parameter quantization, (b) two-band signal-adaptive frequency-domain voicing decision, (c) a novel VQ scheme for the efficient encoding of the variable-dimension spectral magnitude vectors at low-rates, and (d) multi-class selective protection of spectral parameters from channel errors. A 4 kbps implementation of the EMBE spectral coding algorithm with 2.9 kbps source coding and 1.1 kbps for channel coding was specifically designed for satellite-based communication systems, targeting good voice quality at low bit rates and robust performance against channel errors. Fundamental concepts of the EMBE spectral coding algorithm, implementation details, and performance comparisons of the 4 kbps EMBE coder with earlier coders are reported.
引用
收藏
页码:317 / 327
页数:10
相关论文
共 50 条
  • [1] Multimode variable bit rate speech coding: An efficient paradigm for high-quality low-rate representation of speech signal
    Das, A
    DeJaco, A
    Manjunath, S
    Ananthapadmanabhan, A
    Huang, J
    Choy, E
    [J]. ICASSP '99: 1999 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS VOLS I-VI, 1999, : 2307 - 2310
  • [2] The consequences of linguistic perception on low-rate speech coding
    Parry, JJ
    Burnett, IS
    Chicharo, JF
    [J]. 1997 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I - V: VOL I: PLENARY, EXPERT SUMMARIES, SPECIAL, AUDIO, UNDERWATER ACOUSTICS, VLSI; VOL II: SPEECH PROCESSING; VOL III: SPEECH PROCESSING, DIGITAL SIGNAL PROCESSING; VOL IV: MULTIDIMENSIONAL SIGNAL PROCESSING, NEURAL NETWORKS - VOL V: STATISTICAL SIGNAL AND ARRAY PROCESSING, APPLICATIONS, 1997, : 1383 - 1386
  • [3] Signal processing for cochlear implants and low-rate speech coding
    Loizou, PC
    [J]. 2000 IEEE WORKSHOP ON SPEECH CODING, PROCEEDINGS: MEETING THE CHALLENGES OF THE NEW MILLENNIUM, 2000, : 68 - 68
  • [4] Speech Inventory Based Discriminative Training for Joint Speech Enhancement and Low-Rate Speech Coding
    Xiao, Xiaoqiang
    Nickel, Robert M.
    [J]. 11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 3 AND 4, 2010, : 2398 - +
  • [5] Low-rate CELP speech coding using an improved weighting function
    Kwon, CH
    Un, CK
    [J]. 1997 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I - V: VOL I: PLENARY, EXPERT SUMMARIES, SPECIAL, AUDIO, UNDERWATER ACOUSTICS, VLSI; VOL II: SPEECH PROCESSING; VOL III: SPEECH PROCESSING, DIGITAL SIGNAL PROCESSING; VOL IV: MULTIDIMENSIONAL SIGNAL PROCESSING, NEURAL NETWORKS - VOL V: STATISTICAL SIGNAL AND ARRAY PROCESSING, APPLICATIONS, 1997, : 743 - 746
  • [6] A new approach to modeling excitation in very low-rate speech coding
    Ghaemmaghami, S
    Deriche, M
    [J]. PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-6, 1998, : 597 - 600
  • [7] Anew approach to very low-rate speech coding using temporal decomposition
    Ghaemmaghami, S
    Deriche, M
    [J]. 1996 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, CONFERENCE PROCEEDINGS, VOLS 1-6, 1996, : 224 - 227
  • [8] SPEAKER IDENTIFICATION IN LOW-RATE CODED SPEECH
    Catellier, Andrew
    Voran, Stephen
    [J]. MEASUREMENT OF SPEECH, AUDIO AND VIDEO QUALITY IN NETWORKS, 2008, : 27 - 36
  • [9] Design of good low-rate coding schemes for ISI channels based on spectral shaping
    Doan, DN
    Narayanan, KR
    [J]. IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2005, 4 (05) : 2309 - 2317
  • [10] LOW-RATE TREE CODING OF AUTOREGRESSIVE SOURCES
    SETHIA, ML
    ANDERSON, JB
    [J]. IEEE TRANSACTIONS ON INFORMATION THEORY, 1983, 29 (02) : 279 - 284