Low bit-rate speech coding by perceptually optimized noise excitation modulation

Authors
Tsoukalas, D [1]
Mourjopoulos, J [1]
Kokkinakis, G [1]
Affiliation
[1] UNIV PATRAS, WIRE COMMUN LAB, PATRAS 26500, GREECE
Keywords
speech; speech coding; parametric representation
DOI
10.1016/S0165-1684(96)00151-X
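Chinese Library Classification (CLC) codes
TM [Electrical engineering]; TN [Electronics and communication technology]
Discipline classification codes
0808; 0809
Abstract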
A novel low bit-rate high-quality speech coding technique is presented based on a perceptually optimized signal reconstruction method. According to this parametric speech model, the signal's spectral envelope is reconstructed from non-linear spectral filtering of an excitation signal, which is a combination of a random broadband noise signal with a number of discrete spectral pulses extracted from the original speech using a perceptual model. This general coding platform allows variable bit-rate implementations, starting from 1.9 kbit/s, at which sufficient intelligibility (more than 92%) was measured, while at higher bit-rates (2.8 kbit/s) intelligibility scores were better than 94% with sufficient naturalness in the coded speech. In all cases, the complexity of the proposed system is very low. © 1997 Elsevier Science B.V.
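The excitation idea in the abstract (broadband noise combined with a few spectral pulses taken from the original speech, then shaped to the spectral envelope) can be illustrated with a rough sketch. This is not the authors' algorithm: the function name `mixed_excitation_frame`, the choice of the strongest FFT bins as a stand-in for the perceptual model, and the moving-average envelope used in place of the paper's non-linear spectral filtering are all assumptions made only for illustration.

```python
import numpy as np

def mixed_excitation_frame(frame, n_pulses=4, seed=0):
    """Rebuild one frame from shaped noise plus a few spectral pulses.

    Illustrative only; every design choice here is an assumption,
    not the method of the paper.
    """
    rng = np.random.default_rng(seed)
    window = np.hanning(len(frame))
    spectrum = np.fft.rfft(frame * window)
    magnitude = np.abs(spectrum)

    # Assumed stand-in for the perceptual model: keep the n_pulses
    # strongest bins as the "discrete spectral pulses".
    pulse_bins = np.argsort(magnitude)[-n_pulses:]

    # Random broadband noise: unit magnitude, uniformly random phase.
    phase = rng.uniform(-np.pi, np.pi, size=magnitude.shape)
    noise = np.exp(1j * phase)

    # Assumed coarse spectral envelope: moving average of the magnitude.
    kernel = np.ones(9) / 9.0
    envelope = np.convolve(magnitude, kernel, mode="same")

    # Shape the noise with the envelope, then put the pulses back exactly.
    shaped = noise * envelope
    shaped[pulse_bins] = spectrum[pulse_bins]
    return np.fft.irfft(shaped, n=len(frame)), np.sort(pulse_bins)

# Usage: a pure tone centred on FFT bin 10 of a 256-sample frame.
n = np.arange(256)
frame = np.sin(2 * np.pi * 10 * n / 256)
rebuilt, bins = mixed_excitation_frame(frame, n_pulses=3)
```

In a sketch like this, only the pulse positions and values plus a coarse envelope would need to be transmitted, which is the intuition behind the low bit-rates quoted in the abstract; the actual parameter set and quantization used by the authors are not given in this record.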
Pages: 77-89
Page count: 13
Related papers
50 in total
  • [1] ADAPTIVE DENSITY PULSE EXCITATION FOR LOW BIT-RATE SPEECH CODING
    AKAMINE, M
    MISEKI, K
    [J]. IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 1995, E78A (02) : 199 - 207
  • [2] A MIXED EXCITATION LPC VOCODER MODEL FOR LOW BIT-RATE SPEECH CODING
    MCCREE, AV
    BARNWELL, TP
    [J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1995, 3 (04) : 242 - 250
  • [3] Phase modelling of speech excitation for low bit-rate sinusoidal transform coding
    Sun, XQ
    Plante, F
    Cheetham, BMG
    Wong, KWT
    [J]. 1997 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I - V: VOL I: PLENARY, EXPERT SUMMARIES, SPECIAL, AUDIO, UNDERWATER ACOUSTICS, VLSI; VOL II: SPEECH PROCESSING; VOL III: SPEECH PROCESSING, DIGITAL SIGNAL PROCESSING; VOL IV: MULTIDIMENSIONAL SIGNAL PROCESSING, NEURAL NETWORKS - VOL V: STATISTICAL SIGNAL AND ARRAY PROCESSING, APPLICATIONS, 1997 : 1691 - 1694
  • [4] On the Study of Noise Allocation for Speech Signal in Low Bit-Rate Audio Coding
    Lee, Chang-Heon
    Oh, Hyen-O
    Kang, Hong-Goo
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2009, 16 (10) : 849 - 852
  • [5] SIGNAL MODELS FOR LOW BIT-RATE CODING OF SPEECH
    FLANAGAN, JL
    ISHIZAKA, K
    SHIPLEY, KL
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1980, 68 (03) : 780 - 791
  • [6] Pitch quantization in low bit-rate speech coding
    Eriksson, T
    Kang, HG
    [J]. ICASSP '99: 1999 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS VOLS I-VI, 1999 : 489 - 492
  • [7] Techniques of very low bit-rate speech coding
    Cui, HJ
    Tang, K
    Zhao, M
    Zhang, X
    [J]. CHINESE JOURNAL OF ELECTRONICS, 2004, 13 (01) : 63 - 65
  • [8] All-pass excitation phase modelling for low bit-rate speech coding
    Cheetham, BMG
    Choi, HB
    Sun, HQ
    Goodyear, CC
    Plante, F
    Wong, WTK
    [J]. ISCAS '97 - PROCEEDINGS OF 1997 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOLS I - IV: CIRCUITS AND SYSTEMS IN THE INFORMATION AGE, 1997 : 2633 - 2636
  • [9] SPEECH RECONSTRUCTION FOR MFCC-BASED LOW BIT-RATE SPEECH CODING
    Jiang Wenbin
    Ying Rendong
    Liu Peilin
    [J]. 2014 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO WORKSHOPS (ICMEW), 2014
  • [10] Bandwidth extension of narrowband speech for low bit-rate wideband coding
    Valin, JM
    Lefebvre, R
    [J]. 2000 IEEE WORKSHOP ON SPEECH CODING, PROCEEDINGS: MEETING THE CHALLENGES OF THE NEW MILLENNIUM, 2000 : 130 - 132