FINITE STATE CELP FOR VARIABLE-RATE SPEECH CODING

被引:4
|
作者
VASEGHI, SV
机构
[1] Univ of East Anglin, Norwich
来源
关键词
SPEECH SYNTHESIS; CODING; PREDICTIVE TECHNIQUES;
D O I
10.1049/ip-i-2.1991.0078
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The performance of a variable rate code excited linear predictor system is investigated. The coding system is based on a finite state CELP (FSCELP) frame work. Each individual state is primarily identified with a LPC model order, LPC coefficients bit allocation, excitation code book population density and state encoding rate. Successive input speech vectors are encoded at a rate that depends on the current state of the FSCELP system and the input vector characteristics. The use of a finite state system involves implicit clustering of speech signals. The lower rate states are selected during highly correlated steady state speech segments when relatively few bits are required to obtain adequate fidelity. For speech signals with a strong glottal excitation, unvoiced signals and transient speech segments, a relatively greater quantisation accuracy is needed to obtain good fidelity and therefore higher rate states of the system are used. Further improvement is obtained by using gamma populated excitation codebooks, for those states that are mainly used to encode speech signals with a strong underlying glottal excitation pulses. Experiments focus on investigation of the varying encoding requirements of the excitation signal for low pass, voiced, unvoiced and transient speech signals. The parameters of the finite state CELP system are designed to match the encoding requirements of typical speech signals. The greater part of the coding gain is obtained from variable rate encoding of the excitation signal. Using a six-state FSCELP, good quality speech is obtained at an average, maximum and minimum bit rates of 4 kbit/s, 10 kbit/s and 2 kbit/s, respectively.
引用
收藏
页码:603 / 610
页数:8
相关论文
共 50 条
  • [41] REAL-TIME SPEECH SEGMENTATION USING PITCH AND CONVEXITY JUMP MODELS - APPLICATION TO VARIABLE-RATE SPEECH CODING
    DIFRANCESCO, RJ
    [J]. IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1990, 38 (05): : 741 - 748
  • [42] Variable-rate state gasoline taxes revisited
    Farkas, ZA
    [J]. TRANSPORTATION QUARTERLY, 2000, 54 (03): : 21 - 24
  • [43] Limited error based event localizing temporal decomposition and its application to variable-rate speech coding
    Nguyen, Phu Chien
    Akagi, Masato
    Nguyen, Binh Phu
    [J]. SPEECH COMMUNICATION, 2007, 49 (04) : 292 - 304
  • [44] CELP and MELP Speech Coding Techniques
    Jage, Rhutuja
    Upadhya, Savitha
    [J]. PROCEEDINGS OF THE 2016 IEEE INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS, SIGNAL PROCESSING AND NETWORKING (WISPNET), 2016, : 1398 - 1402
  • [46] Variable-Rate Coding With Constant BER for NOMA via Multilevel IRA Coding
    Chi, Yuhao
    Liu, Lei
    Guo, Jie
    Song, Guanghui
    Yuen, Chau
    Guan, Yong Liang
    [J]. IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2019, 68 (05) : 5149 - 5153
  • [47] ALGEBRAIC SPEECH CODING - A TERNARY CELP
    DIFRANCESCO, R
    [J]. ANNALES DES TELECOMMUNICATIONS-ANNALS OF TELECOMMUNICATIONS, 1992, 47 (5-6): : 214 - 226
  • [48] Variable-rate universal Slepian-Wolf coding with feedback
    Sarvotham, Shriram
    Baron, Dror
    Baraniuk, Richard G.
    [J]. 2005 39th Asilomar Conference on Signals, Systems and Computers, Vols 1 and 2, 2005, : 8 - 12
  • [49] New algorithm for variable-rate linear broadcast network coding
    夏寅
    张惕远
    黄佳庆
    [J]. Journal of Central South University, 2011, 18 (04) : 1193 - 1199
  • [50] Variable-rate distributed source coding in the presence of Byzantine sensors
    Kosut, Oliver
    Tong, Lang
    [J]. 2007 IEEE INTERNATIONAL SYMPOSIUM ON INFORMATION THEORY PROCEEDINGS, VOLS 1-7, 2007, : 2121 - 2125