Joint optimization of model and excitation in CELP-type speech coders

被引:0
|
作者
Lashkari, K [1 ]
Miki, T [1 ]
机构
[1] DoCoMo USA Labs Inc, San Jose, CA USA
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper presents a new Analysis-by-Synthesis (AbS) technique for joint optimization of the excitation and model parameters based on minimizing the closed loop synthesis error instead of the linear prediction error. By minimizing the synthesis error, the analysis and synthesis stages become more compatible. Using a gradient search in the root domain, model parameters for a given excitation are optimized to minimize the error between the original and the synthesized speech. Since the optimization starts from the LPC solution, the synthesis error is guaranteed to be lower than that obtained using the LPC coefficients. For the ITU G. 729 speech codec there is about 1dB of improvement in the segmental SNR for male and female speakers over 4 to 6 second long sentences. By adding an extra optimization step, the technique can be incorporated into any parametric coder such as LPC, multi-pulse LPC and CELP-type speech coders in a bit stream. compatible manner.
引用
收藏
页码:191 / 195
页数:5
相关论文
共 50 条
  • [21] Joint pitch and voicing estimation for multiband excitation and sinusoidal speech coders
    Jia, WH
    Chan, WY
    [J]. THIRTY-SIXTH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS & COMPUTERS - CONFERENCE RECORD, VOLS 1 AND 2, CONFERENCE RECORD, 2002, : 210 - 213
  • [22] COMPLEXITY REDUCTION OF CELP SPEECH CODERS THROUGH THE USE OF PHASE INFORMATION
    RAMABADRAN, TV
    LUECK, CD
    [J]. IEEE TRANSACTIONS ON COMMUNICATIONS, 1994, 42 (2-4) : 248 - 251
  • [23] Increasing the robustness of CELP-based coders by constrained optimization
    Chibani, M
    Gournay, P
    Lefebvre, R
    [J]. 2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 785 - 788
  • [24] EMBEDDED ALGEBRAIC CELP/VSELP CODERS FOR WIDE-BAND SPEECH CODING
    LEGUYADER, A
    LAMBLIN, C
    BOURSICAUT, E
    [J]. SPEECH COMMUNICATION, 1995, 16 (04) : 319 - 328
  • [25] Joint optimization of excitation parameters in analysis-by-synthesis speech coders having multi-tap long term predictor
    Mittal, W
    Ashley, JP
    Cruz-Zeno, EM
    Jasiuk, MA
    [J]. 2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 789 - 792
  • [26] Pitch adaptive windows for improved excitation coding in low-rate CELP coders
    Rao, AV
    Ahmadi, S
    Lindén, J
    Gersho, A
    Cuperman, V
    Heidari, R
    [J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2003, 11 (06): : 648 - 659
  • [27] CELP AND SINUSOIDAL CODERS - 2 SOLUTIONS FOR SPEECH CODING AT 4.8-9.6 KBPS
    TRANCOSO, IM
    MARQUES, JS
    RIBEIRO, CM
    [J]. SPEECH COMMUNICATION, 1990, 9 (5-6) : 389 - 400
  • [28] Improved optimisation of excitation sequences in speech and audio coders
    Riera-Palou, F
    den Brinker, AC
    Gerrits, AJ
    Sluijter, RJ
    [J]. ELECTRONICS LETTERS, 2004, 40 (08) : 515 - 517
  • [29] Wideband re-synthesis of narrowband CELP coded speech using multiband excitation model
    Chan, CF
    Hui, WK
    [J]. ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 322 - 325
  • [30] A novel approach to excitation coding in low-bit-rate high-quality CELP coders
    Cuperman, V
    Gersho, A
    Lindén, J
    Rao, A
    Yang, TC
    Ahmadi, S
    Heidari, R
    Liu, FH
    [J]. 2000 IEEE WORKSHOP ON SPEECH CODING, PROCEEDINGS: MEETING THE CHALLENGES OF THE NEW MILLENNIUM, 2000, : 14 - 16