Joint optimization of model and excitation in CELP-type speech coders

被引:0
|
作者
Lashkari, K [1 ]
Miki, T [1 ]
机构
[1] DoCoMo USA Labs Inc, San Jose, CA USA
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper presents a new Analysis-by-Synthesis (AbS) technique for joint optimization of the excitation and model parameters based on minimizing the closed loop synthesis error instead of the linear prediction error. By minimizing the synthesis error, the analysis and synthesis stages become more compatible. Using a gradient search in the root domain, model parameters for a given excitation are optimized to minimize the error between the original and the synthesized speech. Since the optimization starts from the LPC solution, the synthesis error is guaranteed to be lower than that obtained using the LPC coefficients. For the ITU G. 729 speech codec there is about 1dB of improvement in the segmental SNR for male and female speakers over 4 to 6 second long sentences. By adding an extra optimization step, the technique can be incorporated into any parametric coder such as LPC, multi-pulse LPC and CELP-type speech coders in a bit stream. compatible manner.
引用
收藏
页码:191 / 195
页数:5
相关论文
共 50 条
  • [41] Low-complexity fuzzy control of excitation gain in LD-CELP speech coding
    Beritelli, F
    Casale, S
    Cavallaro, A
    [J]. ELECTRONICS LETTERS, 1997, 33 (22) : 1846 - 1847
  • [42] Low-complexity fuzzy control of excitation gain in LD-CELP speech coding
    Univ of Catania, Catania, Italy
    [J]. Electron Lett, 22 (1846-1847):
  • [43] JOINT ESTIMATION OF SHORT-TERM AND LONG-TERM PREDICTORS IN SPEECH CODERS
    Giacobello, Daniele
    Christensen, Mads Graesboll
    Dahl, Joachim
    Jensen, Soren Holdt
    Moonen, Marc
    [J]. 2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 4109 - +
  • [44] 4 kb/s Muti-Pulse Based CELP speech coding using excitation switching
    Ozawa, K
    [J]. ICASSP '99: 1999 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS VOLS I-VI, 1999, : 189 - 192
  • [45] Quality enhancement of CELP coded speech by using a voicing gaussian mixture model
    Raza, DG
    Chan, CF
    [J]. 2002 6TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS, VOLS I AND II, 2002, : 452 - 455
  • [46] A NEURAL NETWORK APPROACH FOR JOINT OPTIMIZATION OF PREDICTORS IN LIFTING-BASED IMAGE CODERS
    Dardouri, T.
    Kaaniche, M.
    Benazza-Benyahia, A.
    Pesquet, J-C
    Dauphin, G.
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2021, : 3747 - 3751
  • [47] A CELP-based hybrid digital-analog (HDA) joint source-channel speech coder
    Phamdo, N
    Mittal, U
    [J]. 2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS, VOLS I-VI, 2000, : 1487 - 1490
  • [48] Blind speech separation using a joint model of speech production
    Smith, D
    Lukasiak, J
    Burnett, I
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2005, 12 (11) : 784 - 787
  • [49] Optimization of Excitation in MR Imaging for Joint Tissues Visualization
    Netreba, A., V
    Pershina, T. B.
    Radchenko, S. P.
    [J]. 2014 IEEE 34TH INTERNATIONAL CONFERENCE ON ELECTRONICS AND NANOTECHNOLOGY (ELNANO), 2014, : 310 - 312
  • [50] Joint On-Demand Pruning and Online Distillation in Automatic Speech Recognition Language Model Optimization
    Seo, Soonshin
    Kim, Ji-Hwan
    [J]. CMC-COMPUTERS MATERIALS & CONTINUA, 2023, 77 (03): : 2833 - 2856