Sinusoidal speech coding at 2.4 kbps using an improved phase matching algorithm

被引:0
|
作者
Ahmadi, S [1 ]
Spanias, AS [1 ]
机构
[1] Nokia Mobile Phones Inc, San Diego, CA 92121 USA
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper addresses the design, development, evaluation, and implementation of efficient low bit rate speech coding algorithms based on the sinusoidal model. A series of algorithms have been developed for pitch frequency determination and voicing detection, simultaneous modeling of the sinusoidal amplitudes and phases, and mid-frame interpolation. An improved sinusoidal phase matching algorithm is presented, where short-time sinusoidal phases are approximated using an elaborate combination of linear prediction, spectral sampling, delay compensation, and phase correction techniques. A voicing-dependent perceptual split vector quantization scheme is used to encode the sinusoidal amplitudes. The perceptual properties of the human auditory system are effectively exploited in the developed algorithms. The algorithms have been successfully integrated into a 2.4 kbps sinusoidal coder. The performance of the 2.4 kbps coder has been evaluated in terms of subjective tests such as the mean opinion score and the diagnostic rhyme test, as well as some perceptually-motivated objective distortion measures. Performance analysis on a large speech database indicates that the use of the proposed algorithms resulted in considerable improvement in temporal and spectral signal matching, as well as improved subjective quality of the reproduced speech.
引用
收藏
页码:1075 / 1079
页数:5
相关论文
共 50 条
  • [1] Low rate sinusoidal coding of speech using an improved phase matching algorithm
    Ahmadi, S
    Spanias, AS
    1997 IEEE WORKSHOP ON SPEECH CODING FOR TELECOMMUNICATIONS, PROCEEDINGS: BACK TO BASICS: ATTACKING FUNDAMENTAL PROBLEMS IN SPEECH CODING, 1997, : 35 - 36
  • [2] A 2.4kbps Multiband Characteristic Waveform Interpolation Speech Coding Algorithm
    Tang, Yibin
    Huang, Rong
    Wu, Zhenyang
    PROCEEDINGS OF THE 2009 2ND INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING, VOLS 1-9, 2009, : 4283 - 4286
  • [3] Multiband excitation coding of speech at 1.8-2.4 kbps
    Zhang, Jun
    Xiao, Zimei
    Wei, Gang
    Shengxue Xuebao/Acta Acustica, 2002, 27 (05): : 398 - 404
  • [4] Very low complexity interpolative speech coding at 1.2 to 2.4 kbps
    Shoham, Y
    1997 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I - V: VOL I: PLENARY, EXPERT SUMMARIES, SPECIAL, AUDIO, UNDERWATER ACOUSTICS, VLSI; VOL II: SPEECH PROCESSING; VOL III: SPEECH PROCESSING, DIGITAL SIGNAL PROCESSING; VOL IV: MULTIDIMENSIONAL SIGNAL PROCESSING, NEURAL NETWORKS - VOL V: STATISTICAL SIGNAL AND ARRAY PROCESSING, APPLICATIONS, 1997, : 1599 - 1602
  • [5] Matching pursuits sinusoidal speech coding
    Etemoglu, ÇÖ
    Cuperman, V
    IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2003, 11 (05): : 413 - 424
  • [6] Low complexity speech coding at 1.2 to 2.4 kbps based on waveform interpolation
    Shoham Y.
    International Journal of Speech Technology, 1999, 2 (4) : 329 - 341
  • [7] Variable rate multi-mode excitation coding of speech at 2.4 kbps
    Wang, SH
    2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS, VOLS I-VI, 2000, : 1395 - 1398
  • [8] CELP AND SINUSOIDAL CODERS - 2 SOLUTIONS FOR SPEECH CODING AT 4.8-9.6 KBPS
    TRANCOSO, IM
    MARQUES, JS
    RIBEIRO, CM
    SPEECH COMMUNICATION, 1990, 9 (5-6) : 389 - 400
  • [9] Improved sinusoidal transform coding algorithm
    You, H.
    Chen, J.
    Shu Ju Cai Ji Yu Chu Li/Journal of Data Acquisition and Processing, 2001, 16 (02): : 189 - 192
  • [10] Research on Low Delay 11.2kbps Speech Coding Algorithm
    Zhao, Zhefeng
    Zhang, Gang
    Wang, Yiping
    ARTIFICIAL INTELLIGENCE AND COMPUTATIONAL INTELLIGENCE, PT I, 2011, 7002 : 276 - 281