Analysis-by-synthesis multimode harmonic speech coding at 4 kb/s

被引:0
|
作者
Li, CY [1 ]
Cuperman, V [1 ]
机构
[1] Univ Calif Santa Barbara, Dept Elect & Comp Engn, Santa Barbara, CA 93106 USA
关键词
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This paper presents a 4 kb/s Analysis-by-Synthesis Multimode Harmonic Coder (AbS-MHC). Novel features of this coder include a signal modification technique that allows time-domain analysis-by-synthesis parameter estimation in sinusoidal coding framework, and a frequency-domain transition speech model with improved parameter estimation and quantization schemes. An efficient quantization scheme for harmonic magnitudes based on Weighted Non-Square Transform Vector Quantization (WNSTVQ) is also used. Subjective quality tests indicate that the 4 kb/s AbS-MHC coder outperforms the 5.3 kb/s G.723.1 standard CELP coder and produces speech quality very similar to the 6.3 kb/s G.723.1 coder.
引用
收藏
页码:1367 / 1370
页数:4
相关论文
共 50 条
  • [41] High quality waveform interpolaton speech coding at 2kb/s
    Bao Changchun
    Li Jing
    Qi Fengyan
    CHINESE JOURNAL OF ELECTRONICS, 2007, 16 (02): : 257 - 262
  • [42] Multi-prototype waveform coding using frame-by-frame Analysis-by-Synthesis
    Burnett, IS
    Pham, DH
    1997 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I - V: VOL I: PLENARY, EXPERT SUMMARIES, SPECIAL, AUDIO, UNDERWATER ACOUSTICS, VLSI; VOL II: SPEECH PROCESSING; VOL III: SPEECH PROCESSING, DIGITAL SIGNAL PROCESSING; VOL IV: MULTIDIMENSIONAL SIGNAL PROCESSING, NEURAL NETWORKS - VOL V: STATISTICAL SIGNAL AND ARRAY PROCESSING, APPLICATIONS, 1997, : 1567 - 1570
  • [43] LOW-DELAY VECTOR EXCITATION CODING OF SPEECH AT 16 KB/S
    CUPERMAN, V
    GERSHO, A
    PETTIGREW, R
    YAO, JH
    IEEE TRANSACTIONS ON COMMUNICATIONS, 1992, 40 (01) : 129 - 139
  • [44] A variable-rate multimodal speech coder with gain-matched analysis-by-synthesis
    Paksoy, E
    McCree, A
    Viswanathan, V
    1997 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I - V: VOL I: PLENARY, EXPERT SUMMARIES, SPECIAL, AUDIO, UNDERWATER ACOUSTICS, VLSI; VOL II: SPEECH PROCESSING; VOL III: SPEECH PROCESSING, DIGITAL SIGNAL PROCESSING; VOL IV: MULTIDIMENSIONAL SIGNAL PROCESSING, NEURAL NETWORKS - VOL V: STATISTICAL SIGNAL AND ARRAY PROCESSING, APPLICATIONS, 1997, : 751 - 754
  • [45] ANALYSIS-BY-SYNTHESIS FEATURE ESTIMATION FOR ROBUST AUTOMATIC SPEECH RECOGNITION USING SPECTRAL MASKS
    Mandel, Michael I.
    Narayanan, Arun
    2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [46] Improving Speech-Based Emotion Recognition by Using Psychoacoustic Modeling and Analysis-by-Synthesis
    Siegert, Ingo
    Lotz, Alicia Flores
    Egorow, Olga
    Wendemuth, Andreas
    SPEECH AND COMPUTER, SPECOM 2017, 2017, 10458 : 445 - 455
  • [47] Closed loop dynamic bit allocation for excitation parameters in analysis-by-synthesis speech codec
    Ashley, James P.
    Mittal, Udar
    2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PTS 1-3, 2007, : 1109 - +
  • [48] 3.35kb/s low bit-rate speech coding algorithm
    Li, Yue
    Tang, Kun
    Cui, Huijuan
    Du, Wen
    Qinghua Daxue Xuebao/Journal of Tsinghua University, 2004, 44 (10): : 1410 - 1413
  • [49] A REAL-TIME IMPLEMENTATION OF 4.2Kb/s CELP SPEECH CODING
    Bao Changchun Dai Yisong Fan Changxin(information Science Institute
    JournalofElectronics(China), 1997, (01) : 52 - 58
  • [50] Quantization of SEW and REW magnitude for 2 kb/s waveform interpolation speech coding
    Li, J
    Bao, CC
    2004 INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, 2004, : 141 - 144