Speech coding with an analysis-by-synthesis sinusoidal model

Cited by: 0
Authors
Etemoglu, ÇÖ [1]
Cuperman, V [1]
Gersho, A [1]
Affiliations
[1] Univ Calif Santa Barbara, Dept Elect & Comp Engn, Santa Barbara, CA 93106 USA
Keywords
DOI
Not available
Chinese Library Classification number
O42 [Acoustics];
Discipline classification code
070206; 082403;
Abstract
We introduce a general and powerful approach to sinusoidal modeling of speech wherein a closed-loop Analysis-by-Synthesis (AbS) technique sequentially extracts the parameters for each sinusoidal component. Low bit-rate speech coding is achieved by efficiently constraining the allowed frequencies of sinusoidal components into sets of frequency intervals or bins. In conjunction with the closed-loop analysis, the constrained frequency regions allow us to efficiently vector quantize the frequency information in each frame. In voiced frames, two sets of frequency vectors are generated: one for harmonically related components and the other for non-harmonically related components of the voiced segment. In transition frames, a vector of nonuniformly spaced frequencies is selected from a frequency codebook using frequency bin vector quantization (FBVQ) to represent the frequency domain information. The effectiveness of the coding scheme is enhanced by exploiting the critical band concept of auditory perception in defining the frequency bins. In transition segments, the sinusoidal phases are modeled and coded. Subjective tests with a partially quantized model indicate that, for a target rate of 4 kbps, the coder quality exceeds that of the G.729 standard at 8 kbps.
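
The closed-loop, bin-constrained extraction described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' coder: it assumes a greedy search that fits one sinusoid per frequency bin by least squares against the current residual, and it omits vector quantization, the harmonic/non-harmonic split, and any perceptual weighting. The function name `abs_sinusoidal_analysis`, the 5 Hz search grid, and the bin edges in the usage lines are hypothetical choices made only for illustration.

```python
import numpy as np

def abs_sinusoidal_analysis(frame, fs, bins, grid_hz=5.0):
    """Greedy analysis-by-synthesis: pick one sinusoid per frequency bin.

    frame   : 1-D array holding one speech frame
    fs      : sampling rate in Hz
    bins    : list of (low_hz, high_hz) intervals (hypothetical edges)
    grid_hz : spacing of candidate frequencies inside each bin (assumption)
    Returns a list of (freq_hz, amplitude, phase) and the final residual.
    """
    n = np.arange(len(frame))
    residual = np.asarray(frame, dtype=float).copy()
    params = []
    for lo, hi in bins:
        best = None
        for f in np.arange(lo, hi + 1e-9, grid_hz):
            w = 2.0 * np.pi * f / fs
            # Cos/sin pair: solving for both gives amplitude and phase jointly.
            basis = np.column_stack([np.cos(w * n), np.sin(w * n)])
            coef, *_ = np.linalg.lstsq(basis, residual, rcond=None)
            # Closed loop: score the candidate by its synthesis error.
            err = np.sum((residual - basis @ coef) ** 2)
            if best is None or err < best[0]:
                best = (err, f, coef, basis)
        _, f, coef, basis = best
        # Remove the chosen component before moving to the next bin.
        residual = residual - basis @ coef
        amp = np.hypot(coef[0], coef[1])
        # a*cos(wn) + b*sin(wn) = amp*cos(wn + phase) with phase = atan2(-b, a)
        phase = np.arctan2(-coef[1], coef[0])
        params.append((float(f), float(amp), float(phase)))
    return params, residual

# Usage on a synthetic 20 ms frame at 8 kHz (test signal, not real speech).
fs = 8000
n = np.arange(160)
frame = (1.0 * np.cos(2 * np.pi * 200 * n / fs + 0.3)
         + 0.5 * np.cos(2 * np.pi * 1000 * n / fs - 1.0))
params, residual = abs_sinusoidal_analysis(frame, fs, [(100, 300), (900, 1100)])
for freq, amp, phase in params:
    print(f"{freq:7.1f} Hz  amp={amp:.3f}  phase={phase:+.3f}")
```

Restricting each component to a bin means only an index into a small candidate set has to be transmitted per component, which is what makes vector quantizing the frequency information inexpensive in a scheme of this kind.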
Pages: 1371-1374
Page count: 4