Speech coding with an analysis-by-synthesis sinusoidal model

被引:0
|
作者
Etemoglu, ÇÖ [1 ]
Cuperman, V [1 ]
Gersho, A [1 ]
机构
[1] Univ Calif Santa Barbara, Dept Elect & Comp Engn, Santa Barbara, CA 93106 USA
关键词
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
We introduce a general and powerful approach to sinusoidal modeling of speech wherein a closed-loop Analysis-by-Synthesis (AbS) technique sequentially extracts the parameters for each sinusoidal component. Low bit-rate speech coding is achieved by efficiently constraining the allowed frequencies of sinusoidal components into sets of frequency intervals or bins. In conjunction with the closed-loop analysis, the constrained frequency regions allow us to efficiently vector quantize the frequency information in each frame. In voiced frames, two sets of frequency vectors are generated: one for harmonically related components and the other for non-harmonically related components of the voiced segment. In transition frames, a vector of nonuniformly spaced frequencies is selected from a frequency codebook using frequency bin vector quantization (FBVQ) to represent the frequency domain information. The effectiveness of the coding scheme is enhanced by exploiting the critical band concept of auditory perception in defining the frequency bins. In transition segments, the sinusoidal phases are modeled and coded. Subjective tests with a partially quantized model indicate that, for a target rate of 4 kbps, the coder quality exceeds that of the G.729 standard at 8 kbps.
引用
收藏
页码:1371 / 1374
页数:4
相关论文
共 50 条
  • [1] Speech analysis synthesis and modification using an analysis-by-synthesis/overlap-add sinusoidal model
    George, EB
    Smith, MJT
    IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1997, 5 (05): : 389 - 406
  • [2] Analysis-by-synthesis speech coding with quantization noise modeling
    Andersen, SV
    Kleijn, WB
    Jensen, SH
    Hansen, E
    CONFERENCE RECORD OF THE THIRTY-SECOND ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS & COMPUTERS, VOLS 1 AND 2, 1998, : 333 - 337
  • [3] Analysis-by-synthesis sinusoidal model without an overlapping scheme
    Kim, Jong-Hark
    Jeong, Gyu-Hyeok
    Lee, In-Sung
    IEICE TRANSACTIONS ON COMMUNICATIONS, 2008, E91B (06) : 2094 - 2096
  • [4] Analysis-by-synthesis multimode harmonic speech coding at 4 kb/s
    Li, CY
    Cuperman, V
    2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS, VOLS I-VI, 2000, : 1367 - 1370
  • [5] ANALYSIS-BY-SYNTHESIS LINEAR PREDICTIVE SPEECH CODING AT 2.4 KBIT/S
    TZENG, FF
    DALLAS GLOBECOM 89, VOLS 1-3: COMMUNICATIONS TECHNOLOGY FOR THE 1990S AND BEYOND, 1989, : 1253 - 1257
  • [6] Analysis-by-synthesis features for speech recognition
    Al Bawab, Ziad
    Raj, Bhiksha
    Stern, Richard M.
    2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 4185 - +
  • [7] Segmental sinusoidal model for speech coding
    Setiawan, Florentinus Budi
    Hartono, Sugi
    Soegijoko, Soegijardjo
    Tjondronegoro, Suhartono
    2007 6TH INTERNATIONAL CONFERENCE ON INFORMATION, COMMUNICATIONS & SIGNAL PROCESSING, VOLS 1-4, 2007, : 1206 - +
  • [8] A new structural approach in system identification with generalized analysis-by-synthesis for robust speech coding
    Chang, JH
    Kim, NS
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2006, 14 (03): : 747 - 751
  • [9] REDUCTION OF SPEECH SPECTRA BY ANALYSIS-BY-SYNTHESIS TECHNIQUES
    BELL, CG
    STEVENS, KN
    HOUSE, AS
    FUJISAKI, H
    HEINZ, JM
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1961, 33 (12): : 1725 - &
  • [10] Segmental Sinusoidal Model for Speech Signal Coding
    Setiawan, Florentinus Budi
    Soegijoko, Soegijardjo
    Sugihartono
    Tjondronegoro, Suhartono
    MAKARA JOURNAL OF TECHNOLOGY, 2006, 10 (02): : 61 - 66