Speech coding with an analysis-by-synthesis sinusoidal model

被引：0

作者：

Etemoglu, ÇÖ ^{[1
]}

Cuperman, V ^{[1
]}

Gersho, A ^{[1
]}

机构：

[1] Univ Calif Santa Barbara, Dept Elect & Comp Engn, Santa Barbara, CA 93106 USA

来源：

2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS, VOLS I-VI | 2000年

关键词：

D O I：

暂无

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

We introduce a general and powerful approach to sinusoidal modeling of speech wherein a closed-loop Analysis-by-Synthesis (AbS) technique sequentially extracts the parameters for each sinusoidal component. Low bit-rate speech coding is achieved by efficiently constraining the allowed frequencies of sinusoidal components into sets of frequency intervals or bins. In conjunction with the closed-loop analysis, the constrained frequency regions allow us to efficiently vector quantize the frequency information in each frame. In voiced frames, two sets of frequency vectors are generated: one for harmonically related components and the other for non-harmonically related components of the voiced segment. In transition frames, a vector of nonuniformly spaced frequencies is selected from a frequency codebook using frequency bin vector quantization (FBVQ) to represent the frequency domain information. The effectiveness of the coding scheme is enhanced by exploiting the critical band concept of auditory perception in defining the frequency bins. In transition segments, the sinusoidal phases are modeled and coded. Subjective tests with a partially quantized model indicate that, for a target rate of 4 kbps, the coder quality exceeds that of the G.729 standard at 8 kbps.

引用

页码：1371 / 1374

页数：4

共 50 条

[1] Speech analysis synthesis and modification using an analysis-by-synthesis/overlap-add sinusoidal model
George, EB
Smith, MJT
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1997, 5 (05): : 389 - 406
[2] Analysis-by-synthesis speech coding with quantization noise modeling
Andersen, SV
Kleijn, WB
Jensen, SH
Hansen, E
CONFERENCE RECORD OF THE THIRTY-SECOND ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS & COMPUTERS, VOLS 1 AND 2, 1998, : 333 - 337
[3] Analysis-by-synthesis sinusoidal model without an overlapping scheme
Kim, Jong-Hark
Jeong, Gyu-Hyeok
Lee, In-Sung
IEICE TRANSACTIONS ON COMMUNICATIONS, 2008, E91B (06) : 2094 - 2096
[4] Analysis-by-synthesis multimode harmonic speech coding at 4 kb/s
Li, CY
Cuperman, V
2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS, VOLS I-VI, 2000, : 1367 - 1370
[5] ANALYSIS-BY-SYNTHESIS LINEAR PREDICTIVE SPEECH CODING AT 2.4 KBIT/S
TZENG, FF
DALLAS GLOBECOM 89, VOLS 1-3: COMMUNICATIONS TECHNOLOGY FOR THE 1990S AND BEYOND, 1989, : 1253 - 1257
[6] Analysis-by-synthesis features for speech recognition
Al Bawab, Ziad
Raj, Bhiksha
Stern, Richard M.
2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 4185 - +
[7] Segmental sinusoidal model for speech coding
Setiawan, Florentinus Budi
Hartono, Sugi
Soegijoko, Soegijardjo
Tjondronegoro, Suhartono
2007 6TH INTERNATIONAL CONFERENCE ON INFORMATION, COMMUNICATIONS & SIGNAL PROCESSING, VOLS 1-4, 2007, : 1206 - +
[8] A new structural approach in system identification with generalized analysis-by-synthesis for robust speech coding
Chang, JH
Kim, NS
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2006, 14 (03): : 747 - 751
[9] REDUCTION OF SPEECH SPECTRA BY ANALYSIS-BY-SYNTHESIS TECHNIQUES
BELL, CG
STEVENS, KN
HOUSE, AS
FUJISAKI, H
HEINZ, JM
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1961, 33 (12): : 1725 - &
[10] Segmental Sinusoidal Model for Speech Signal Coding
Setiawan, Florentinus Budi
Soegijoko, Soegijardjo
Sugihartono
Tjondronegoro, Suhartono
MAKARA JOURNAL OF TECHNOLOGY, 2006, 10 (02): : 61 - 66

← 1 2 3 4 5 →