Speech coding with an analysis-by-synthesis sinusoidal model

被引：0

作者：

Etemoglu, ÇÖ ^{[1
]}

Cuperman, V ^{[1
]}

Gersho, A ^{[1
]}

机构：

[1] Univ Calif Santa Barbara, Dept Elect & Comp Engn, Santa Barbara, CA 93106 USA

来源：

2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS, VOLS I-VI | 2000年

关键词：

D O I：

暂无

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

We introduce a general and powerful approach to sinusoidal modeling of speech wherein a closed-loop Analysis-by-Synthesis (AbS) technique sequentially extracts the parameters for each sinusoidal component. Low bit-rate speech coding is achieved by efficiently constraining the allowed frequencies of sinusoidal components into sets of frequency intervals or bins. In conjunction with the closed-loop analysis, the constrained frequency regions allow us to efficiently vector quantize the frequency information in each frame. In voiced frames, two sets of frequency vectors are generated: one for harmonically related components and the other for non-harmonically related components of the voiced segment. In transition frames, a vector of nonuniformly spaced frequencies is selected from a frequency codebook using frequency bin vector quantization (FBVQ) to represent the frequency domain information. The effectiveness of the coding scheme is enhanced by exploiting the critical band concept of auditory perception in defining the frequency bins. In transition segments, the sinusoidal phases are modeled and coded. Subjective tests with a partially quantized model indicate that, for a target rate of 4 kbps, the coder quality exceeds that of the G.729 standard at 8 kbps.

引用

页码：1371 / 1374

页数：4

共 50 条

[21] Analysis-by-synthesis voicing cut-off determination in harmonic coding
Jia, WH
Chan, WY
2000 IEEE WORKSHOP ON SPEECH CODING, PROCEEDINGS: MEETING THE CHALLENGES OF THE NEW MILLENNIUM, 2000, : 65 - 67
[22] Matching pursuits sinusoidal speech coding
Etemoglu, ÇÖ
Cuperman, V
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2003, 11 (05): : 413 - 424
[23] SPEECH ANALYSIS SYNTHESIS BASED ON A SINUSOIDAL REPRESENTATION
MCAULAY, RJ
QUATIERI, TF
IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1986, 34 (04): : 744 - 754
[24] A CLASS OF ANALYSIS-BY-SYNTHESIS PREDICTIVE CODERS FOR HIGH-QUALITY SPEECH CODING AT RATES BETWEEN 4.8 AND 16 KBITS/S
KROON, P
DEPRETTERE, EF
IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 1988, 6 (02) : 353 - 363
[25] Voiced speech excitation synthesis using a sinusoidal model
Pollard, MP
Cheetham, BMG
Goodyear, CC
Edgington, MD
ELECTRONICS LETTERS, 1998, 34 (06) : 531 - 532
[26] Steganalysis of analysis-by-synthesis speech exploiting pulse-position distribution characteristics
Tian, Hui
Wu, Yanpeng
Chang, Chin-Chen
Huang, Yongfeng
Liu, Jin
Wang, Tian
Chen, Yonghong
Cai, Yiqiao
SECURITY AND COMMUNICATION NETWORKS, 2016, 9 (15) : 2934 - 2944
[27] A study of acoustic-to-articulatory inversion of speech by analysis-by-synthesis using chain matrices and the Maeda articulatory model
Panchapagesan, Sankaran
Alwan, Abeer
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2011, 129 (04): : 2144 - 2162
[28] Speech analysis and coding using a multi-resolution sinusoidal transform
Anderson, DV
1996 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, CONFERENCE PROCEEDINGS, VOLS 1-6, 1996, : 1037 - 1040
[29] Encoding Navigable Speech Sources: A Psychoacoustic-Based Analysis-by-Synthesis Approach
Zheng, Xiguang
Ritz, Christian
Xi, Jiangtao
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2013, 21 (01): : 27 - 36
[30] Initialization of Model-Based Camera Tracking with Analysis-by-Synthesis
Schumann, Martin
Kowalczyk, Sebastian
Mueller, Stefan
ADVANCES IN VISUAL COMPUTING, ISVC 2012, PT II, 2012, 7432 : 324 - 333

← 1 2 3 4 5 →