Joint pitch and voicing estimation for multiband excitation and sinusoidal speech coders

Cited by: 0
Authors
Jia, WH [1]
Chan, WY [1]
Affiliations
[1] Brooktrout Technol, Los Gatos, CA 95032 USA
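Keywords
DOI
Not available
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Subject classification code
0812
Abstract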
In conventional multi-band excitation (MBE) speech encoding, pitch is estimated first from the speech signal. Using the estimated pitch, voicing decisions are made for pitch-spaced spectral bands. Because this method invariably includes unvoiced components of the speech signal when estimating the pitch, the accuracy of both the estimated pitch and the voicing decisions is degraded. We present a novel pitch and voicing estimation scheme in which the spectrum of the speech signal is segmented into voiced and unvoiced regions without knowledge of the pitch. Pitch is then estimated only from the voiced regions. Experimental results show that the new scheme improves the accuracy of the estimated pitch and voicing decisions, and offers better speech quality.
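The two-stage ordering described in the abstract (segment the spectrum first, then estimate pitch from the voiced part only) can be illustrated with a small sketch. The abstract does not say how the segmentation or the pitch search is carried out, so everything below is an assumption made for illustration and not the authors' method: fixed 500 Hz bands, a spectral-flatness threshold as the voicing test, a harmonic comb with valley penalties as the pitch estimator, and a synthetic test frame. All function names, constants, and thresholds are hypothetical.

```python
import cmath
import math
import random

FS = 8000            # sampling rate in Hz (assumed)
N = 512              # frame length in samples (assumed)
BAND_BINS = 32       # fixed band width in DFT bins, i.e. 500 Hz (assumed)
FLATNESS_MAX = 0.1   # bands flatter than this are labelled unvoiced (assumed)


def synth_frame(f0=200.0, seed=0):
    """Synthetic frame: 9 harmonics of f0 plus noise-like content in 2-4 kHz."""
    rng = random.Random(seed)
    noise = [(rng.uniform(2000.0, 4000.0), rng.uniform(0.0, 2 * math.pi))
             for _ in range(300)]
    frame = []
    for i in range(N):
        t = i / FS
        s = sum(math.sin(2 * math.pi * k * f0 * t) for k in range(1, 10))
        s += sum(0.2 * math.sin(2 * math.pi * f * t + ph) for f, ph in noise)
        frame.append(s)
    return frame


def power_spectrum(frame):
    """Hann-windowed power spectrum for bins 0 .. N/2 - 1 (plain DFT)."""
    n = len(frame)
    x = [s * (0.5 - 0.5 * math.cos(2 * math.pi * i / n))
         for i, s in enumerate(frame)]
    tw = [cmath.exp(-2j * math.pi * i / n) for i in range(n)]
    return [abs(sum(x[i] * tw[(k * i) % n] for i in range(n))) ** 2
            for k in range(n // 2)]


def segment_voicing(power):
    """Label each fixed-width band without using any pitch value.

    Spectral flatness (geometric mean / arithmetic mean) is low for a
    peaky, harmonic band and high for a noise-like band.
    """
    flags = []
    for start in range(0, len(power), BAND_BINS):
        band = [p + 1e-12 for p in power[start:start + BAND_BINS]]
        log_mean = sum(math.log(p) for p in band) / len(band)
        flatness = math.exp(log_mean) / (sum(band) / len(band))
        flags.append(flatness < FLATNESS_MAX)
    return flags


def estimate_pitch(power, voiced, f_lo=60, f_hi=400):
    """Search integer candidates in Hz using bins from voiced bands only."""
    hz_per_bin = FS / (2 * len(power))

    def sample(freq):
        b = int(round(freq / hz_per_bin))
        if b >= len(power) or not voiced[b // BAND_BINS]:
            return None  # outside the spectrum or in an unvoiced band
        return power[b]

    best_f0, best_score = None, -math.inf
    for f0 in range(f_lo, f_hi + 1):
        terms = []
        k = 1
        while k * f0 < FS / 2:
            peak = sample(k * f0)
            if peak is not None:
                # Penalise energy halfway between the candidate's harmonics.
                valley = sample((k - 0.5) * f0) or 0.0
                terms.append(peak - valley)
            k += 1
        if terms:
            score = sum(terms) / len(terms)
            if score > best_score:
                best_f0, best_score = f0, score
    return best_f0


if __name__ == "__main__":
    spectrum = power_spectrum(synth_frame())
    flags = segment_voicing(spectrum)
    print("voiced bands:", flags)
    print("pitch estimate (Hz):", estimate_pitch(spectrum, flags))
```

In this sketch the pitch search never reads bins from bands labelled unvoiced, which is the point of the ordering in the abstract. The mean-based comb score with valley penalties is only one simple way to suppress sub-multiple and multiple pitch candidates within the assumed 60 to 400 Hz search range.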
Pages: 210-213
Page count: 4
Related papers
50 in total
  • [21] Investigation on the spectral envelope estimator (SEEVOC) and refined pitch estimation based on the sinusoidal speech model
    Kim, HS
    Holmes, H
    Zhang, WH
    [J]. IEEE TENCON'97 - IEEE REGIONAL 10 ANNUAL CONFERENCE, PROCEEDINGS, VOLS 1 AND 2: SPEECH AND IMAGE TECHNOLOGIES FOR COMPUTING AND TELECOMMUNICATIONS, 1997 : 575 - 578
  • [22] Efficient spectral magnitude quantisation for high-quality sinusoidal speech coders
    Cho, YD
    Villette, S
    Kondoz, A
    [J]. IEEE VTC 53RD VEHICULAR TECHNOLOGY CONFERENCE, SPRING 2001, VOLS 1-4, PROCEEDINGS, 2001 : 1315 - 1318
  • [23] Fast harmonic estimation method for harmonic speech coders
    Choi, YS
    Youn, DH
    [J]. ELECTRONICS LETTERS, 2002, 38 (07) : 346 - 347
  • [24] Estimation of the instantaneous pitch of speech
    Resch, Barbara
    Nilsson, Mattias
    Ekman, Anders
    Kleijn, W. Bastiaan
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2007, 15 (03) : 813 - 822
  • [25] ZERO-BRANCH TREE ENCODING OF SPEECH PITCH VOICING SIGNALS
    GONCHAROFF, V
    BUKIET, B
    HAYNER, D
    [J]. IEEE TRANSACTIONS ON COMMUNICATIONS, 1989, 37 (11) : 1236 - 1239
  • [26] Interpolation of the Pitch-Predictor Parameters in Analysis-by-Synthesis Speech Coders
    Kleijn, W. Bastiaan
    Ramachandran, Ravi P.
    Kroon, Peter
    [J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1994, 2 (01) : 42 - 54
  • [27] Low bit-rate wideband LP and wideband sinusoidal parametric speech coders
    Madrid, KM
    Tan, EC
    Guevara, RCL
    [J]. TENCON 2004 - 2004 IEEE REGION 10 CONFERENCE, VOLS A-D, PROCEEDINGS: ANALOG AND DIGITAL TECHNIQUES IN ELECTRICAL ENGINEERING, 2004 : A135 - A138
  • [28] Joint optimization of LPC and closed-loop pitch parameters in CELP coders
    Serizawa, M
    Gersho, A
    [J]. IEEE SIGNAL PROCESSING LETTERS, 1999, 6 (03) : 52 - 54
  • [29] CELP AND SINUSOIDAL CODERS - 2 SOLUTIONS FOR SPEECH CODING AT 4.8-9.6 KBPS
    TRANCOSO, IM
    MARQUES, JS
    RIBEIRO, CM
    [J]. SPEECH COMMUNICATION, 1990, 9 (5-6) : 389 - 400
  • [30] Pitch adaptive windows for improved excitation coding in low-rate CELP coders
    Rao, AV
    Ahmadi, S
    Lindén, J
    Gersho, A
    Cuperman, V
    Heidari, R
    [J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2003, 11 (06) : 648 - 659