Joint pitch and voicing estimation for multiband excitation and sinusoidal speech coders

被引:0
|
作者
Jia, WH [1 ]
Chan, WY [1 ]
机构
[1] Brooktrout Technol, Los Gatos, CA 95032 USA
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In conventional multi-band excitation (MBE) speech encoding, pitch is estimated first from the speech signal. Using the estimated pitch, voicing decisions are made for pitch-spaced spectral bands. As the. method invariably includes unvoiced components in the speech signal to estimate the pitch,. the accuracy of the estimated pitch and voicing decisions are degraded. We present a novel pitch and voicing estimation scheme, wherein the spectrum of the speech signal is segmented into voiced and unvoiced regions without knowledge of the pitch. Pitch is then estimated only from the voiced regions. Experimental results show that the new scheme improves the accuracy of the estimated pitch and voicing decisions, and offers better speech quality.
引用
收藏
页码:210 / 213
页数:4
相关论文
共 50 条
  • [1] Improving pitch estimation for efficient multiband excitation coding of speech
    Chan, CF
    Yu, EWM
    [J]. ELECTRONICS LETTERS, 1996, 32 (10) : 870 - 872
  • [2] A robust pitch and voicing detector for harmonic coders
    Bryden, K
    BrindAmour, A
    Hassanein, H
    [J]. ISSPA 96 - FOURTH INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND ITS APPLICATIONS, PROCEEDINGS, VOLS 1 AND 2, 1996, : 61 - 64
  • [3] Joint optimization of model and excitation in parametric speech coders
    Lashkari, K
    Miki, T
    [J]. 2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 277 - 280
  • [4] A joint pitch estimation and voicing detection method for melody extraction
    Zhang, Weiwei
    Wang, Rong
    Zhang, Qiaoling
    Fang, Shaojun
    [J]. APPLIED ACOUSTICS, 2020, 166
  • [5] Joint optimization of model and excitation in CELP-type speech coders
    Lashkari, K
    Miki, T
    [J]. THIRTY-SIXTH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS & COMPUTERS - CONFERENCE RECORD, VOLS 1 AND 2, CONFERENCE RECORD, 2002, : 191 - 195
  • [6] Joint Robust Voicing Detection and Pitch Estimation Based on Residual Harmonics
    Drugman, Thomas
    Alwan, Abeer
    [J]. 12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 1984 - +
  • [7] Phase-spread voicing analysis in parametric speech coders
    Edwards, R.
    Sturt, C.
    Villette, S.
    Kondoz, A.
    [J]. ELECTRONICS LETTERS, 2006, 42 (11) : 665 - 666
  • [8] LSF quantisation for pitch synchronous speech coders
    Sturt, C
    Villette, S
    Kondoz, AM
    [J]. 2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL II, PROCEEDINGS: SPEECH II; INDUSTRY TECHNOLOGY TRACKS; DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS; NEURAL NETWORKS FOR SIGNAL PROCESSING, 2003, : 165 - 168
  • [9] Autocorrelation of the Speech Multi-Scale Product for Voicing Decision and Pitch Estimation
    Mohamed Anouar Ben Messaoud
    Aïcha Bouzid
    Noureddine Ellouze
    [J]. Cognitive Computation, 2010, 2 : 151 - 159
  • [10] Autocorrelation of the Speech Multi-Scale Product for Voicing Decision and Pitch Estimation
    Ben Messaoud, Mohamed Anouar
    Bouzid, Aicha
    Ellouze, Noureddine
    [J]. COGNITIVE COMPUTATION, 2010, 2 (03) : 151 - 159