Joint pitch and voicing estimation for multiband excitation and sinusoidal speech coders

被引:0
|
作者
Jia, WH [1 ]
Chan, WY [1 ]
机构
[1] Brooktrout Technol, Los Gatos, CA 95032 USA
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In conventional multi-band excitation (MBE) speech encoding, pitch is estimated first from the speech signal. Using the estimated pitch, voicing decisions are made for pitch-spaced spectral bands. As the. method invariably includes unvoiced components in the speech signal to estimate the pitch,. the accuracy of the estimated pitch and voicing decisions are degraded. We present a novel pitch and voicing estimation scheme, wherein the spectrum of the speech signal is segmented into voiced and unvoiced regions without knowledge of the pitch. Pitch is then estimated only from the voiced regions. Experimental results show that the new scheme improves the accuracy of the estimated pitch and voicing decisions, and offers better speech quality.
引用
收藏
页码:210 / 213
页数:4
相关论文
共 50 条
  • [41] Joint optimization of excitation parameters in analysis-by-synthesis speech coders having multi-tap long term predictor
    Mittal, W
    Ashley, JP
    Cruz-Zeno, EM
    Jasiuk, MA
    [J]. 2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 789 - 792
  • [42] Voiced speech excitation synthesis using a sinusoidal model
    Pollard, MP
    Cheetham, BMG
    Goodyear, CC
    Edgington, MD
    [J]. ELECTRONICS LETTERS, 1998, 34 (06) : 531 - 532
  • [44] PROGRAMS FOR THE ESTIMATION OF FUNDAMENTAL-FREQUENCY, AMPLITUDE, AND VOICING OF SPEECH
    HEYMAN, R
    BIRD, RJ
    HEYMAN, RL
    HARDING, J
    [J]. BEHAVIOR RESEARCH METHODS & INSTRUMENTATION, 1981, 13 (06): : 760 - 760
  • [45] SPEECH ENHANCEMENT USING HARMONICS REGENERATION BASED ON MULTIBAND EXCITATION
    Zhang Yanfang Tang Kun Cui HuijuanNational Laboratory for Information Science and TechnologyTsinghua UniversityBeijing China
    [J]. Journal of Electronics(China), 2011, 28(Z1) (China) : 565 - 570
  • [46] Efficient mixed excitation models in LPC based prototype interpolation speech coders
    Papanastasiou, C
    Xydeas, CS
    [J]. 1997 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I - V: VOL I: PLENARY, EXPERT SUMMARIES, SPECIAL, AUDIO, UNDERWATER ACOUSTICS, VLSI; VOL II: SPEECH PROCESSING; VOL III: SPEECH PROCESSING, DIGITAL SIGNAL PROCESSING; VOL IV: MULTIDIMENSIONAL SIGNAL PROCESSING, NEURAL NETWORKS - VOL V: STATISTICAL SIGNAL AND ARRAY PROCESSING, APPLICATIONS, 1997, : 1555 - 1558
  • [47] Multiband excitation coding of speech at 1.8-2.4 kbps
    Zhang, Jun
    Xiao, Zimei
    Wei, Gang
    [J]. Shengxue Xuebao/Acta Acustica, 2002, 27 (05): : 398 - 404
  • [48] AhoTransf: A tool for Multiband Excitation based speech analysis and modification
    Saratxaga, Ibon
    Hernaez, Inmaculada
    Navas, Eva
    Sainz, Inaki
    Luengo, Iker
    Sanchez, Jon
    Odriozola, Igor
    Erro, Daniel
    [J]. LREC 2010 - SEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2010, : 3732 - 3737
  • [49] SPEECH ENHANCEMENT USING HARMONICS REGENERATION BASED ON MULTIBAND EXCITATION
    Zhang Yanfang Tang Kun Cui Huijuan(National Laboratory for Information Science and Technology
    [J]. Journal of Electronics(China), 2011, (Z1) : 565 - 570
  • [50] Pitch Detection and Voicing/Unvoicing decision of Arabic Speech Signal by HOS-Polycesptre
    Cherouat, Soumeya
    Marir, Farid
    [J]. 2012 6TH INTERNATIONAL CONFERENCE ON SCIENCES OF ELECTRONICS, TECHNOLOGIES OF INFORMATION AND TELECOMMUNICATIONS (SETIT), 2012, : 768 - 771