A NEW VOICE SOURCE MODEL BASED ON HIGH-SPEED IMAGING AND ITS APPLICATION TO VOICE SOURCE ESTIMATION

被引:13
|
作者
Shue, Yen-Liang [1 ]
Alwan, Abeer [1 ]
机构
[1] Univ Calif Los Angeles, Dept Elect Engn, Los Angeles, CA 90095 USA
关键词
source estimation; voice source; speech analysis;
D O I
10.1109/ICASSP.2010.5495030
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
There are numerous models of varying complexities which seek to efficiently represent the voice source signal. These models are typically based on data and observations which can come from air-flow masks, electroglottographs, mechanical systems, and the inverse-filtering of speech signals. The first part of this study examines observations from the high-speed imaging of the larynx and proposes a new source model, which is shown to provide a better fit for the observed data than existing models. The proposed source model is then used in an automatic source estimation application, based on methods introduced in an earlier study [1]. Results, on average, show that the proposed model provides a more accurate estimation of the source signal compared with the Liljencrants-Fant model.
引用
收藏
页码:5134 / 5137
页数:4
相关论文
共 50 条
  • [21] Tracking High-speed Source Based on Moving Source Acoustic Field Model in Shallow Ocean Environment
    Du, Jinyan
    Zheng, Yi
    Sun, Chao
    Liu, Zongwei
    Yang, Yixin
    [J]. 2013 OCEANS - SAN DIEGO, 2013,
  • [22] High-speed light source depth estimation using spatially-resolved diffuse imaging
    Brennan, Kieran A.
    Kulasingham, Daniel A. N.
    Nielsen, Poul M. F.
    Taberner, Andrew J.
    Ruddy, Bryan P.
    [J]. JOURNAL OF OPTICS, 2019, 21 (01)
  • [23] Automatic estimation of formant and voice source parameters using a subspace based algorithm
    Yang, CS
    Kasuya, H
    [J]. PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-6, 1998, : 941 - 944
  • [24] Estimation method of glottal vocal efficiency based on conversion function of voice source
    ZOU Yuan WAN Mingxi ZHAO Shouguo WANG Supin(1 Department of Biomedical Engineering
    [J]. Chinese Journal of Acoustics, 2002, (04) : 332 - 342
  • [25] Studies on the physiology of voice by means of high-speed cinematography
    Berger, R
    [J]. ZEITSCHRIFT FUR DIALEKTOLOGIE UND LINGUISTIK, 1999, 66 (03): : 366 - 367
  • [26] VOICE SOURCE ESTIMATION FOR ARTIFICIAL BANDWIDTH EXTENSION OF TELEPHONE SPEECH
    Thomas, Mark R. P.
    Gudnason, Jon
    Naylor, Patrick A.
    Geiser, Bernd
    Vary, Peter
    [J]. 2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 4794 - 4797
  • [27] On the use of voice descriptors for glottal source shape parameter estimation
    Huber, Stefan
    Roebel, Axel
    [J]. COMPUTER SPEECH AND LANGUAGE, 2014, 28 (05): : 1170 - 1194
  • [28] VOICE SOURCE MODEL FOR CONTINUOUS CONTROL OF PITCH PERIOD
    MILENKOVIC, PH
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1993, 93 (02): : 1087 - 1096
  • [29] Modeling glottal source for high quality voice conversion
    Sun, Jun
    Dai, Beiqian
    Zhang, Jian
    Xie, Yanlu
    [J]. WCICA 2006: SIXTH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION, VOLS 1-12, CONFERENCE PROCEEDINGS, 2006, : 319 - 319
  • [30] Hybrid Source Model for Predicting High-Speed Jet Noise
    Leib, S. J.
    Goldstein, M. E.
    [J]. AIAA JOURNAL, 2011, 49 (07) : 1324 - 1335