A segment-based probabilistic generative model of speech

被引:0
|
作者
Achan, K [1 ]
Roweis, S [1 ]
Hertzmann, A [1 ]
Frey, B [1 ]
机构
[1] Univ Toronto, Dept Comp Sci, Toronto, ON, Canada
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present a purely time domain approach to speech processing which identifies waveform samples at the boundaries between glottal pulse periods (in voiced speech) or at the boundaries of unvoiced segments. An efficient algorithm for inferring these boundaries and estimating the average spectra of voiced and unvoiced regions is derived from a simple probabilistic generative model. Competitive results are presented on pitch tracking, voiced/unvoiced detection and timescale modification; all these tasks and several others can be performed using the single segmentation provided by inference in the model.
引用
收藏
页码:221 / 224
页数:4
相关论文
共 50 条
  • [31] Segment-based automatic language identification
    Hazen, TJ
    Zue, VW
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1997, 101 (04): : 2323 - 2331
  • [32] Segment-based coding of color images
    Zhang YuDong
    Wu LeNan
    SCIENCE IN CHINA SERIES F-INFORMATION SCIENCES, 2009, 52 (06): : 914 - 925
  • [33] Segment-based pavement crack quantification
    Weng, Xingxing
    Huang, Yuchun
    Wang, Wenzong
    AUTOMATION IN CONSTRUCTION, 2019, 105
  • [34] Segment-based hand pose estimation
    Schwarz, C
    Lobo, NDV
    2nd Canadian Conference on Computer and Robot Vision, Proceedings, 2005, : 42 - 49
  • [35] A SEGMENT-BASED IMAGE SALIENCY DETECTION
    Muratov, O.
    Zontone, P.
    Boato, G.
    De Natale, F. G. B.
    2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 1217 - 1220
  • [36] Segment-based coding of color images
    YuDong Zhang
    LeNan Wu
    Science in China Series F: Information Sciences, 2009, 52 : 914 - 925
  • [37] Sample or Random Security - A Security Model for Segment-Based Visual Cryptography
    Pape, Sebastian
    FINANCIAL CRYPTOGRAPHY AND DATA SECURITY, FC 2014, 2014, 8437 : 291 - 303
  • [38] Segment-based excess Gibbs energy model for aqueous organic electrolytes
    Chen, CC
    Bokis, CP
    Mathias, P
    AICHE JOURNAL, 2001, 47 (11) : 2593 - 2602
  • [39] OpenVL: Abstracting Vision Tasks Using a Segment-Based Language Model
    Miller, Gregor
    Oldridge, Steve
    Fels, Sidney
    2013 INTERNATIONAL CONFERENCE ON COMPUTER AND ROBOT VISION (CRV), 2013, : 257 - 264
  • [40] The development of a segment-based musculoskeletal model of the lower limb: introducing FREEBODY
    Cleather, Daniel J.
    Bull, Anthony M. J.
    ROYAL SOCIETY OPEN SCIENCE, 2015, 2 (06):