A segment-based probabilistic generative model of speech

Cited by: 0
Authors
Achan, K [1 ]
Roweis, S [1 ]
Hertzmann, A [1 ]
Frey, B [1 ]
Affiliations
[1] Univ Toronto, Dept Comp Sci, Toronto, ON, Canada
Keywords
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
We present a purely time-domain approach to speech processing which identifies waveform samples at the boundaries between glottal pulse periods (in voiced speech) or at the boundaries of unvoiced segments. An efficient algorithm for inferring these boundaries and estimating the average spectra of voiced and unvoiced regions is derived from a simple probabilistic generative model. Competitive results are presented on pitch tracking, voiced/unvoiced detection and timescale modification; all these tasks and several others can be performed using the single segmentation provided by inference in the model.
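To illustrate how a single segmentation can serve pitch tracking, here is a minimal sketch in Python. It is not the paper's inference algorithm: it assumes the glottal pulse boundaries have already been found and that every segment is voiced. The function name `pitch_from_boundaries`, the sample indices, and the 16 kHz sample rate are hypothetical choices made for this example only.

```python
def pitch_from_boundaries(boundaries, sample_rate):
    """Return one pitch estimate in Hz per pair of consecutive boundaries.

    Assumption: `boundaries` holds sample indices of glottal pulse
    boundaries in a voiced region, in strictly increasing order.
    """
    f0 = []
    for start, end in zip(boundaries, boundaries[1:]):
        period = end - start  # pulse period length in samples
        if period <= 0:
            raise ValueError("boundaries must be strictly increasing")
        f0.append(sample_rate / period)  # period in samples -> frequency in Hz
    return f0


# Hypothetical boundaries: two 160-sample periods, then one 80-sample period.
print(pitch_from_boundaries([0, 160, 320, 400], 16000))
# -> [100.0, 100.0, 200.0]
```

Each pitch value here depends only on the distance between neighbouring boundaries, which is why a boundary-level segmentation is enough for this task once it is available.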
Pages: 221-224
Page count: 4
Related papers
50 in total
  • [1] A probabilistic framework for segment-based speech recognition
    Glass, JR
    COMPUTER SPEECH AND LANGUAGE, 2003, 17 (2-3) : 137 - 152
  • [2] Segment-based approach to the recognition of emotions in speech
    Shami, MT
    Kamel, MS
    2005 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), VOLS 1 AND 2, 2005, : 366 - 369
  • [3] Timing Levels in Segment-Based Speech Emotion Recognition
    Schuller, Bjoern
    Rigoll, Gerhard
    INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 1818 - 1821
  • [4] Assessing the importance of the segmentation probability in segment-based speech recognition
    Verhasselt, J
    Illina, I
    Martens, JP
    Gong, Y
    Haton, JP
    SPEECH COMMUNICATION, 1998, 24 (01) : 51 - 72
  • [5] Segment-based emotion recognition from continuous Mandarin Chinese speech
    Yeh, Jun-Heng
    Pao, Tsang-Long
    Lin, Ching-Yi
    Tsai, Yao-Wei
    Chen, Yu-Te
    COMPUTERS IN HUMAN BEHAVIOR, 2011, 27 (05) : 1545 - 1552
  • [6] Modelling graph-based observation spaces for segment-based speech recognition
    Glass, JR
    MATHEMATICAL FOUNDATIONS OF SPEECH AND LANGUAGE PROCESSING, 2004, 138 : 157 - 167
  • [7] Segment-Based Speech Emotion Recognition Using Recurrent Neural Networks
    Tzinis, Efthymios
    Potamianos, Alexandros
    2017 SEVENTH INTERNATIONAL CONFERENCE ON AFFECTIVE COMPUTING AND INTELLIGENT INTERACTION (ACII), 2017, : 190 - 195
  • [8] A segment-based optimization model for water pipeline replacement
    Kao, Jehng-Jung
    Li, Pei-Hao
    JOURNAL AMERICAN WATER WORKS ASSOCIATION, 2007, 99 (07) : 83 - 95
  • [9] A segment-based optimization model for water pipeline replacement
    Institute of Environmental Engineering, National Chiao Tung University, 75 Po-Ai street, Hsinchu 300, Taiwan
    Unknown
    J Am Water Works Assoc, 2007, 7 (83-95+12):
  • [10] Segment-based Teletraffic Model for MPEG-DASH
    Ognenoski, Ognen
    Martini, Maria G.
    Amon, Peter
    2013 IEEE 15TH INTERNATIONAL WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING (MMSP), 2013, : 333 - 337