A segment-based probabilistic generative model of speech

Cited by: 0

Authors:

Achan, K [1]

Roweis, S [1]

Hertzmann, A [1]

Frey, B [1]

Affiliation:

[1] Univ Toronto, Dept Comp Sci, Toronto, ON, Canada

Source:

2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING | 2005

Keywords:

DOI:

Not available

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

We present a purely time-domain approach to speech processing which identifies waveform samples at the boundaries between glottal pulse periods (in voiced speech) or at the boundaries of unvoiced segments. An efficient algorithm for inferring these boundaries and estimating the average spectra of voiced and unvoiced regions is derived from a simple probabilistic generative model. Competitive results are presented on pitch tracking, voiced/unvoiced detection, and timescale modification; all these tasks and several others can be performed using the single segmentation provided by inference in the model.


Pages: 221-224

Page count: 4

