Markov processes on curves for automatic speech recognition

被引：0

作者：

Saul, L ^{[1
]}

Rahim, M ^{[1
]}

机构：

[1] AT&T Labs Res, Shannon Lab, Florham Park, NJ 07932 USA

来源：

ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 11 | 1999年 / 11卷

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

We investigate a probabilistic framework for automatic speech recognition based on the intrinsic geometric properties of curves. In particular, we analyze the setting in which two variables-one continuous (x), one discrete (s)-evolve jointly in time. We suppose that the vector x traces out a smooth multidimensional curve and that the variable s evolves stochastically as a function of the are length traversed along this curve. Since are length does not depend on the rate at which a curve is traversed, this gives rise to a family of Markov processes whose predictions, Pr[s \ x], are invariant to nonlinear warpings of time. We describe the use of such models, known as Markov processes on curves (MPCs), for automatic speech recognition, where x are acoustic feature trajectories and s are phonetic transcriptions. On two tasks-recognizing New Jersey town names and connected alpha-digits-we find that MPCs yield lower word error rates than comparably trained hidden Markov models.

引用

下载

页码：751 / 757

页数：7

共 50 条

[21] Are automatic acquisition of acoustical units for speech recognition based on hidden Markov network
Suzuki, M
Hayashi, T
Mori, H
Makino, S
Aso, H
DISCOVERY SCIENCE, PROCEEDINGS, 1999, 1721 : 357 - 358
[22] On the robust incorporation of formant features into hidden Markov models for automatic speech recognition
Garner, PN
Holmes, WJ
PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-6, 1998, : 1 - 4
[23] Novel frequency masking curves for noise-robust automatic speech recognition
Chen, Chia-Ping
Yeh, Ja-Zang
Wu, Bo-Feng
JOURNAL OF THE CHINESE INSTITUTE OF ENGINEERS, 2013, 36 (06) : 696 - 703
[24] Automatic speech recognition
O'Shaughnessy, Douglas
2015 CHILEAN Conference on Electrical, Electronics Engineering, Information and Communication Technologies (CHILECON), 2015, : 417 - 424
[25] AUTOMATIC SPEECH RECOGNITION
IVALL, T
ELECTRONICS & WIRELESS WORLD, 1984, 90 (1581): : 73 - 76
[26] AUTOMATIC RECOGNITION OF SPEECH
MARILL, T
IRE TRANSACTIONS ON HUMAN FACTORS IN ELECTRONICS, 1961, HFE2 (01): : 34 - +
[27] AUTOMATIC SPEECH RECOGNITION
RAO, PVS
PALIWAL, KK
SADHANA-ACADEMY PROCEEDINGS IN ENGINEERING SCIENCES, 1986, 9 : 85 - 120
[28] Speech production and automatic speech recognition
Acoustics Bulletin, 2000, 25 (02):
[29] AUTOMATIC SPEECH RECOGNITION OF IMPAIRED SPEECH
CARLSON, GS
BERNSTEIN, J
INTERNATIONAL JOURNAL OF REHABILITATION RESEARCH, 1988, 11 (04) : 396 - 398
[30] SUBSPACE HIGH-DENSITY DISCRETE HIDDEN MARKOV MODEL FOR AUTOMATIC SPEECH RECOGNITION
Ye, Guoli
Mak, Brian
2012 PROCEEDINGS OF THE 20TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2012, : 1643 - 1647

← 1 2 3 4 5 →