Markov processes on curves for automatic speech recognition

Cited by: 0
Authors
Saul, L [1]
Rahim, M [1]
Affiliation
[1] AT&T Labs Res, Shannon Lab, Florham Park, NJ 07932 USA
Keywords
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
We investigate a probabilistic framework for automatic speech recognition based on the intrinsic geometric properties of curves. In particular, we analyze the setting in which two variables, one continuous (x) and one discrete (s), evolve jointly in time. We suppose that the vector x traces out a smooth multidimensional curve and that the variable s evolves stochastically as a function of the arc length traversed along this curve. Since arc length does not depend on the rate at which a curve is traversed, this gives rise to a family of Markov processes whose predictions, Pr[s | x], are invariant to nonlinear warpings of time. We describe the use of such models, known as Markov processes on curves (MPCs), for automatic speech recognition, where x are acoustic feature trajectories and s are phonetic transcriptions. On two tasks, recognizing New Jersey town names and connected alpha-digits, we find that MPCs yield lower word error rates than comparably trained hidden Markov models.
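The invariance the abstract relies on, that arc length does not depend on the rate at which a curve is traversed, can be checked numerically. The sketch below is only an illustration and not the authors' model: it assumes a plain Euclidean metric, and the trajectory `curve`, the helper `sample`, and the cubic time warp are hypothetical choices made for the example.

```python
import math

def arc_length(points):
    """Sum of Euclidean distances between consecutive points of a polyline."""
    return sum(math.dist(p, q) for p, q in zip(points, points[1:]))

def curve(u):
    """Hypothetical smooth 2-D trajectory: a unit-circle arc spanning 2 radians."""
    return (math.cos(2.0 * u), math.sin(2.0 * u))

def sample(warp, n=2000):
    """Sample the curve at n + 1 evenly spaced times t, visiting u = warp(t)."""
    return [curve(warp(i / n)) for i in range(n + 1)]

# Same geometric curve, traversed at two different rates.
uniform = sample(lambda t: t)       # constant speed
warped = sample(lambda t: t ** 3)   # slow start, fast finish (assumed warp)

len_uniform = arc_length(uniform)
len_warped = arc_length(warped)

# Both polylines approximate the same arc, so the two lengths agree up to
# discretization error even though the time parameterizations differ.
print(len_uniform, len_warped)
```

Because the discrete variable s is driven by arc length rather than by elapsed time, any quantity computed this way is unchanged when the same trajectory is spoken faster or slower, which is the property the abstract attributes to MPCs.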
Pages: 751-757
Page count: 7