Prosody modeling for automatic speech recognition and understanding

被引:0
|
作者
Shriberg, E [1 ]
Stolcke, A [1 ]
机构
[1] SRI Int, Menlo Pk, CA 94025 USA
关键词
prosody; speech recognition and understanding; hidden Markov models;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This paper summarizes statistical modeling approaches for the use of prosody (the rhythm and melody of speech) in automatic recognition and understanding of speech. We outline effective prosodic feature extraction, model architectures, and techniques to combine prosodic with lexical (word-based) information. We then survey a number of applications of the framework, and give results for automatic sentence segmentation and disfluency detection, topic segmentation, dialog act labeling, and word recognition.
引用
收藏
页码:105 / 114
页数:10
相关论文
共 50 条
  • [21] Prosody Dependent Mandarin Speech Recognition
    Ni, Chong-Jia
    Liu, Wen-Ju
    Xu, Bo
    2011 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2011, : 197 - 201
  • [22] On the use of prosody in automatic dialogue understanding
    Nöth, E
    Batliner, A
    Warnke, V
    Haas, J
    Boros, M
    Buckow, J
    Huber, R
    Gallwitz, F
    Nutt, M
    Niemann, H
    SPEECH COMMUNICATION, 2002, 36 (1-2) : 45 - 62
  • [23] Lexical modeling of non-native speech for automatic speech recognition
    Livescu, K
    Glass, J
    2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS, VOLS I-VI, 2000, : 1683 - 1686
  • [24] A Decade of Discriminative Language Modeling for Automatic Speech Recognition
    Saraclar, Murat
    Dikici, Erinc
    Arisoy, Ebru
    SPEECH AND COMPUTER (SPECOM 2015), 2015, 9319 : 11 - 22
  • [25] An Evaluation of Structured Language Modeling for Automatic Speech Recognition
    Bjorklund, Johanna
    Cleophas, Loek
    Karlsson, My
    JOURNAL OF UNIVERSAL COMPUTER SCIENCE, 2017, 23 (11) : 1019 - 1034
  • [26] STATISTICAL MODELING AND AUTOMATIC PARAMETER ESTIMATION IN SPEECH RECOGNITION
    BAHL, LR
    BAKER, JK
    JELINEK, F
    MERCER, RL
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1976, 59 : S96 - S96
  • [27] Lexical and Phonetic Modeling for Arabic Automatic Speech Recognition
    Nguyen, Long
    Ng, Tim
    Nguyen, Kham
    Zbib, Rabih
    Makhoul, John
    INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 708 - +
  • [28] MODELING ERROR RECOVERY AND REPAIR IN AUTOMATIC SPEECH RECOGNITION
    BABER, C
    HONE, KS
    INTERNATIONAL JOURNAL OF MAN-MACHINE STUDIES, 1993, 39 (03): : 495 - 515
  • [29] Improved Acoustic Modeling for Automatic Dysarthric Speech Recognition
    Sriranjani, R.
    Reddy, M. Ramasubba
    Umesh, S.
    2015 TWENTY FIRST NATIONAL CONFERENCE ON COMMUNICATIONS (NCC), 2015,
  • [30] An Automatic Prosody Labeling Method for Mandarin Speech
    Chiang, Chen-Yu
    Yu, Hsiu-Min
    Wang, Yih-Ru
    Chen, Sin-Horng
    INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 725 - +