Prosody modeling for automatic speech recognition and understanding

Cited by: 0
Authors
Shriberg, E [1]
Stolcke, A [1]
Affiliations
[1] SRI Int, Menlo Pk, CA 94025 USA
Keywords
prosody; speech recognition and understanding; hidden Markov models
DOI
Not available
Chinese Library Classification (CLC) number
O42 [Acoustics]
Discipline classification codes
070206; 082403
Abstract
This paper summarizes statistical modeling approaches for the use of prosody (the rhythm and melody of speech) in automatic recognition and understanding of speech. We outline effective prosodic feature extraction, model architectures, and techniques to combine prosodic with lexical (word-based) information. We then survey a number of applications of the framework, and give results for automatic sentence segmentation and disfluency detection, topic segmentation, dialog act labeling, and word recognition.
Pages: 105-114
Number of pages: 10