Prosody modeling for automatic speech recognition and understanding

被引：0

作者：

Shriberg, E ^{[1
]}

Stolcke, A ^{[1
]}

机构：

[1] SRI Int, Menlo Pk, CA 94025 USA

来源：

MATHEMATICAL FOUNDATIONS OF SPEECH AND LANGUAGE PROCESSING | 2004年 / 138卷

关键词：

prosody; speech recognition and understanding; hidden Markov models;

D O I：

暂无

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

This paper summarizes statistical modeling approaches for the use of prosody (the rhythm and melody of speech) in automatic recognition and understanding of speech. We outline effective prosodic feature extraction, model architectures, and techniques to combine prosodic with lexical (word-based) information. We then survey a number of applications of the framework, and give results for automatic sentence segmentation and disfluency detection, topic segmentation, dialog act labeling, and word recognition.

引用

页码：105 / 114

页数：10

共 50 条

[41] Using morphemes in language modeling and automatic speech recognition of amharic
Tachbelie, Martha Yifiru, 1600, Cambridge University Press (20):
[42] Dialogue act modeling for automatic tagging and recognition of conversational speech
Stolcke, A
Ries, K
Coccaro, N
Shriberg, E
Bates, R
Jurafsky, D
Taylor, P
Martin, R
Van Ess-Dykema, C
Meteer, M
COMPUTATIONAL LINGUISTICS, 2000, 26 (03) : 339 - 373
[43] CYCLEGAN BANDWIDTH EXTENSION ACOUSTIC MODELING FOR AUTOMATIC SPEECH RECOGNITION
Haws, David
Cui, Xiaodong
2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 6780 - 6784
[44] Using morphemes in language modeling and automatic speech recognition of Amharic
Tachbelie, Martha Yifiru
Abate, Solomon Teferra
Menzel, Wolfgang
NATURAL LANGUAGE ENGINEERING, 2014, 20 (02) : 235 - 259
[45] Special issue on modeling pronunciation variation for automatic speech recognition
Strik, H
SPEECH COMMUNICATION, 1999, 29 (2-4) : 81 - 82
[46] DYNAMIC-PROGRAMMING AND STATISTICAL MODELING IN AUTOMATIC SPEECH RECOGNITION
RUSSELL, MJ
MOORE, RK
TOMLINSON, MJ
JOURNAL OF THE OPERATIONAL RESEARCH SOCIETY, 1986, 37 (01) : 21 - 30
[47] Written-Domain Language Modeling for Automatic Speech Recognition
Sak, Hasim
Sung, Yun-hsuan
Beaufays, Francoise
Allauzen, Cyril
14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 675 - 679
[48] Image-Sensitive Language Modeling for Automatic Speech Recognition
Naszadi, Kata
Oualil, Youssef
Klakow, Dietrich
COMPUTER VISION - ECCV 2018 WORKSHOPS, PT IV, 2019, 11132 : 173 - 179
[49] Modeling Dialectal Variation for Swiss German Automatic Speech Recognition
Khosravani, Abbas
Garner, Philip N.
Lazaridis, Alexandros
INTERSPEECH 2021, 2021, : 2896 - 2900
[50] Lexical modeling for the development of Amharic automatic speech recognition systems
Tachbelie, Martha Yifiru
Abate, Solomon Teferra
LANGUAGE RESOURCES AND EVALUATION, 2023, 57 (03) : 963 - 984

← 1 2 3 4 5 →