Prosody modeling for automatic speech recognition and understanding

被引：0

作者：

Shriberg, E ^{[1
]}

Stolcke, A ^{[1
]}

机构：

[1] SRI Int, Menlo Pk, CA 94025 USA

来源：

MATHEMATICAL FOUNDATIONS OF SPEECH AND LANGUAGE PROCESSING | 2004年 / 138卷

关键词：

prosody; speech recognition and understanding; hidden Markov models;

D O I：

暂无

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

This paper summarizes statistical modeling approaches for the use of prosody (the rhythm and melody of speech) in automatic recognition and understanding of speech. We outline effective prosodic feature extraction, model architectures, and techniques to combine prosodic with lexical (word-based) information. We then survey a number of applications of the framework, and give results for automatic sentence segmentation and disfluency detection, topic segmentation, dialog act labeling, and word recognition.

引用

页码：105 / 114

页数：10

共 50 条

[21] Prosody Dependent Mandarin Speech Recognition
Ni, Chong-Jia
Liu, Wen-Ju
Xu, Bo
2011 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2011, : 197 - 201
[22] On the use of prosody in automatic dialogue understanding
Nöth, E
Batliner, A
Warnke, V
Haas, J
Boros, M
Buckow, J
Huber, R
Gallwitz, F
Nutt, M
Niemann, H
SPEECH COMMUNICATION, 2002, 36 (1-2) : 45 - 62
[23] Lexical modeling of non-native speech for automatic speech recognition
Livescu, K
Glass, J
2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS, VOLS I-VI, 2000, : 1683 - 1686
[24] A Decade of Discriminative Language Modeling for Automatic Speech Recognition
Saraclar, Murat
Dikici, Erinc
Arisoy, Ebru
SPEECH AND COMPUTER (SPECOM 2015), 2015, 9319 : 11 - 22
[25] An Evaluation of Structured Language Modeling for Automatic Speech Recognition
Bjorklund, Johanna
Cleophas, Loek
Karlsson, My
JOURNAL OF UNIVERSAL COMPUTER SCIENCE, 2017, 23 (11) : 1019 - 1034
[26] STATISTICAL MODELING AND AUTOMATIC PARAMETER ESTIMATION IN SPEECH RECOGNITION
BAHL, LR
BAKER, JK
JELINEK, F
MERCER, RL
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1976, 59 : S96 - S96
[27] Lexical and Phonetic Modeling for Arabic Automatic Speech Recognition
Nguyen, Long
Ng, Tim
Nguyen, Kham
Zbib, Rabih
Makhoul, John
INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 708 - +
[28] MODELING ERROR RECOVERY AND REPAIR IN AUTOMATIC SPEECH RECOGNITION
BABER, C
HONE, KS
INTERNATIONAL JOURNAL OF MAN-MACHINE STUDIES, 1993, 39 (03): : 495 - 515
[29] Improved Acoustic Modeling for Automatic Dysarthric Speech Recognition
Sriranjani, R.
Reddy, M. Ramasubba
Umesh, S.
2015 TWENTY FIRST NATIONAL CONFERENCE ON COMMUNICATIONS (NCC), 2015,
[30] An Automatic Prosody Labeling Method for Mandarin Speech
Chiang, Chen-Yu
Yu, Hsiu-Min
Wang, Yih-Ru
Chen, Sin-Horng
INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 725 - +

← 1 2 3 4 5 →