Storing prosody attributes of spontaneous speech

被引：0

作者：

Klecková, J ^{[1
]}

机构：

[1] Univ W Bohemia, Fac Sci Appl, Dept Comp Sci, CZ-30614 Plzen, Czech Republic

来源：

TEXT, SPEECH AND DIALOGUE | 1999年 / 1692卷

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The paper deals with the problem of the storage of the prosody attributes which have been used in the Czech dialog system. In Czech language featured by a free-word-ordering the prosody serves a critical information for the recognition and understanding system. For some sentences the intonation is essential to determine the core of a communication, beeing used by a speaker who to emphasize a meaning of a sentence. The prosodic characteristics included in the sentence (features describing fundamental frequency F0, voice energy, the length of a pause behind and before the word, the speaking rate, flags indicating word finality and the lexical word accent) are stored in the database and consequently exploited by the linguistic module as an additional information used for recognizing and understanding the spontaneous speech. In case of storing digital speech signal in database we meet a problem of its high redundancy. Another problem is choosing cut points for segmenting speech. Suitable points are pauses, but they are not often present in the fluent speech. Instead a coarticulation between phonemes makes the placement of the cut points difficult. Their suitably for segmenting speech should be dependent on a context information. Having processed the characteristics by usual methods of statistics the database can also be used to generate answers in the dialog system. The module was implemented in the C language and supported by the ORACLE database.

引用

页码：268 / 273

页数：6

共 50 条

[31] Prosody conversion from neutral speech to emotional speech
Tao, Jianhua
Kang, Yongguo
Li, Aijun
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2006, 14 (04): : 1145 - 1154
[32] Unsupervised Adaptation of Categorical Prosody Models for Prosody Labeling and Speech Recognition
Ananthakrishnan, Sankaranarayanan
Narayanan, Shrikanth
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2009, 17 (01): : 138 - 149
[33] Prosody Dependent Mandarin Speech Recognition
Ni, Chong-Jia
Liu, Wen-Ju
Xu, Bo
2011 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2011, : 197 - 201
[34] M = Syntax plus Prosody: A syntactic-prosodic labelling scheme for large spontaneous speech databases
Batliner, A
Kompe, R
Kiessling, A
Mast, M
Niemann, H
Noth, E
SPEECH COMMUNICATION, 1998, 25 (04) : 193 - 222
[35] Automatic Analysis of Speech Prosody in Dutch
Hu, Na
Janssen, Berit
Hanssen, Judith
Gussenhoven, Carlos
Chen, Aoju
INTERSPEECH 2020, 2020, : 155 - 159
[36] PROSODY MODELING FOR MANDARIN EXCLAMATORY SPEECH
Jia, Huibin
Tao, Jianhua
ICME: 2009 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOLS 1-3, 2009, : 890 - 893
[37] Hemispheric roles in the perception of speech prosody
Gandour, J
Tong, YX
Wong, D
Talavage, T
Dzemidzic, M
Xu, YS
Li, XJ
Lowe, M
NEUROIMAGE, 2004, 23 (01) : 344 - 357
[38] Form and function in the representation of speech prosody
Hirst, DJ
SPEECH COMMUNICATION, 2005, 46 (3-4) : 334 - 347
[39] Vocalism and prosody of Lovrec's speech
Botica, Tomislava Bosnjak
Menac-Mihalic, Mira
RASPRAVE, 2006, 32 (01): : 25 - 41
[40] THE ROLE OF CALLOSAL CONNECTIONS IN SPEECH PROSODY
KLOUDA, GV
ROBIN, DA
GRAFFRADFORD, NR
COOPER, WE
BRAIN AND LANGUAGE, 1988, 35 (01) : 154 - 171

← 1 2 3 4 5 →