Storing prosody attributes of spontaneous speech

被引:0
|
作者
Klecková, J [1 ]
机构
[1] Univ W Bohemia, Fac Sci Appl, Dept Comp Sci, CZ-30614 Plzen, Czech Republic
来源
TEXT, SPEECH AND DIALOGUE | 1999年 / 1692卷
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The paper deals with the problem of the storage of the prosody attributes which have been used in the Czech dialog system. In Czech language featured by a free-word-ordering the prosody serves a critical information for the recognition and understanding system. For some sentences the intonation is essential to determine the core of a communication, beeing used by a speaker who to emphasize a meaning of a sentence. The prosodic characteristics included in the sentence (features describing fundamental frequency F0, voice energy, the length of a pause behind and before the word, the speaking rate, flags indicating word finality and the lexical word accent) are stored in the database and consequently exploited by the linguistic module as an additional information used for recognizing and understanding the spontaneous speech. In case of storing digital speech signal in database we meet a problem of its high redundancy. Another problem is choosing cut points for segmenting speech. Suitable points are pauses, but they are not often present in the fluent speech. Instead a coarticulation between phonemes makes the placement of the cut points difficult. Their suitably for segmenting speech should be dependent on a context information. Having processed the characteristics by usual methods of statistics the database can also be used to generate answers in the dialog system. The module was implemented in the C language and supported by the ORACLE database.
引用
收藏
页码:268 / 273
页数:6
相关论文
共 50 条
  • [1] Leveraging Prosody for Punctuation Prediction of Spontaneous Speech
    Cho, Jenny Yeonjin
    Ng, Sara
    Trang Tran
    Ostendorf, Mari
    INTERSPEECH 2022, 2022, : 555 - 559
  • [2] Hierarchical prosody modeling for Mandarin spontaneous speech
    Lin, Cheng-Hsien
    You, Chung-Long
    Chiang, Chen-Yu
    Wang, Yih-Ru
    Chen, Sin-Horng
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2019, 145 (04): : 2576 - 2596
  • [3] Important prosody characteristics for spontaneous speech recognition
    Kleckova, J
    Krutisova, J
    Matousek, V
    Schwarz, J
    ICONIP'02: PROCEEDINGS OF THE 9TH INTERNATIONAL CONFERENCE ON NEURAL INFORMATION PROCESSING: COMPUTATIONAL INTELLIGENCE FOR THE E-AGE, 2002, : 717 - 721
  • [4] Prosody for Mandarin Speech Recognition: a Comparative Study of Read and Spontaneous Speech
    Yeung, Yu Ting
    Qian, Yao
    Lee, Tan
    Soong, Frank K.
    INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 1133 - +
  • [5] Modeling prosody for language identification on read and spontaneous speech
    Rouas, JL
    Farinas, J
    Pellegrino, F
    André-Obrecht, R
    2003 INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOL I, PROCEEDINGS, 2003, : 753 - 756
  • [6] Modeling prosody for language identification on read and spontaneous speech
    Rouas, JL
    Farinas, J
    Pellegrino, F
    André-Obrecht, R
    2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING I, 2003, : 40 - 43
  • [7] Computing prosody: Computational models for processing spontaneous speech
    Jassem, W
    PHONETICA, 1999, 56 (3-4) : 200 - 201
  • [8] A MODEL FOR PREDICTING THE PROSODY OF SPONTANEOUS SPEECH (PPSS MODEL)
    ROSSI, M
    SPEECH COMMUNICATION, 1993, 13 (1-2) : 87 - 107
  • [9] Summarization of Spontaneous Speech using Automatic Speech Recognition and a Speech Prosody based Tokenizer
    Szaszak, Gyorgy
    Tundik, Mate Akos
    Beke, Andras
    KDIR: PROCEEDINGS OF THE 8TH INTERNATIONAL JOINT CONFERENCE ON KNOWLEDGE DISCOVERY, KNOWLEDGE ENGINEERING AND KNOWLEDGE MANAGEMENT - VOL. 1, 2016, : 221 - 227
  • [10] Speech Recognition with Word Fragment Detection Using Prosody Features for Spontaneous Speech
    Yeh, Jui-Feng
    Yen, Ming-Chi
    APPLIED MATHEMATICS & INFORMATION SCIENCES, 2012, 6 (02): : 669S - 675S