Constructing multi-level speech database for spontaneous speech processing

被引:0
|
作者
Hahn, M
Kim, S
Lee, JC
Lee, YJ
机构
关键词
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This paper describes a new database, called muti-level speech database, for spontaneous speech processing. We designed the database to cover textual and acoustic variations from declarative speech to spontaneous speech. The database is composed of 5 categories which are, in the order of decreasing spontaneity, spontaneous speech, interview, simulated interview, declarative speech with context, and declarative speech without context. We collected total 112 sets from 23 subjects(male: 19, female: 4). Then the database was firstly transcribed using 15 transcription symbols according to our own transcription rules. Secondly, prosodic information will be added. The goal of this research is a comparative textual and prosodic analysis at each level, quantification of spontaneity of diversified speech database for dialogue speech synthesis and recognition. From the preliminary analysis of transcribed texts, the spontaneous speech has more corrections, repetitions, and pauses than the others as expected. In addition, average number of sentences per turn of spontaneous speech is greater than the others. From the above results, we can quantify the spontaneity of speech database.
引用
收藏
页码:1930 / 1933
页数:4
相关论文
共 50 条
  • [1] Multi-level annotation in SpeeCon Polish Speech Database
    Marasek, K
    Gubrynowicz, R
    [J]. INTELLIGENT MEDIA TECHNOLOGY FOR COMMUNICATIVE INTELLIGENCE, 2005, 3490 : 58 - 67
  • [2] Multi-level annotation in the Emu speech database management system
    Cassidy, S
    Harrington, J
    [J]. SPEECH COMMUNICATION, 2001, 33 (1-2) : 61 - 77
  • [3] Multi-level Adaptive Speech Activity Detector for Speech in Naturalistic Environments
    Sharma, Bidisha
    Das, Rohan Kumar
    Li, Haizhou
    [J]. INTERSPEECH 2019, 2019, : 2015 - 2019
  • [4] The Multi-level Approach to Speech Corpora Annotation for Automatic Speech Recognition
    Glavatskih, Igor
    Platonova, Tatyana
    Rogozhina, Valeria
    Shirokova, Anna
    Smolina, Anna
    Kotov, Mikhail
    Ovsyannikova, Anna
    Repalov, Sergey
    Zulkarneev, Mikhail
    [J]. SPEECH AND COMPUTER (SPECOM 2015), 2015, 9319 : 438 - 445
  • [5] Topic Refinement in Multi-level Hate Speech Detection
    Bourgeade, Tom
    Chiril, Patricia
    Benamara, Farah
    Moriceau, Veronique
    [J]. ADVANCES IN INFORMATION RETRIEVAL, ECIR 2023, PT II, 2023, 13981 : 367 - 376
  • [6] An Ensemble Model for Multi-Level Speech Emotion Recognition
    Zheng, Chunjun
    Wang, Chunli
    Jia, Ning
    [J]. APPLIED SCIENCES-BASEL, 2020, 10 (01):
  • [7] Speech Emotion Recognition via Multi-Level Attention Network
    Liu, Ke
    Wang, Dekui
    Wu, Dongya
    Liu, Yutao
    Feng, Jun
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2022, 29 : 2278 - 2282
  • [8] A multi-level description of date expressions in German telephone speech
    Draxler, C
    [J]. ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 1906 - 1909
  • [9] Development of Multi-Level Speech based Person Authentication System
    Das, Rohan Kumar
    Jelil, Sarfaraz
    Prasanna, S. R. Mahadeva
    [J]. JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2017, 88 (03): : 259 - 271
  • [10] Multi-Level Adaptive Network for Accented Mandarin Speech Recognition
    Wang, Huiyong
    Wang, Lan
    Liu, Xunying
    [J]. 2014 4TH IEEE INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE AND TECHNOLOGY (ICIST), 2014, : 602 - 605