Analysis and recognition of spontaneous speech using Corpus of Spontaneous Japanese

Cited by: 16
Authors
Furui, S [1 ]
Nakamura, M [1 ]
Ichiba, T [1 ]
Iwano, K [1 ]
Affiliation
[1] Tokyo Inst Technol, Dept Comp Sci, Meguro Ku, Tokyo 1528552, Japan
Funding
Japan Science and Technology Agency;
Keywords
spontaneous speech; Corpus of Spontaneous Japanese; automatic speech recognition; cepstrum; speaking rate;
DOI
10.1016/j.specom.2005.02.010
Chinese Library Classification number
O42 [Acoustics];
Discipline classification codes
070206; 082403;
Abstract
Although speech is in almost any situation spontaneous, recognition of spontaneous speech is an area which has only recently emerged in the field of automatic speech recognition. Broadening the application of speech recognition depends crucially on raising recognition performance for spontaneous speech. For this purpose, it is necessary to analyze and model spontaneous speech using spontaneous speech databases, since spontaneous speech and read speech are significantly different. This paper reports analysis and recognition of spontaneous speech using a large-scale spontaneous speech database "Corpus of Spontaneous Japanese (CSJ)". Recognition results in this experiment show that recognition accuracy significantly increases as a function of the size of acoustic as well as language model training data and the improvement levels off at approximately 7M words of training data. This means that acoustic and linguistic variation of spontaneous speech is so large that we need a very large corpus in order to encompass the variations. Spectral analysis using various styles of utterances in the CSJ shows that the spectral distribution/difference of phonemes is significantly reduced in spontaneous speech compared to read speech. It has also been observed that speaking rates of both vowels and consonants in spontaneous speech are significantly faster than those in read speech. (C) 2005 Elsevier B.V. All rights reserved.
Pages: 208-219
Page count: 12
Related papers
50 in total
  • [1] Gemination of consonant in spontaneous speech: An analysis of the "Corpus of Spontaneous Japanese"
    Fujimoto, M
    Kagomiya, T
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2005, E88D (03): 562-568
  • [2] Morphological analysis of a large spontaneous speech corpus in Japanese
    Uchimoto, K
    Nobata, C
    Yamada, A
    Sekine, S
    Isahara, H
    [J]. 41ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, PROCEEDINGS OF THE CONFERENCE, 2003: 479-488
  • [3] KsponSpeech: Korean Spontaneous Speech Corpus for Automatic Speech Recognition
    Bang, Jeong-Uk
    Yun, Seung
    Kim, Seung-Hi
    Choi, Mu-Yeol
    Lee, Min-Kyu
    Kim, Yeo-Jeong
    Kim, Dong-Hyun
    Park, Jun
    Lee, Young-Jik
    Kim, Sang-Hun
    [J]. APPLIED SCIENCES-BASEL, 2020, 10 (19): 1-17
  • [4] An automatic speech recognition system for spontaneous Punjabi speech corpus
    Kumar Y.
    Singh N.
    [J]. International Journal of Speech Technology, 2017, 20 (2): 297-303
  • [5] Large-vocabulary spontaneous speech recognition using a corpus of lectures
    Nishimura, M
    Itoh, N
    [J]. ELECTRONICS AND COMMUNICATIONS IN JAPAN PART III-FUNDAMENTAL ELECTRONIC SCIENCE, 2003, 86 (08): 52-60
  • [6] Spontaneous Speech Recognition for the Credit Card Corpus Using the HTK Toolkit
    Young, Stephen J.
    Woodland, Philip C.
    Byrne, William J.
    [J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1994, 2 (04): 615-621
  • [7] Morphological Annotation of a Large Spontaneous Speech Corpus in Japanese
    Uchimoto, Kiyotaka
    Isahara, Hitoshi
    [J]. 20TH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2007: 1731-1737
  • [8] Morphological analysis of the corpus of spontaneous Japanese
    Uchimoto, K
    Takaoka, K
    Nobata, C
    Yamada, A
    Sekine, S
    Isahara, H
    [J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2004, 12 (04): 382-390
  • [9] Analysis of spontaneous Japanese in a multi-language telephone-speech corpus
    Arai, Takayuki
    Warner, Natasha
    Greenberg, Steven
    [J]. ACOUSTICAL SCIENCE AND TECHNOLOGY, 2007, 28 (01): 46-48
  • [10] Analysis of humor in a corpus of spontaneous child speech
    Garrote, Marta
    [J]. SPANISH IN CONTEXT, 2021, 18 (01): 30-55