Study of intra-speaker's speech variability over long and short time periods for speech recognition

被引:0
|
作者
Tsuge, Satoru [1 ]
Shishibori, Masami [1 ]
Kita, Kenji [1 ]
Ren, Fuji [1 ]
Kuroiwa, Shingo [1 ]
机构
[1] Univ Tokushima, Tokushima 770, Japan
关键词
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this paper, we describe a Japanese speech corpus collected for investigating the speech variability of a specific speaker over short and long time periods and then report the variability of speech recognition performance over short and long time periods. Although speakers use a speaker-dependent speech recognition system, it is known that speech recognition performance varies pending when the utterance was uttered. This is because speech quality varies by occasion even if the speaker and utterance remain constant. However, the relationships between intra-speaker speech variability and speech recognition performance are not clear. Hence, we have been collecting speech data to investigate these relationships since November 2002. In this paper, we introduce our speech corpus and report speech recognition experiments using our corpus. Experimental results show that the variability of recognition performance over different days is larger than variability of recognition performance within a day.
引用
收藏
页码:397 / 400
页数:4
相关论文
共 50 条
  • [1] Study of relationships between intra-speaker's speech variability and speech recognition performance
    Tsuge, Satoru
    Fukumi, Minoru
    Shishibori, Masami
    Ren, Fuji
    Kita, Kenji
    Kuroiwa, Shingo
    [J]. 2006 INTERNATIONAL SYMPOSIUM ON INTELLIGENT SIGNAL PROCESSING AND COMMUNICATIONS, VOLS 1 AND 2, 2006, : 33 - +
  • [2] INTRA-SPEAKER VARIABILITY OF THE LONG-TERM SPEECH SPECTRUM
    HARMEGNIES, B
    LANDERCY, A
    [J]. SPEECH COMMUNICATION, 1988, 7 (01) : 81 - 86
  • [3] On the issues of intra-speaker variability and realism in speech, speaker, and language recognition tasks
    Hansen, John H. L.
    Boril, Hynek
    [J]. SPEECH COMMUNICATION, 2018, 101 : 94 - 108
  • [4] Data collection for investigating speech variability in a specific speaker over long and short time periods
    Tsuge, S
    Shishibori, M
    Ren, F
    Kita, K
    Kuroiwa, S
    [J]. PROCEEDINGS OF THE 2005 IEEE INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING AND KNOWLEDGE ENGINEERING (IEEE NLP-KE'05), 2005, : 152 - 157
  • [5] An amplitude warping approach to intra-speaker normalization for speech recognition
    Hong, KS
    [J]. COMPUTATIONAL SCIENCE AND ITS APPLICATIONS - ICCSA 2003, PT 2, PROCEEDINGS, 2003, 2668 : 639 - 645
  • [6] On the sources of inter- & intra-speaker variability in the acoustic dynamics of speech
    Yang, X
    Millar, JB
    Macleod, I
    [J]. ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 1792 - 1795
  • [7] Unsupervised intra-speaker variability compensation based on Gestalt and model adaptation in speaker verification with telephone speech
    Yoma, Nestor Becerra
    Garreton, Claudio
    Molina, Carlos
    Huenupan, Fernando
    [J]. SPEECH COMMUNICATION, 2008, 50 (11-12) : 953 - 964
  • [8] Intra-speaker and inter-speaker variability in speech sound pressure level across repeated readings
    Castellana, Antonella
    Carullo, Alessio
    Astolfi, Arianna
    Puglisi, Giuseppina Emma
    Fugiglando, Umberto
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2017, 141 (04): : 2353 - 2363
  • [9] Intra-speaker phonetic variation in read speech: comparison with inter-speaker variability in a controlled population
    Audibert, Nicolas
    Fougeronl, Cecile
    [J]. INTERSPEECH 2022, 2022, : 4755 - 4759
  • [10] Variable pronunciations reveal dynamic intra-speaker variation in speech planning
    Oriana Kilbourn-Ceron
    Matthew Goldrick
    [J]. Psychonomic Bulletin & Review, 2021, 28 : 1365 - 1380