Study of relationships between intra-speaker's speech variability and speech recognition performance

被引:0
|
作者
Tsuge, Satoru [1 ]
Fukumi, Minoru [1 ]
Shishibori, Masami [1 ]
Ren, Fuji [1 ]
Kita, Kenji [1 ]
Kuroiwa, Shingo [1 ]
机构
[1] Univ Tokushima, 2-1 Minami Josanjima, Tokushima 7708506, Japan
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Even if a speaker uses a speaker-dependent speech recognition system, speech recognition performance varies. For this reason, speech quality is varied by some factors, which are including emotion, background noise, and so on, even though the speaker and utterance remain constant. However, the relationships between intra-speaker's speech variability and speech recognition performance are not clear. Hence, we focus on the intra-speaker's speech variability which affects the speech recognition performances. To investigate these relationships, we have been collecting speech data since November 2002. Using a part of the speech corpus, we conducted speech recognition experiments. In this paper, we analyze the relationships between intra-speaker's speech variability and the phoneme accuracy by using the correlation analysis. For factors of the correlation analysis, we use a number of errors, a speaking rate, a likelihood. Analysis results show a strong correlation between the number of the substitution errors and the phoneme accuracy although the correlations of the number of the deletion and the insertion errors are low. Therefore, it is considered that there are overlaps between phonemes since the feature parameters vary at each speaking rate. For improving the phoneme accuracy, it is needed that we study a method which discriminates phonemes. On the other hand, although the correlation between the phoneme accuracy and the speaking rate seems to be low, a strong correlation between the speaking rate and the number of deletion errors and insertion errors are found. Since the number of the insertion errors and the number of the deletion errors were in the counterbalance relation, the correlation between the speaking rate and the phoneme accuracy was low. However, we consider that it is needed to normalize the speaking rate because the speaking rate influences on the number of the deletion and the insertion errors.
引用
收藏
页码:33 / +
页数:2
相关论文
共 50 条
  • [1] Study of intra-speaker's speech variability over long and short time periods for speech recognition
    Tsuge, Satoru
    Shishibori, Masami
    Kita, Kenji
    Ren, Fuji
    Kuroiwa, Shingo
    [J]. 2006 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-13, 2006, : 397 - 400
  • [2] On the issues of intra-speaker variability and realism in speech, speaker, and language recognition tasks
    Hansen, John H. L.
    Boril, Hynek
    [J]. SPEECH COMMUNICATION, 2018, 101 : 94 - 108
  • [3] INTRA-SPEAKER VARIABILITY OF THE LONG-TERM SPEECH SPECTRUM
    HARMEGNIES, B
    LANDERCY, A
    [J]. SPEECH COMMUNICATION, 1988, 7 (01) : 81 - 86
  • [4] An amplitude warping approach to intra-speaker normalization for speech recognition
    Hong, KS
    [J]. COMPUTATIONAL SCIENCE AND ITS APPLICATIONS - ICCSA 2003, PT 2, PROCEEDINGS, 2003, 2668 : 639 - 645
  • [5] On the sources of inter- & intra-speaker variability in the acoustic dynamics of speech
    Yang, X
    Millar, JB
    Macleod, I
    [J]. ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 1792 - 1795
  • [6] Unsupervised intra-speaker variability compensation based on Gestalt and model adaptation in speaker verification with telephone speech
    Yoma, Nestor Becerra
    Garreton, Claudio
    Molina, Carlos
    Huenupan, Fernando
    [J]. SPEECH COMMUNICATION, 2008, 50 (11-12) : 953 - 964
  • [7] Intra-speaker and inter-speaker variability in speech sound pressure level across repeated readings
    Castellana, Antonella
    Carullo, Alessio
    Astolfi, Arianna
    Puglisi, Giuseppina Emma
    Fugiglando, Umberto
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2017, 141 (04): : 2353 - 2363
  • [8] Intra-speaker variability effects on Speaker Verification performance
    Kahn, Juliette
    Audibert, Nicolas
    Rossato, Solange
    Bonastre, Jean-Francois
    [J]. ODYSSEY 2010: THE SPEAKER AND LANGUAGE RECOGNITION WORKSHOP, 2010, : 109 - 116
  • [9] Intra-speaker phonetic variation in read speech: comparison with inter-speaker variability in a controlled population
    Audibert, Nicolas
    Fougeronl, Cecile
    [J]. INTERSPEECH 2022, 2022, : 4755 - 4759
  • [10] Variable pronunciations reveal dynamic intra-speaker variation in speech planning
    Oriana Kilbourn-Ceron
    Matthew Goldrick
    [J]. Psychonomic Bulletin & Review, 2021, 28 : 1365 - 1380