Repetition Detection in Stuttered Speech

被引:9
|
作者
Ramteke, Pravin B. [1 ]
Koolagudi, Shashidhar G. [1 ]
Afroz, Fathima [1 ]
机构
[1] Natl Inst Technol Karnataka, Surathkal 575025, Karnataka, India
关键词
MFCCs; Formants; Shimmer; Jitter; Dynamic time warping; CLASSIFICATION;
D O I
10.1007/978-81-322-2538-6_63
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper mainly focuses on detection of repetitions in stuttered speech. The stuttered speech signal is divided into isolated units based on energy. Mel-frequency cepstrum coefficients (MFCCs), formants and shimmer are used as features for repetition recognition. These features are extracted from each isolated unit. Using Dynamic Time Warping (DTW) the features of each isolated unit are compared with those subsequent units within one second interval of speech. Based on the analysis of scores obtained from DTW a threshold is set, if the score is below the set threshold then the units are identified as repeated events. Twenty seven seconds of speech data used in this work, consists of 50 repetition events. The result shows that the combination of MFCCs, formants and shimmer can be used for the recognition of repetitions in stuttered speech. Out of 50 repetitions, 47 are correctly identified.
引用
收藏
页码:611 / 617
页数:7
相关论文
共 50 条
  • [21] INCIPIENT STUTTERING AND SPONTANEOUS REMISSION OF STUTTERED SPEECH
    DICKSON, S
    JOURNAL OF COMMUNICATION DISORDERS, 1971, 4 (02) : 99 - 110
  • [22] Cross-language analysis of stuttered speech
    Rezaei-Aghbash, N
    Whiteside, SP
    Cudd, P
    JOURNAL OF FLUENCY DISORDERS, 2000, 25 (03) : 248 - 249
  • [23] LPC AND ITS DERIVATIVES FOR STUTTERED SPEECH RECOGNITION
    Alim, Sabur Ajibola
    Rashid, Nahrul Khair Alang
    Sediono, Wahju
    Hashim, Nik Nur Wahidah Nik
    JURNAL TEKNOLOGI, 2015, 77 (18): : 11 - 16
  • [24] ACOUSTIC ANALYSIS AND PERCEPTION OF VOWELS IN STUTTERED SPEECH
    HOWELL, P
    VAUSE, L
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1986, 79 (05): : 1571 - 1579
  • [25] Improving Speech to Text Alignment Based on Repetition Detection for Dysarthric Speech
    G. Diwakar
    Veena Karjigi
    Circuits, Systems, and Signal Processing, 2020, 39 : 5543 - 5567
  • [26] PERCEPTUAL AND ACOUSTIC ANALYSIS OF REPETITIONS IN STUTTERED SPEECH
    MONTGOMERY, AA
    COOKE, PA
    JOURNAL OF COMMUNICATION DISORDERS, 1976, 9 (04) : 317 - 330
  • [27] Improving Speech to Text Alignment Based on Repetition Detection for Dysarthric Speech
    Diwakar, G.
    Karjigi, Veena
    CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2020, 39 (11) : 5543 - 5567
  • [28] The effect of lexical constraints on spontaneous stuttered speech
    Koopmans, M
    Slis, I
    Rietveld, T
    CLINICAL LINGUISTICS & PHONETICS, 1996, 10 (03) : 207 - 223
  • [29] FURTHER ANALYSIS OF FLUENCY WITHIN STUTTERED SPEECH
    FEW, LR
    LINGWALL, JB
    JOURNAL OF SPEECH AND HEARING RESEARCH, 1972, 15 (02): : 356 - &
  • [30] Using Clinician Annotations to Improve Automatic Speech Recognition of Stuttered Speech
    Heeman, Peter A.
    Lunsford, Rebecca
    McMillin, Andy
    Yaruss, J. Scott
    17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 2651 - 2655