Repetition Detection in Stuttered Speech

被引:9
|
作者
Ramteke, Pravin B. [1 ]
Koolagudi, Shashidhar G. [1 ]
Afroz, Fathima [1 ]
机构
[1] Natl Inst Technol Karnataka, Surathkal 575025, Karnataka, India
关键词
MFCCs; Formants; Shimmer; Jitter; Dynamic time warping; CLASSIFICATION;
D O I
10.1007/978-81-322-2538-6_63
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper mainly focuses on detection of repetitions in stuttered speech. The stuttered speech signal is divided into isolated units based on energy. Mel-frequency cepstrum coefficients (MFCCs), formants and shimmer are used as features for repetition recognition. These features are extracted from each isolated unit. Using Dynamic Time Warping (DTW) the features of each isolated unit are compared with those subsequent units within one second interval of speech. Based on the analysis of scores obtained from DTW a threshold is set, if the score is below the set threshold then the units are identified as repeated events. Twenty seven seconds of speech data used in this work, consists of 50 repetition events. The result shows that the combination of MFCCs, formants and shimmer can be used for the recognition of repetitions in stuttered speech. Out of 50 repetitions, 47 are correctly identified.
引用
收藏
页码:611 / 617
页数:7
相关论文
共 50 条
  • [1] Recognition of Repetition and Prolongation in Stuttered Speech Using ANN
    Savin, P. S.
    Ramteke, Pravin B.
    Koolagudi, Shashidhar G.
    PROCEEDINGS OF 3RD INTERNATIONAL CONFERENCE ON ADVANCED COMPUTING, NETWORKING AND INFORMATICS (ICACNI 2015), VOL 1, 2016, 43 : 65 - 71
  • [2] Deep Learning Bidirectional LSTM based Detection of Prolongation and Repetition in Stuttered Speech using Weighted MFCC
    Gupta, Sakshi
    Shukla, Ravi S.
    Shukla, Rajesh K.
    Verma, Rajesh
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2020, 11 (09) : 345 - 356
  • [3] Speech Recognition and Correction of a Stuttered Speech
    Dash, Ankit
    Subramani, Nikhil
    Manjunath, Tejas
    Yaragarala, Vishruti
    Tripathi, Shikha
    2018 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI), 2018, : 1757 - 1760
  • [4] Repetition Detection in Dysarthric Speech
    Diwakar, G.
    Karjigi, Veena
    2017 2ND IEEE INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS, SIGNAL PROCESSING AND NETWORKING (WISPNET), 2017, : 1150 - 1154
  • [5] Gaze aversion to stuttered speech: a pilot study investigating differential visual attention to stuttered and fluent speech
    Bowers, Andrew L.
    Crawcour, Stephen C.
    Saltuklaroglu, Tim
    Kalinowski, Joseph
    INTERNATIONAL JOURNAL OF LANGUAGE & COMMUNICATION DISORDERS, 2010, 45 (02) : 133 - 144
  • [6] BEHAVIORAL DIMENSIONS OF STUTTERED SPEECH
    PRINS, D
    LOHR, F
    JOURNAL OF SPEECH AND HEARING RESEARCH, 1972, 15 (01): : 61 - +
  • [7] Intelligent Processing of Stuttered Speech
    Andrzej Czyzewski
    Andrzej Kaczmarek
    Bozena Kostek
    Journal of Intelligent Information Systems, 2003, 21 : 143 - 171
  • [8] Intelligent processing of stuttered speech
    Czyzewski, A
    Kaczmarek, A
    Kostek, B
    JOURNAL OF INTELLIGENT INFORMATION SYSTEMS, 2003, 21 (02) : 143 - 171
  • [9] Masseter Muscle Action Potentials In Stuttered And Non-Stuttered Speech
    Williams, Dean E.
    JOURNAL OF SPEECH AND HEARING DISORDERS, 1955, 20 (03): : 242 - 261
  • [10] Processing of self-repairs in stuttered and non-stuttered speech
    Lowder, Matthew W.
    Maxfield, Nathan D.
    Ferreira, Fernanda
    LANGUAGE COGNITION AND NEUROSCIENCE, 2020, 35 (01) : 93 - 105