Recognition and Classification of Pauses in Stuttered Speech using Acoustic Features

Cited by: 0
Authors
Afroz, Fathima [1]
Koolagudi, Shashidhar G. [2]
Affiliations
[1] JSS Acad Tech Educ Bangalore, Dept Informat Sci & Engn, JSSATE B Campus Dr Vishnuvardan Rd, Bengaluru 560060, Karnataka, India
[2] NITK, Dept Comp Sci & Engn, NH 66, Mangaluru 575025, Karnataka, India
Keywords
Acoustic features; Blind segmentation; Inter-morphic pauses; Intra-morphic pauses; Stuttered speech
DOI
10.1109/spin.2019.8711569
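Chinese Library Classification (CLC) number
TM [Electrical engineering]; TN [Electronics and communication technology]
Discipline classification codes
0808; 0809
Abstract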
Pauses play an essential role in speech. Normally they help the listener by creating time and space to decode and interpret the speaker's message. In stuttering, however, pauses disturb the normal flow of speech. The uncontrolled, frequent, and unplanned occurrence of pauses leads to a slow speaking rate, results in broken words, and increases the severity of stuttering. Pauses and stuttering are therefore closely related, and pauses are considered an important pattern in the diagnosis and treatment of stuttering. In this work, an attempt is made to identify inaudible (silent or unfilled) pauses in stuttered speech. Attributes such as the duration, frequency, position, and distribution of pauses during speech tasks are measured and quantified. The UCLASS stuttered speech corpus is used for the analysis. An automatic blind segmentation approach is adopted to segment the speech signal into voiced and unvoiced regions using a dynamic threshold based on energy and zero-crossing rate (ZCR). Fourth-formant frequencies are analysed to identify intra-morphic (unfilled) pauses within voiced regions. The durations of intra-morphic pauses are analysed for stuttered and normal speech. In normal speech, intra-morphic pauses are observed to range from 150 ms to 250 ms, inter-morphic pauses are <= 250 ms, and short pauses range from 50 ms to 150 ms. In stuttered speech, short intra-morphic pauses range from 10 ms to 50 ms and long pauses from 250 ms to 1 or 2 seconds. Segmentation of intra-morphic pauses is observed to achieve an accuracy of 98%. The results are compared with and validated against a manual method.
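The energy and ZCR segmentation step described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not state how the dynamic threshold is set, so the rule used here (a fraction of the recording's mean frame energy, plus a fixed ZCR ceiling) is an assumption, as are the function names `frame_features` and `find_pauses` and the parameters `frame_ms`, `energy_factor`, and `zcr_max`. The formant-based step is not covered.

```python
import math

def frame_features(samples, frame_len):
    """Return (short-time energy, zero-crossing rate) for each non-overlapping frame."""
    feats = []
    for start in range(0, len(samples) - frame_len + 1, frame_len):
        frame = samples[start:start + frame_len]
        energy = sum(x * x for x in frame) / frame_len
        # Count sign changes between consecutive samples.
        crossings = sum(1 for a, b in zip(frame, frame[1:]) if (a >= 0) != (b >= 0))
        feats.append((energy, crossings / (frame_len - 1)))
    return feats

def find_pauses(samples, sample_rate, frame_ms=10, energy_factor=0.1, zcr_max=0.25):
    """Return (start_ms, duration_ms) for each run of non-voiced frames.

    Assumed rule (not from the paper): a frame counts as voiced when its
    energy exceeds energy_factor * mean frame energy and its ZCR is below zcr_max.
    """
    frame_len = sample_rate * frame_ms // 1000
    feats = frame_features(samples, frame_len)
    if not feats:
        return []
    mean_energy = sum(e for e, _ in feats) / len(feats)
    energy_thr = energy_factor * mean_energy  # threshold adapts to the recording
    voiced = [e > energy_thr and z < zcr_max for e, z in feats]

    pauses, run_start = [], None
    for i, is_voiced in enumerate(voiced + [True]):  # sentinel closes a trailing run
        if not is_voiced and run_start is None:
            run_start = i
        elif is_voiced and run_start is not None:
            pauses.append((run_start * frame_ms, (i - run_start) * frame_ms))
            run_start = None
    return pauses

# Synthetic check: 300 ms of a 200 Hz tone, 200 ms of silence, 300 ms of tone.
rate = 8000
tone = [0.5 * math.sin(2 * math.pi * 200 * n / rate) for n in range(2400)]
signal = tone + [0.0] * 1600 + tone
print(find_pauses(signal, rate))
```

The returned durations could then be compared against the ranges reported in the abstract, for example 10 ms to 50 ms for short intra-morphic pauses in stuttered speech. Real recordings contain background noise, so the threshold values would need tuning.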
Pages: 921-926
Number of pages: 6
Related papers
50 in total
  • [1] Speech Recognition and Correction of a Stuttered Speech
    Dash, Ankit
    Subramani, Nikhil
    Manjunath, Tejas
    Yaragarala, Vishruti
    Tripathi, Shikha
    2018 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI), 2018, : 1757 - 1760
  • [2] Speech Emotion Classification using Acoustic Features
    Chen, Shizhe
    Jin, Qin
    Li, Xirong
    Yang, Gang
    Xu, Jieping
    2014 9TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2014, : 579 - 583
  • [3] Recognition of Repetition and Prolongation in Stuttered Speech Using ANN
    Savin, P. S.
    Ramteke, Pravin B.
    Koolagudi, Shashidhar G.
    PROCEEDINGS OF 3RD INTERNATIONAL CONFERENCE ON ADVANCED COMPUTING, NETWORKING AND INFORMATICS (ICACNI 2015), VOL 1, 2016, 43 : 65 - 71
  • [4] Using Clinician Annotations to Improve Automatic Speech Recognition of Stuttered Speech
    Heeman, Peter A.
    Lunsford, Rebecca
    McMillin, Andy
    Yaruss, J. Scott
    17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 2651 - 2655
  • [5] SOME ACOUSTIC ASPECTS OF STUTTERED SPEECH
    AGNELLO, JG
    WINGATE, ME
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1972, 52 (01): : 159 - &
  • [6] ACOUSTIC ANALYSIS AND PERCEPTION OF VOWELS IN STUTTERED SPEECH
    HOWELL, P
    VAUSE, L
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1986, 79 (05): : 1571 - 1579
  • [7] PERCEPTUAL AND ACOUSTIC ANALYSIS OF REPETITIONS IN STUTTERED SPEECH
    MONTGOMERY, AA
    COOKE, PA
    JOURNAL OF COMMUNICATION DISORDERS, 1976, 9 (04) : 317 - 330
  • [8] Classification of Speech with and without Face Mask using Acoustic Features
    Das, Rohan Kumar
    Li, Haizhou
    2020 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2020, : 747 - 752
  • [9] LPC AND ITS DERIVATIVES FOR STUTTERED SPEECH RECOGNITION
    Alim, Sabur Ajibola
    Rashid, Nahrul Khair Alang
    Sediono, Wahju
    Hashim, Nik Nur Wahidah Nik
    JURNAL TEKNOLOGI, 2015, 77 (18): : 11 - 16
  • [10] Acoustic feature analysis and discriminative modeling of filled pauses for spontaneous speech recognition
    Wu, CH
    Yan, GL
    JOURNAL OF VLSI SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2004, 36 (2-3): : 91 - 104