Improved vowel region detection from a continuous speech using post processing of vowel onset points and vowel end-points

Authors
Ramakrishna Thirumuru
Suryakanth V. Gangashetty
Anil Kumar Vuppala
Affiliation
[1] International Institute of Information Technology Hyderabad, Language Technology Research Center
Source
Multimedia Tools and Applications, 2018, 77 (04)
Keywords
Vowel onset point (VOP); Vowel end-point (VEP); Zero frequency filtering; Magnitude spectrum; Epoch intervals; Strength of the excitation;
DOI
Not available
Abstract
Vowels are produced with an open configuration of the vocal tract, without any audible friction. The resulting acoustic signal is relatively loud, with impulse-like excitation of varying strength. Vowels carry significant energy in the low-frequency bands of the speech signal. Acoustic events such as the vowel onset point (VOP) and vowel end-point (VEP) can be used as landmarks to detect vowel regions in a speech signal. In this paper, a two-stage algorithm is proposed to detect vowel regions precisely. In the first stage, the speech signal is processed using zero frequency filtering to emphasize the energy content in its low-frequency bands; because it is filtered around 0 Hz, the zero frequency filtered signal predominantly contains the low-frequency content of the speech signal. Dominant spectral peaks are then extracted from the magnitude spectrum around the glottal closure regions of the speech signal. The VOPs and VEPs are obtained by convolving the enhanced spectral contour of the zero frequency filtered signal with a first-order Gaussian differentiator. In the second stage, post-processing is carried out in the regions around each VOP and VEP to remove spurious vowel regions based on the uniformity of epoch intervals. In addition, the positions of the VOPs and VEPs are corrected using the strength of excitation of the speech signal. The performance of the proposed vowel region detection method is compared with existing state-of-the-art methods on the TIMIT acoustic-phonetic speech corpus. The proposed method is reported to yield a significant improvement in vowel region detection in both clean and noisy environments.
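The abstract's step of convolving a contour with a first-order Gaussian differentiator can be illustrated with a minimal sketch. This is not the authors' implementation. The synthetic rectangular contour stands in for the enhanced spectral contour, and the kernel length, standard deviation, threshold, and the helper names `fogd_kernel`, `pick_peaks`, and `vowel_regions` are all assumptions chosen for illustration. Positive peaks of the convolved evidence are taken as candidate VOPs and negative peaks as candidate VEPs.

```python
import numpy as np


def fogd_kernel(length, sigma):
    """First-order Gaussian differentiator: sample-wise difference of a Gaussian.

    The kernel has `length` samples, with a positive lobe followed by a
    negative lobe. `length` and `sigma` are illustrative parameters.
    """
    n = np.arange(length + 1) - length / 2.0
    gauss = np.exp(-0.5 * (n / sigma) ** 2)
    return np.diff(gauss)


def pick_peaks(evidence, threshold):
    """Indices of local maxima of `evidence` that exceed `threshold`."""
    e = np.asarray(evidence, dtype=float)
    is_peak = (e[1:-1] > e[:-2]) & (e[1:-1] >= e[2:]) & (e[1:-1] > threshold)
    return np.flatnonzero(is_peak) + 1


def vowel_regions(vops, veps):
    """Pair each onset with the first end-point that follows it."""
    regions = []
    for onset in vops:
        later = veps[veps > onset]
        if later.size:
            regions.append((int(onset), int(later[0])))
    return regions


# Synthetic stand-in for the enhanced spectral contour (assumption):
# low outside the "vowel", high between samples 100 and 199.
contour = np.zeros(300)
contour[100:200] = 1.0

# Convolving with the differentiator yields the slope of the smoothed contour:
# a rising edge gives a positive peak, a falling edge a negative peak.
kernel = fogd_kernel(length=101, sigma=101 / 6.0)
evidence = np.convolve(contour, kernel, mode="same")

# Hypothetical threshold: half of the largest absolute evidence value.
threshold = 0.5 * np.max(np.abs(evidence))
vops = pick_peaks(evidence, threshold)    # candidate vowel onset points
veps = pick_peaks(-evidence, threshold)   # candidate vowel end-points
regions = vowel_regions(vops, veps)
print(regions)
```

On this toy contour the single detected region should start near sample 100 and end near sample 200. The post-processing described in the abstract (checking epoch-interval uniformity and correcting positions with the strength of excitation) is not covered by this sketch.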
Pages: 4753 - 4767
Number of pages: 14
Related papers
50 in total
  • [1] Improved vowel region detection from a continuous speech using post processing of vowel onset points and vowel end-points
    Thirumuru, Ramakrishna
    Gangashetty, Suryakanth V.
    Vuppala, Anil Kumar
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2018, 77 (04) : 4753 - 4767
  • [2] Spotting and Recognition of Consonant-Vowel Units from Continuous Speech Using Accurate Detection of Vowel Onset Points
    Anil Kumar Vuppala
    K. Sreenivasa Rao
    Saswat Chakrabarti
    [J]. Circuits, Systems, and Signal Processing, 2012, 31 : 1459 - 1474
  • [3] Spotting and Recognition of Consonant-Vowel Units from Continuous Speech Using Accurate Detection of Vowel Onset Points
    Vuppala, Anil Kumar
    Rao, K. Sreenivasa
    Chakrabarti, Saswat
    [J]. CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2012, 31 (04) : 1459 - 1474
  • [4] Improvements in the Detection of Vowel Onset and Offset Points in a Speech Sequence
    Avinash Kumar
    S. Shahnawazuddin
    Gayadhar Pradhan
    [J]. Circuits, Systems, and Signal Processing, 2017, 36 : 2315 - 2340
  • [5] Improved Vowel Onset and Offset Points Detection Using Bessel Features
    Sarma, Biswajit Dev
    Prajwal, Supreeth S.
    Prasanna, S. R. Mahadeva
    [J]. 2014 INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATIONS (SPCOM), 2014,
  • [6] Improvements in the Detection of Vowel Onset and Offset Points in a Speech Sequence
    Kumar, Avinash
    Shahnawazuddin, S.
    Pradhan, Gayadhar
    [J]. CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2017, 36 (06) : 2315 - 2340
  • [7] A Study on Vowel Region Detection from a Continuous Speech
    Thirumuru, Ramakrishna
    Vydana, Harikrishna
    Gangashetty, Suryakanth V.
    Vuppala, Anil Kumar
    [J]. MINING INTELLIGENCE AND KNOWLEDGE EXPLORATION (MIKE 2016), 2017, 10089 : 74 - 82
  • [8] Exploration of Vowel Onset and Offset Points for Hybrid Speech Segmentation
    Sarma, Biswajit Dev
    Sharma, Bidisha
    Shanmugam, S. Aswin
    Prasanna, S. R. Mahadeva
    Murthy, Hema A.
    [J]. TENCON 2015 - 2015 IEEE REGION 10 CONFERENCE, 2015,
  • [9] Issues in Formant Analysis of Emotive Speech Using Vowel-Like Region Onset Points
    Surya, R.
    Ashwini, R.
    Pravena, D.
    Govind, D.
    [J]. INTELLIGENT SYSTEMS TECHNOLOGIES AND APPLICATIONS, VOL 1, 2016, 384 : 139 - 146
  • [10] Excitation Source Features for Improving the Detection of Vowel Onset and Offset Points in a Speech Sequence
    Pradhan, Gayadhar
    Kumar, Avinash
    Shahnawazuddin, S.
    [J]. 18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 1884 - 1888