Vowel onset point detection for noisy speech using spectral energy at formant frequencies

被引:16
|
作者
Vuppala A.K. [1 ]
Rao K.S. [2 ]
机构
[1] LTRC, International Institute of Information Technology-Hyderabad, Hyderabad
[2] School of Information Technology, Indian Institute of Technology, Kharagpur
关键词
Excitation source; Formant frequencies; Glottal closure region; Modulation spectrum; Spectral peaks; Vowel onset point (VOP);
D O I
10.1007/s10772-012-9179-8
中图分类号
学科分类号
摘要
In this paper, we propose a method for robust detection of the vowel onset points (VOPs) from noisy speech. The proposed VOP detection method exploits the spectral energy at formant frequencies of the speech segments present in glottal closure region. In this work, formants are extracted by using group delay function, and glottal closure instants are extracted by using zero frequency filter based method. Performance of the proposed VOP detection method is compared with the existing method, which uses the combination of evidence from excitation source, spectral peaks energy and modulation spectrum. Speech data from TIMIT database and noise samples from NOISEX database are used for analyzing the performance of the VOP detection methods. Significant improvement in the performance of VOP detection is observed by using proposed method compared to existing method. © 2012 Springer Science+Business Media New York.
引用
收藏
页码:229 / 235
页数:6
相关论文
共 50 条
  • [1] Detection of vowel onset point in speech
    Prasanna, SRM
    Zachariah, JM
    [J]. 2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 4159 - 4159
  • [2] Homomorphic Filtered Spectral Peaks Energy for Automatic Detection of Vowel Onset Point in Continuous Speech
    Zang, Xian
    Chong, Kil To
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2013, E96D (04): : 949 - 956
  • [3] Using formant frequencies to word detection in recorded speech
    Laszko, Lukasz
    [J]. PROCEEDINGS OF THE 2016 FEDERATED CONFERENCE ON COMPUTER SCIENCE AND INFORMATION SYSTEMS (FEDCSIS), 2016, 8 : 797 - 801
  • [4] INTRINSIC VOWEL DURATION AND FORMANT FREQUENCIES - DATA FROM SPEECH ACQUISITION
    LIEBERMAN, P
    KUBASKA, C
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1979, 65 : S34 - S34
  • [5] Syllable Segmentation of Tamil Speech Signals Using Vowel Onset Point and Spectral Transition Measure
    Geetha K.
    Vadivel R.
    [J]. Automatic Control and Computer Sciences, 2018, 52 (1) : 25 - 31
  • [6] Voice onset time and formant onset frequencies in Arabic stuttered speech
    Al-Tamimi, Feda
    Howell, Peter
    [J]. CLINICAL LINGUISTICS & PHONETICS, 2021, 35 (06) : 493 - 508
  • [7] Vowel Onset Point Detection Using Source, Spectral Peaks, and Modulation Spectrum Energies
    Prasanna, S. R. Mahadeva
    Reddy, B. V. Sandeep
    Krishnamoorthy, P.
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2009, 17 (04): : 556 - 565
  • [8] Robust analysis for improvement of vowel onset point detection under noisy conditions
    Saha P.
    Baruah U.
    Laskar R.H.
    Mishra S.
    Choudhury S.P.
    Das T.K.
    [J]. International Journal of Speech Technology, 2016, 19 (3) : 433 - 448
  • [9] Vowel Onset Point Detection for Low Bit Rate Coded Speech
    Vuppala, Anil Kumar
    Yadav, Jainath
    Chakrabarti, Saswat
    Rao, K. Sreenivasa
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2012, 20 (06): : 1894 - 1903
  • [10] Issues in Formant Analysis of Emotive Speech Using Vowel-Like Region Onset Points
    Surya, R.
    Ashwini, R.
    Pravena, D.
    Govind, D.
    [J]. INTELLIGENT SYSTEMS TECHNOLOGIES AND APPLICATIONS, VOL 1, 2016, 384 : 139 - 146