A New Approach for Phoneme Segmentation of Speech Signals

被引:0
|
作者
Golipour, Ladan [1 ]
O'Shaughnessy, Douglas [1 ]
机构
[1] Univ Quebec, INRS, EMT, Montreal, PQ H3C 3P8, Canada
关键词
phoneme segmentation; auditory spectrum; group-delay function; speech spectrogram;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we present a new method for segmenting speech at the phoneme level. For this purpose, we use the short-time Fourier transform of the speech signal. The goal is to recognize the locations of main energy changes in frequency over time, which can be described as phoneme boundaries. We apply a sub-band analysis and search for energy changes in individual bands as well to obtain further precision. Moreover, we employ the modified group-delay function to achieve a more clear representation of the locations of boundaries, and smooth out the undesired fluctuations of the signal. We also study the use of an auditory spectrogram instead of a regular spectrogram in the segmentation process. Since this method merely utilizes the power spectrum of the signal for segmentation, there is no need for any adaptation of the parameters or training for different speakers in advance. In addition, no transcript information such as the phonemes themselves or voiced/unvoiced decision making is required. The method was tested over the phoneticallydiverse part of the Timit database, and the results show that 87% of the boundaries are successfully recognized.
引用
收藏
页码:2296 / 2299
页数:4
相关论文
共 50 条
  • [1] Phoneme segmentation of speech
    Ziolko, Bartosz
    Manandhar, Suresh
    Wilson, Richard C.
    [J]. 18TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 4, PROCEEDINGS, 2006, : 282 - +
  • [2] Stochastic Filter Approaches for a Phoneme-Based Segmentation of Speech Signals
    Rauh, Andreas
    Tiede, Susann
    Klenke, Cornelia
    [J]. 2016 21ST INTERNATIONAL CONFERENCE ON METHODS AND MODELS IN AUTOMATION AND ROBOTICS (MMAR), 2016, : 732 - 737
  • [3] A NEW STATISTICAL APPROACH FOR THE AUTOMATIC SEGMENTATION OF CONTINUOUS SPEECH SIGNALS
    ANDREOBRECHT, R
    [J]. IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1988, 36 (01): : 29 - 40
  • [4] Phoneme Segmentation of Speech Signal
    Goh, Y. H.
    Raveendran, P.
    [J]. 2009 INTERNATIONAL CONFERENCE FOR TECHNICAL POSTGRADUATES (TECHPOS 2009), 2009, : 150 - 152
  • [5] Phoneme Segmentation-Based Unsupervised Pattern Discovery and Clustering of Speech Signals
    Ravi, Kishore Kumar
    Krothapalli, Sreenivasa Rao
    [J]. CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2022, 41 (04) : 2088 - 2117
  • [6] Phoneme Segmentation-Based Unsupervised Pattern Discovery and Clustering of Speech Signals
    Kishore Kumar Ravi
    Sreenivasa Rao Krothapalli
    [J]. Circuits, Systems, and Signal Processing, 2022, 41 : 2088 - 2117
  • [7] A new search algorithm in segmentation lattices of speech signals
    Husson, JL
    Laprie, Y
    [J]. ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 2099 - 2102
  • [8] A New Approach to Securing Speech Signals
    Belmeguenai, Aissa
    Ahmida, Zahir
    Djemili, Rafik
    [J]. 2017 10TH INTERNATIONAL CONFERENCE ON DEVELOPMENTS IN ESYSTEMS ENGINEERING (DESE 2017), 2017, : 77 - 82
  • [9] Speech/Non-Speech Segmentation Based on Phoneme Recognition Features
    Janez Žibert
    Nikola Pavešić
    France Mihelič
    [J]. EURASIP Journal on Advances in Signal Processing, 2006
  • [10] Speech/non-speech segmentation based on phoneme recognition features
    Zibert, Janez
    Pavesic, Nikola
    Mihelic, France
    [J]. EURASIP JOURNAL ON APPLIED SIGNAL PROCESSING, 2006, 2006 (1)