A hybrid architecture for automatic segmentation of speech waveforms

被引:10
|
作者
Mporas, Iosif [1 ]
Ganchev, Todor [1 ]
Fakotakis, Nikos [1 ]
机构
[1] Univ Patras, Dept Elect & Comp Engn, Artificial Intelligence Grp, Wire Commun Lab, Rion 26500, Greece
关键词
speech segmentation; hidden Markov models; embedded training; isolated-unit training;
D O I
10.1109/ICASSP.2008.4518645
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In the present work, we propose a hybrid architecture for automatic alignment of speech waveforms and their corresponding phone sequence. The proposed architecture does not exploit any phone boundary information. Our approach combines the efficiency of embedded training techniques and the high performance of isolated-unit training. Evaluating on the established for the task of phone segmentation TIMIT database, we achieved an accuracy of 83.56%, which corresponds to improving the baseline system's accuracy by 6.09%.
引用
收藏
页码:4457 / 4460
页数:4
相关论文
共 50 条
  • [1] LINGUISTIC SEGMENTATION OF ACOUSTIC SPEECH WAVEFORMS
    RESNIKOFF, HL
    SITTON, GA
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1968, 44 (01): : 366 - +
  • [2] Automatic Speech Segmentation for Automatic Speech Translation
    Klosowski, Piotr
    Dustor, Adam
    [J]. COMPUTER NETWORKS, CN 2013, 2013, 370 : 466 - 475
  • [3] AUTOMATIC SEGMENTATION OF SPEECH
    VANHEMERT, JP
    [J]. IEEE TRANSACTIONS ON SIGNAL PROCESSING, 1991, 39 (04) : 1008 - 1012
  • [4] Automatic Segmentation of Spontaneous Speech
    Bigi, Brigitte
    Meunier, Christine
    [J]. REVISTA DE ESTUDOS DA LINGUAGEM, 2018, 26 (04) : 1489 - 1530
  • [5] AUTOMATIC SEGMENTATION OF SPEECH INTO DIPHONES
    VANHEMERT, JP
    [J]. PHILIPS TECHNICAL REVIEW, 1987, 43 (09): : 233 - 242
  • [6] Automatic Speech Segmentation in French
    Martin, Philippe
    [J]. REVISTA DE ESTUDOS DA LINGUAGEM, 2018, 26 (04) : 1551 - 1570
  • [7] Automatic sentence segmentation of speech for automatic summarization
    Mrozinski, Joanna
    Whittaker, Edward W. D.
    Chatain, Pierre
    Furui, Sadaoki
    [J]. 2006 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-13, 2006, : 981 - 984
  • [8] ''Blind'' speech segmentation: Automatic segmentation of speech without linguistic knowledge
    Sharma, M
    Mammone, R
    [J]. ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 1237 - 1240
  • [9] On the Influence of Automatic Segmentation and Clustering in Automatic Speech Recognition
    Lopez-Otero, Paula
    Docio-Fernandez, Laura
    Garcia-Mateo, Carmen
    Cardenal-Lopez, Antonio
    [J]. ADVANCES IN SPEECH AND LANGUAGE TECHNOLOGIES FOR IBERIAN LANGUAGES, 2012, 328 : 49 - 58
  • [10] EXPERIMENTS IN AUTOMATIC SEGMENTATION OF CONTINUOUS SPEECH
    DEMORI, R
    [J]. JOURNAL OF THE AUDIO ENGINEERING SOCIETY, 1974, 22 (04): : 286 - 286