A hybrid architecture for automatic segmentation of speech waveforms

被引:10
|
作者
Mporas, Iosif [1 ]
Ganchev, Todor [1 ]
Fakotakis, Nikos [1 ]
机构
[1] Univ Patras, Dept Elect & Comp Engn, Artificial Intelligence Grp, Wire Commun Lab, Rion 26500, Greece
关键词
speech segmentation; hidden Markov models; embedded training; isolated-unit training;
D O I
10.1109/ICASSP.2008.4518645
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In the present work, we propose a hybrid architecture for automatic alignment of speech waveforms and their corresponding phone sequence. The proposed architecture does not exploit any phone boundary information. Our approach combines the efficiency of embedded training techniques and the high performance of isolated-unit training. Evaluating on the established for the task of phone segmentation TIMIT database, we achieved an accuracy of 83.56%, which corresponds to improving the baseline system's accuracy by 6.09%.
引用
收藏
页码:4457 / 4460
页数:4
相关论文
共 50 条
  • [41] The use of articulator motion information in automatic speech segmentation
    Akdemir, Eren
    Ciloglu, Tolga
    [J]. SPEECH COMMUNICATION, 2008, 50 (07) : 594 - 604
  • [42] Automatic Speech Segmentation and Multi Level Labeling Tool
    Kumar, R. Ravindra
    Sulochana, K. G.
    Stephen, Jose
    [J]. INFORMATION SYSTEMS FOR INDIAN LANGUAGES, 2011, 139 : 9 - 14
  • [43] Sentence-Level Automatic Speech Segmentation for Amharic
    Tamiru, Rahel Mekonen
    Abate, Solomon Teferra
    [J]. PROCEEDINGS OF SIXTH INTERNATIONAL CONGRESS ON INFORMATION AND COMMUNICATION TECHNOLOGY (ICICT 2021), VOL 2, 2022, 236 : 477 - 485
  • [44] Automatic speech segmentation with the application of the Czech TTS system
    Horák, P
    Hesounová, B
    [J]. TEXT, SPEECH AND DIALOGUE, PROCEEDINGS, 2000, 1902 : 201 - 206
  • [45] Automatic Acoustic Segmentation for Speech Recognition on Broadcast Recordings
    Peng, Gang
    Hwang, Mei-Yuh
    Ostendorf, Mari
    [J]. INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 2580 - 2583
  • [46] A hierarchical method of automatic speech segmentation for synthesis applications
    Pauws, S
    Kamp, Y
    Willems, L
    [J]. SPEECH COMMUNICATION, 1996, 19 (03) : 207 - 220
  • [47] Automatic speech segmentation for an open vocabulary recognition system
    Ban, L
    Tatai, P
    [J]. SIGNAL ANALYSIS & PREDICTION I, 1997, : 303 - 306
  • [48] An evaluation of automatic phone segmentation for concatenative speech synthesis
    Kawai, H
    Toda, T
    [J]. 2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING, 2004, : 677 - 680
  • [49] Game Theoretic Approach for Automatic Speech Segmentation and Recognition
    Rekha, J. Ujwala
    Chatrapati, K. Shahu
    Babu, A. Vinaya
    [J]. 2014 IEEE 28TH CONVENTION OF ELECTRICAL & ELECTRONICS ENGINEERS IN ISRAEL (IEEEI), 2014,
  • [50] Neural network boundary refining for automatic speech segmentation
    Toledano, DT
    [J]. 2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS, VOLS I-VI, 2000, : 3438 - 3441