Comparative study of automatic phone segmentation methods for TTS

被引:0
|
作者
Adell, J [1 ]
Bonafonte, A [1 ]
Gómez, JA [1 ]
Castro, MJ [1 ]
机构
[1] Tech Univ Catalonia UPC, TALP Res Ctr, Dpt Signal Theory & Commun, Barcelona, Spain
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In the present paper we present two novel approaches to phonetic speech segmentation. One based on acoustical clustering plus dynamic time warping and a second one based on a boundary specific Correction by means of a decision tree. The use of objective or perceptual evaluations is discussed. Novel approaches clearly outperform objective results of the baseline system based on HMM. They get results similar to agreement between manual segmentations. We show how phonetic features can be successfully used for boundary detection together with HMMs. Finally, the need for perceptual tests in order to evaluate segmentation systems is pointed out.
引用
收藏
页码:309 / 312
页数:4
相关论文
共 50 条
  • [1] TOWARDS AUTOMATIC PHONETIC SEGMENTATION FOR TTS
    Rendel, Asaf
    Sorin, Alexander
    Hoory, Ron
    Breen, Andrew
    [J]. 2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 4533 - 4536
  • [2] A Comparative Study of Classification Methods for Automatic Multimodal Brain Tumor Segmentation
    El-Melegy, Moumen T.
    El-Magd, Khaled M. Abo
    Ali, Samia A.
    Hussain, Khaled F.
    Mahdy, Yousef B.
    [J]. PROCEEDINGS OF 2018 INTERNATIONAL CONFERENCE ON INNOVATIVE TRENDS IN COMPUTER ENGINEERING (ITCE' 2018), 2018, : 36 - 41
  • [3] Automatic speech segmentation with the application of the Czech TTS system
    Horák, P
    Hesounová, B
    [J]. TEXT, SPEECH AND DIALOGUE, PROCEEDINGS, 2000, 1902 : 201 - 206
  • [4] Automatic phone segmentation of expressive speech
    Charonnat, Laure
    Vidal, Gaelle
    Boeffard, Olivier
    [J]. SIXTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, LREC 2008, 2008, : 2376 - 2379
  • [5] Automatic Segmentation of Parasitic Sounds in Speech Corpora for TTS Synthesis
    Matousek, Jindrich
    [J]. TEXT, SPEECH AND DIALOGUE, 2010, 6231 : 369 - 376
  • [6] Hybrid model method for automatic segmentation of mandarin TTS corpus
    Yuan, Xiaoliang
    Dong, Yuan
    Huang, Dezhi
    Guo, Jun
    Wang, Haila
    [J]. INTELLIGENT COMPUTING IN SIGNAL PROCESSING AND PATTERN RECOGNITION, 2006, 345 : 906 - 912
  • [7] TEMPLATE BASED TECHNIQUES FOR AUTOMATIC SEGMENTATION OF TTS UNIT DATABASE
    Adithya, S.
    Rao, Sunil
    Mahima, C.
    Vishnu, S.
    Thippareddy, Mythri
    Ramasubramanian, V.
    [J]. 2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 5605 - 5609
  • [8] Automatic phone segmentation and labeling of continuous speech
    Jeong, CG
    Jeong, H
    [J]. SPEECH COMMUNICATION, 1996, 20 (3-4) : 291 - 311
  • [9] Comparative Study On Segmentation Methods Of Fundus Images
    Cao, Juan
    Liu, JinJia
    [J]. 2023 IEEE 12TH DATA DRIVEN CONTROL AND LEARNING SYSTEMS CONFERENCE, DDCLS, 2023, : 400 - 405
  • [10] Comparative Study of Retinal Vessel Segmentation Methods
    Justin, Judith
    Vanithamani, R.
    Christina, R. Renee
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND COMPUTING RESEARCH (ICCIC), 2015, : 67 - 69