An evaluation of automatic phone segmentation for concatenative speech synthesis

被引:0
|
作者
Kawai, H
Toda, T
机构
关键词
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This paper studies the performance of automatic phone segmentation from two viewpoints: (1) temporal precision and (2) effect on the naturalness of synthetic speech. The absolute error of the phone onset time for the best 90% and worst 10% were 4.6 ms and 25.9 ms, respectively. These values are comparable to discrepancies among human labelers. As the result of perception tests in which naturalness was pair-compared between synthetic speeches generated from hand-segmented data and from auto-segmented data, it was found that the latter is statistically inferior.
引用
收藏
页码:677 / 680
页数:4
相关论文
共 50 条
  • [1] Automatic Labeling Schemes for Concatenative Speech Synthesis
    Kacur, Juraj
    Cepko, Jozef
    Palenik, Andrej
    [J]. PROCEEDINGS ELMAR-2008, VOLS 1 AND 2, 2008, : 639 - 642
  • [2] Acoustic speech unit segmentation for concatenative synthesis
    Torres, H. M.
    Gurlekian, J. A.
    [J]. COMPUTER SPEECH AND LANGUAGE, 2008, 22 (02): : 196 - 206
  • [3] Automatic phone segmentation of expressive speech
    Charonnat, Laure
    Vidal, Gaelle
    Boeffard, Olivier
    [J]. SIXTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, LREC 2008, 2008, : 2376 - 2379
  • [4] Automatic segmentation for construction of signal dictionary in concatenative synthesis
    Chowdhury, S
    Datta, AK
    Chaudhuri, BB
    [J]. 6TH WORLD MULTICONFERENCE ON SYSTEMICS, CYBERNETICS AND INFORMATICS, VOL III, PROCEEDINGS: IMAGE, ACOUSTIC, SPEECH AND SIGNAL PROCESSING I, 2002, : 237 - 240
  • [5] Automatic phone segmentation and labeling of continuous speech
    Jeong, CG
    Jeong, H
    [J]. SPEECH COMMUNICATION, 1996, 20 (3-4) : 291 - 311
  • [6] Archisegment-based letter-to-phone conversion for concatenative speech synthesis in Portuguese
    Albano, EC
    Moreira, AA
    [J]. ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 1708 - 1711
  • [7] Perceptual evaluation of cost for segment selection in concatenative speech synthesis
    Toda, T
    Kawai, H
    Tsuzaki, M
    Shikano, K
    [J]. PROCEEDINGS OF THE 2002 IEEE WORKSHOP ON SPEECH SYNTHESIS, 2002, : 183 - 186
  • [9] SET OF CONCATENATIVE UNITS FOR SPEECH SYNTHESIS
    OLIVE, J
    LIBERMAN, M
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1979, 65 : S130 - S130
  • [10] On the detection of discontinuities in concatenative speech synthesis
    Pantazis, Yannis
    Stylianou, Yannis
    [J]. PROGRESS IN NONLINEAR SPEECH PROCESSING, 2007, 4391 : 89 - +