An evaluation of automatic phone segmentation for concatenative speech synthesis

Cited by: 0

Authors:

Kawai, H

Toda, T

Source:

2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING | 2004

DOI:

Not available

Chinese Library Classification number:

O42 [Acoustics];

Discipline classification code:

070206 ; 082403 ;

Abstract:

This paper studies the performance of automatic phone segmentation from two viewpoints: (1) temporal precision and (2) effect on the naturalness of synthetic speech. The absolute error of the phone onset time was 4.6 ms for the best 90% and 25.9 ms for the worst 10%. These values are comparable to the discrepancies among human labelers. However, perception tests that pair-compared the naturalness of synthetic speech generated from hand-segmented data with that generated from auto-segmented data found the latter to be statistically inferior.

Pages: 677-680

Number of pages: 4

50 items in total

[1] Automatic Labeling Schemes for Concatenative Speech Synthesis
Kacur, Juraj
Cepko, Jozef
Palenik, Andrej
[J]. PROCEEDINGS ELMAR-2008, VOLS 1 AND 2, 2008, : 639 - 642
[2] Acoustic speech unit segmentation for concatenative synthesis
Torres, H. M.
Gurlekian, J. A.
[J]. COMPUTER SPEECH AND LANGUAGE, 2008, 22 (02) : 196 - 206
[3] Automatic phone segmentation of expressive speech
Charonnat, Laure
Vidal, Gaelle
Boeffard, Olivier
[J]. SIXTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, LREC 2008, 2008, : 2376 - 2379
[4] Automatic segmentation for construction of signal dictionary in concatenative synthesis
Chowdhury, S
Datta, AK
Chaudhuri, BB
[J]. 6TH WORLD MULTICONFERENCE ON SYSTEMICS, CYBERNETICS AND INFORMATICS, VOL III, PROCEEDINGS: IMAGE, ACOUSTIC, SPEECH AND SIGNAL PROCESSING I, 2002, : 237 - 240
[5] Automatic phone segmentation and labeling of continuous speech
Jeong, CG
Jeong, H
[J]. SPEECH COMMUNICATION, 1996, 20 (3-4) : 291 - 311
[6] Archisegment-based letter-to-phone conversion for concatenative speech synthesis in Portuguese
Albano, EC
Moreira, AA
[J]. ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 1708 - 1711
[7] Perceptual evaluation of cost for segment selection in concatenative speech synthesis
Toda, T
Kawai, H
Tsuzaki, M
Shikano, K
[J]. PROCEEDINGS OF THE 2002 IEEE WORKSHOP ON SPEECH SYNTHESIS, 2002, : 183 - 186
[8] Automatic speech segmentation to improve speech synthesis performance
[J]. 1600, IEEE Computer Society
[9] SET OF CONCATENATIVE UNITS FOR SPEECH SYNTHESIS
OLIVE, J
LIBERMAN, M
[J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1979, 65 : S130 - S130
[10] On the detection of discontinuities in concatenative speech synthesis
Pantazis, Yannis
Stylianou, Yannis
[J]. PROGRESS IN NONLINEAR SPEECH PROCESSING, 2007, 4391 : 89 - +
