Unsupervised features from text for speech synthesis in a speech-to-speech translation system

Cited by: 0

Authors:

Watts, Oliver [1]

Zhou, Bowen [1]

Affiliations:

[1] Univ Edinburgh, Ctr Speech Technol Res, Edinburgh EH8 9YL, Midlothian, Scotland

Source:

12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5 | 2011

Keywords:

speech synthesis;

DOI:

Not available

Chinese Library Classification number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

We explore the use of linguistic features for text-to-speech (TTS) conversion, in the context of a speech-to-speech translation system, that can be extracted from unannotated text in an unsupervised, language-independent fashion. The features are intended to act as surrogates for conventional part-of-speech (POS) features. Unlike POS features, the experimental features assume only the availability of tools and data that must already be in place for the construction of other components of the translation system, and can therefore be used for the TTS module without incurring additional TTS-specific costs. We describe the use of the experimental features in a speech synthesiser, using six different configurations of the system to allow comparison of the proposed features with conventional, knowledge-based POS features. We present the results of objective and subjective evaluations of the usefulness of the new features.

Pages: 2164-2167

Number of pages: 4
