Unsupervised features from text for speech synthesis in a speech-to-speech translation system

被引：0

作者：

Watts, Oliver ^{[1
]}

Zhou, Bowen ^{[1
]}

机构：

[1] Univ Edinburgh, Ctr Speech Technol Res, Edinburgh EH8 9YL, Midlothian, Scotland

来源：

12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5 | 2011年

关键词：

speech synthesis;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

We explore the use of linguistic features for text to speech (ITS) conversion in the context of a speech-to-speech translation system that can be extracted from unannotated text in an unsupervised, language-independent fashion. The features are intended to act as surrogates for conventional part of speech (POS) features. Unlike POS features, the experimental features assume only the availability of tools and data that must already be in place for the construction of other components of the translation system, and can therefore be used for the TTS module without incurring additional TTS-specific costs. We here describe the use of the experimental features in a speech synthesiser, using six different configurations of the system to allow the comparison of the proposed features with conventional, knowledge-based POS features. We present results of objective and subjective evaluations of the usefulness of the new features.

引用

页码：2164 / 2167

页数：4

共 50 条

[1] AN ANALYSIS OF MACHINE TRANSLATION AND SPEECH SYNTHESIS IN SPEECH-TO-SPEECH TRANSLATION SYSTEM
Hashimoto, Kei
Yamagishi, Junichi
Byrne, William
King, Simon
Tokuda, Keiichi
[J]. 2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 5108 - 5111
[2] Statistical Vowelization of Arabic Text for Speech Synthesis in Speech-to-Speech Translation Systems
Gu, Liang
Zhang, Wei
Tahir, Lazkin
Gao, Yuqing
[J]. INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 509 - 512
[3] Impacts of machine translation and speech synthesis on speech-to-speech translation
Hashimoto, Kei
Yamagishi, Junichi
Byrne, William
King, Simon
Tokuda, Keiichi
[J]. SPEECH COMMUNICATION, 2012, 54 (07) : 857 - 866
[4] AUTOMATIC PRONUNCIATION PREDICTION FOR TEXT-TO-SPEECH SYNTHESIS OF DIALECTAL ARABIC IN A SPEECH-TO-SPEECH TRANSLATION SYSTEM
Ananthakrishnan, Sankaranarayanan
Tsakalidis, Stavros
Prasad, Rohit
Natarajan, Prem
Vembu, Aravind Namandi
[J]. 2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 4957 - 4960
[5] The NESPOLE! speech-to-speech translation system
Lavie, A
Levin, L
Frederking, R
Pianesi, F
[J]. MACHINE TRANSLATION: FROM RESEARCH TO REAL USERS, 2002, 2499 : 240 - 243
[6] Unsupervised training for Farsi-English speech-to-speech translation
Xiang, Bing
Deng, Yonggang
Gao, Yuqing
[J]. 2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 4977 - 4980
[7] Multilingual speech-to-speech translation system: VoiceTra
Matsuda, Shigeki
Hu, Xinhui
Shiga, Yoshinori
Kashioka, Hideki
Hori, Chiori
Yasuda, Keiji
Okuma, Hideo
Uchiyama, Masao
Sumita, Eiichiro
Kawai, Hisashi
Nakamura, Satoshi
[J]. 2013 IEEE 14TH INTERNATIONAL CONFERENCE ON MOBILE DATA MANAGEMENT (MDM 2013), VOL 2, 2013, : 229 - 233
[8] The ATR multilingual speech-to-speech translation system
Nakamura, S
Markov, K
Nakaiwa, H
Kikui, G
Kawai, H
Jitsuhiro, T
Zhang, JS
Yamamoto, H
Sumita, E
Yamamoto, S
[J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2006, 14 (02): : 365 - 376
[9] From Speech-to-Speech Translation to Automatic Dubbing
Federico, Marcello
Enyedi, Robert
Barra-Chicote, Roberto
Giri, Ritwik
Isik, Umut
Krishnaswamy, Arvindh
Sawaf, Hassan
[J]. 17TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE TRANSLATION (IWSLT 2020), 2020, : 257 - 264
[10] Predicting dialogue acts for a speech-to-speech translation system
Reithinger, N
Engel, R
Kipp, M
Klesen, M
[J]. ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 654 - 657

← 1 2 3 4 5 →