Unsupervised features from text for speech synthesis in a speech-to-speech translation system

被引:0
|
作者
Watts, Oliver [1 ]
Zhou, Bowen [1 ]
机构
[1] Univ Edinburgh, Ctr Speech Technol Res, Edinburgh EH8 9YL, Midlothian, Scotland
关键词
speech synthesis;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We explore the use of linguistic features for text to speech (ITS) conversion in the context of a speech-to-speech translation system that can be extracted from unannotated text in an unsupervised, language-independent fashion. The features are intended to act as surrogates for conventional part of speech (POS) features. Unlike POS features, the experimental features assume only the availability of tools and data that must already be in place for the construction of other components of the translation system, and can therefore be used for the TTS module without incurring additional TTS-specific costs. We here describe the use of the experimental features in a speech synthesiser, using six different configurations of the system to allow the comparison of the proposed features with conventional, knowledge-based POS features. We present results of objective and subjective evaluations of the usefulness of the new features.
引用
收藏
页码:2164 / 2167
页数:4
相关论文
共 50 条
  • [41] INTENT TRANSFER IN SPEECH-TO-SPEECH MACHINE TRANSLATION
    Anumanchipalli, Gopala Krishna
    Oliveira, Luis C.
    Black, Alan W.
    [J]. 2012 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY (SLT 2012), 2012, : 153 - 158
  • [42] Leveraging unsupervised and weakly-supervised data to improve direct speech-to-speech translation
    Jia, Ye
    Ding, Yifan
    Bapna, Ankur
    Cherry, Colin
    Zhang, Yu
    Conneau, Alexis
    Morioka, Nobuyuki
    [J]. INTERSPEECH 2022, 2022, : 1721 - 1725
  • [43] Unsupervised Text-to-Speech Synthesis by Unsupervised Automatic Speech Recognition
    Ni, Junrui
    Wang, Liming
    Gao, Heting
    Qian, Kaizhi
    Zhang, Yang
    Chang, Shiyu
    Hasegawa-Johnson, Mark
    [J]. INTERSPEECH 2022, 2022, : 461 - 465
  • [44] The BBN 2007 Displayless English/Iraqi Speech-to-Speech Translation System
    Stallard, David
    Choi, Fred
    Kao, Chia-lin
    Krstovski, Kriste
    Natarajan, Prem
    Prasad, Rohit
    Saleem, Shirin
    Subramanian, Krishna
    [J]. INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 2284 - 2287
  • [45] OUT-OF-VOCABULARY WORD DETECTION IN A SPEECH-TO-SPEECH TRANSLATION SYSTEM
    Kuo, Hong-Kwang
    Kislal, Ellen Eide
    Mangu, Lidia
    Soltau, Hagen
    Beran, Tomas
    [J]. 2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [46] Design and implementation of a single-chip speech-to-speech translation system
    Wang, J. -F.
    Lin, S. -C.
    Wang, J. -C.
    Yang, H. -W.
    [J]. IEE PROCEEDINGS-CIRCUITS DEVICES AND SYSTEMS, 2006, 153 (05): : 416 - 426
  • [47] End-to-end evaluation in JANUS: A speech-to-speech translation system
    Gates, D
    Lavie, A
    Levin, L
    Waibel, A
    Gavalda, M
    Mayfield, L
    Woszczyna, M
    Zhan, PM
    [J]. DIALOGUE PROCESSING IN SPOKEN LANGUAGE SYSTEMS, 1997, 1236 : 195 - 206
  • [48] An ARM-based embedded system design for speech-to-speech translation
    Lin, Shun-Chieh
    Wang, Jhing-Fa
    Wang, Jia-Ching
    Yang, Hsueh-Wei
    [J]. EMBEDDED AND UBIQUITOUS COMPUTING, PROCEEDINGS, 2006, 4096 : 499 - 508
  • [49] TECNOPARLA - Speech technologies for Catalan and its application to Speech-to-speech Translation
    Schulz, Henrik
    Costa-Jussa, Marta R.
    Fonollosa, Jose A. R.
    [J]. PROCESAMIENTO DEL LENGUAJE NATURAL, 2008, (41): : 319 - 320
  • [50] Hindi-English Speech-to-Speech Translation System for Travel Expressions
    Mrinalini, K.
    Vijayalakshmi, P.
    [J]. 2015 INTERNATIONAL CONFERENCE ON COMPUTATION OF POWER, ENERGY, INFORMATION AND COMMUNICATION (ICCPEIC), 2015, : 250 - 255