Multilingual speech-to-speech translation system: VoiceTra

被引:14
|
作者
Matsuda, Shigeki [1 ]
Hu, Xinhui [1 ]
Shiga, Yoshinori [1 ]
Kashioka, Hideki [1 ]
Hori, Chiori [1 ]
Yasuda, Keiji [1 ]
Okuma, Hideo [1 ]
Uchiyama, Masao [1 ]
Sumita, Eiichiro [1 ]
Kawai, Hisashi [1 ]
Nakamura, Satoshi [1 ]
机构
[1] Natl Inst Informat & Commun Technol, Universal Commun Res Inst, Kyoto, Japan
关键词
D O I
10.1109/MDM.2013.99
中图分类号
TN [电子技术、通信技术];
学科分类号
0809 ;
摘要
This study presents an overview of VoiceTra, which was developed by NICT and released as the world's first network-based multilingual speech-to-speech translation system for smartphones, and describes in detail its multilingual speech recognition, its multilingual translation, and its multilingual speech synthesis in regards to field experiments. We show the effects of system updates using the data collected from field experiments to improve our acoustic and language models.
引用
收藏
页码:229 / 233
页数:5
相关论文
共 50 条
  • [41] An ARM-based embedded system design for speech-to-speech translation
    Lin, Shun-Chieh
    Wang, Jhing-Fa
    Wang, Jia-Ching
    Yang, Hsueh-Wei
    [J]. EMBEDDED AND UBIQUITOUS COMPUTING, PROCEEDINGS, 2006, 4096 : 499 - 508
  • [42] ASSESSING EVALUATION METRICS FOR SPEECH-TO-SPEECH TRANSLATION
    Salesky, Elizabeth
    Maeder, Julian
    Klinger, Severin
    [J]. 2021 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU), 2021, : 733 - 740
  • [43] Finite-state speech-to-speech translation
    Vidal, E
    [J]. 1997 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I - V: VOL I: PLENARY, EXPERT SUMMARIES, SPECIAL, AUDIO, UNDERWATER ACOUSTICS, VLSI; VOL II: SPEECH PROCESSING; VOL III: SPEECH PROCESSING, DIGITAL SIGNAL PROCESSING; VOL IV: MULTIDIMENSIONAL SIGNAL PROCESSING, NEURAL NETWORKS - VOL V: STATISTICAL SIGNAL AND ARRAY PROCESSING, APPLICATIONS, 1997, : 111 - 114
  • [44] A speech-to-speech translation based interface for tourism
    Cettolo, M
    Corazza, A
    Lazzari, G
    Pianesi, F
    Pianta, E
    Tovena, LM
    [J]. INFORMATION AND COMMUNICATION TECHNOLOGIES IN TOURISM 1999, 1999, : 191 - 200
  • [45] Incremental Dialog Clustering For Speech-to-Speech Translation
    Stallard, David
    Tsakalidis, Stavros
    Saleem, Shirin
    [J]. INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 428 - 431
  • [46] Textless Speech-to-Speech Translation on Real Data
    Lee, Ann
    Gong, Hongyu
    Duquenne, Paul-Ambroise
    Schwenk, Holger
    Chen, Peng-Jen
    Wang, Changhan
    Popuri, Sravya
    Adi, Yossi
    Pino, Juan
    Gu, Jiatao
    Hsu, Wei-Ning
    [J]. NAACL 2022: THE 2022 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES, 2022, : 860 - 872
  • [47] Applications of Language Modeling in Speech-To-Speech Translation
    Fu-Hua Liu
    Liang Gu
    Yuqing Gao
    Michael Picheny
    [J]. International Journal of Speech Technology, 2004, 7 (2-3) : 221 - 229
  • [48] INTENT TRANSFER IN SPEECH-TO-SPEECH MACHINE TRANSLATION
    Anumanchipalli, Gopala Krishna
    Oliveira, Luis C.
    Black, Alan W.
    [J]. 2012 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY (SLT 2012), 2012, : 153 - 158
  • [49] Hindi-English Speech-to-Speech Translation System for Travel Expressions
    Mrinalini, K.
    Vijayalakshmi, P.
    [J]. 2015 INTERNATIONAL CONFERENCE ON COMPUTATION OF POWER, ENERGY, INFORMATION AND COMMUNICATION (ICCPEIC), 2015, : 250 - 255
  • [50] BBN Trans Talk: Robust multilingual two-way speech-to-speech translation for mobile platforms
    Prasad, Rohit
    Natarajan, Prem
    Stallard, David
    Saleem, Shirin
    Ananthakrishnan, Shankar
    Tsakalidis, Stavros
    Kao, Chia-lin
    Choi, Fred
    Meermeier, Ralf
    Rawls, Mark
    Devlin, Jacob
    Krstovski, Kriste
    Challenner, Aaron
    [J]. COMPUTER SPEECH AND LANGUAGE, 2013, 27 (02): : 475 - 491