The Asian Network-based Speech-to-Speech Translation System

Cited by: 6
Authors
Sakti, Sakriani [1]
Kimura, Noriyuki [1]
Paul, Michael [1]
Hori, Chiori [1]
Sumita, Eiichiro [1]
Nakamura, Satoshi [1]
Park, Jun [2]
Wutiwiwatchai, Chai [3]
Xu, Bo [4]
Riza, Hammam [5]
Arora, Karunesh [6]
Luong, Chi Mai [7]
Li, Haizhou [8]
Affiliations
[1] Natl Inst Informat & Commun Technol NICT, Tokyo, Japan
[2] ETRI, Seoul, South Korea
[3] NECTEC, Bangkok, Thailand
[4] CASIA, Beijing, Peoples R China
[5] BPPT, Jakarta, Indonesia
[6] CDAC, New Delhi, India
[7] IOIT, Hanoi, Vietnam
[8] I2R, Singapore, Singapore
DOI
10.1109/ASRU.2009.5373353
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
This paper outlines the first Asian network-based speech-to-speech translation system developed by the Asian Speech Translation Advanced Research (A-STAR) consortium. The system was designed to translate common spoken utterances of travel conversations from a certain source language into multiple target languages in order to facilitate multiparty travel conversations between people speaking different Asian languages. Each A-STAR member contributes one or more of the following spoken language technologies: automatic speech recognition, machine translation, and text-to-speech through Web servers. Currently, the system has successfully covered 9 languages - namely, 8 Asian languages (Hindi, Indonesian, Japanese, Korean, Malay, Thai, Vietnamese, Chinese) and additionally, the English language. The system's domain covers about 20,000 travel expressions, including proper nouns that are names of famous places or attractions in Asian countries. In this paper, we discuss the difficulties involved in connecting various different spoken language translation systems through Web servers. We also present speech-translation results on the first A-STAR demo experiments carried out in July 2009.
Pages: 507 / +
Number of pages: 2