Multilingual Web Conferencing Using Speech-to-Speech Translation

Cited by: 0

Authors:

Chen, John ^{[1]}

Wen, Shufei ^{[1]}

Sridhar, Vivek Kumar Rangarajan ^{[1]}

Bangalore, Srinivas ^{[1]}

Affiliations:

[1] AT&T Labs Research, 180 Park Ave, Florham Park, NJ 07932, USA

Source:

14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5 | 2013

Keywords:

Web conferencing; simultaneous speech-to-speech translation; session initiation protocol (SIP)

DOI:

Not available

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory]

Discipline classification codes:

081104; 0812; 0835; 1405

Abstract:

It is now commonplace to use web conferencing technology in order to hold meetings between participants situated in different physical locations. A drawback of this technology is that nearly all of the interaction between these participants is monolingual. Here, we demonstrate a novel form of this technology that enables cross-lingual speech-to-speech communication between conference participants in real time. We model this translation problem as a combination of incremental speech recognition and segmentation, addressing the question of finding which segmentation strategy maximizes translation accuracy while minimizing latency. Our demonstration takes the form of a web conferencing scenario where a presenter speaks in one language while talk participants listen to or read the speaker's translated texts in real time. This system is flexible enough to allow real-time translation of technical talks or speeches covering broad topics.


Pages: 1860-1862

Page count: 3

50 items in total

[41] Speech-to-speech translation services for the Olympic Games 2008
Stueker, Sebastian
Zong, Chengqing
Reichert, Juergen
Cao, Wenjie
Kolss, Muntsin
Xie, Guodong
Peterson, Kay
Ding, Peng
Arranz, Victoria
Yu, Jian
Waibel, Alex
[J]. MACHINE LEARNING FOR MULTIMODAL INTERACTION, 2006, 4299: 297-+
[42] A hand-held speech-to-speech translation system
Zhou, BW
Gao, YQ
Sorensen, J
Déchelotte, D
Picheny, M
[J]. ASRU'03: 2003 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING ASRU '03, 2003: 664-669
[43] Speech-to-speech translation software on PDAs for travel conversation
Isotani, Ryosuke
Yamabana, Kiyoshi
Ando, Shinichi
Hanazawa, Ken
Ishikawa, Shin-Ya
Iso, Ken-Ichi
[J]. NEC Research and Development, 2003, 44 (SPEC.): 197-202
[44] Speech-to-speech translation software on PDAs for travel conversation
Isotani, R
Yamabana, K
Ando, S
Hanazawa, K
Ishikawa, S
Iso, K
[J]. NEC RESEARCH & DEVELOPMENT, 2003, 44 (02): 197-202
[45] TECNOPARLA - Speech technologies for Catalan and its application to Speech-to-speech Translation
Schulz, Henrik
Costa-Jussa, Marta R.
Fonollosa, Jose A. R.
[J]. PROCESAMIENTO DEL LENGUAJE NATURAL, 2008, (41): 319-320
[46] NAME AWARE SPEECH-TO-SPEECH TRANSLATION FOR ENGLISH/IRAQI
Prasad, Rohit
Moran, Christine
Choi, Fred
Meermeier, Ralf
Saleem, Shirin
Kao, Chia-lin
Stallard, Dave
Natarajan, Prem
[J]. 2008 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY: SLT 2008, PROCEEDINGS, 2008: 249-252
[47] Real-time speech-to-speech translation for PDAs
Prasad, R.
Krstovski, K.
Choi, F.
Saleem, S.
Natarajan, P.
Decerbo, M.
Stallard, D.
[J]. 2007 IEEE INTERNATIONAL CONFERENCE ON PORTABLE INFORMATION DEVICES, 2007: 95-99
[48] Input segmentation of spontaneous speech in JANUS: A speech-to-speech translation system
Lavie, A
Gates, D
Coccaro, N
Levin, L
[J]. DIALOGUE PROCESSING IN SPOKEN LANGUAGE SYSTEMS, 1997, 1236: 86-99
[49] Enriching machine-mediated speech-to-speech translation using contextual information
Sridhar, Vivek Kumar Rangarajan
Bangalore, Srinivas
Narayanan, Shrikanth
[J]. COMPUTER SPEECH AND LANGUAGE, 2013, 27 (02): 492-508
[50] Approach toward speech-to-speech translation system by using a collection of sentences and utterances
Sumita, E
Nakaiwa, H
Kikui, G
Yamamoto, S
[J]. ASRU'03: 2003 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING ASRU '03, 2003: 652-657
