Multilingual Web Conferencing Using Speech-to-Speech Translation

被引:0
|
作者
Chen, John [1 ]
Wen, Shufei [1 ]
Sridhar, Vivek Kumar Rangarajan [1 ]
Bangalore, Srinivas [1 ]
机构
[1] AT&T Labs Res, 180 Pk Ave, Florham Pk, NJ 07932 USA
关键词
Web conferencing; simultaneous speech-to-speech translation; session initiation protocol (SIP);
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
It is now commonplace to use web conferencing technology in order to hold meetings between participants situated in different physical locations. A drawback of this technology is that nearly all of the interaction between these participants is monolingual. Here, we demonstrate a novel form of this technology that enables cross-lingual speech-to-speech communication between conference participants in real time. We model this translation problem as a combination of incremental speech recognition and segmentation, addressing the question of finding which segmentation strategy maximizes translation accuracy while minimizing latency. Our demonstration takes the form of a web conferencing scenario where a presenter speaks in one language while talk participants listen to or read the speaker's translated texts in real time. This system is flexible enough to allow real-time translation of technical talks or speeches covering broad topics.
引用
收藏
页码:1860 / 1862
页数:3
相关论文
共 50 条
  • [41] Speech-to-speech translation services for the Olympic Games 2008
    Stueker, Sebastian
    Zong, Chengqing
    Reichert, Juergen
    Cao, Wenjie
    Kolss, Muntsin
    Xie, Guodong
    Peterson, Kay
    Ding, Peng
    Arranz, Victoria
    Yu, Jian
    Waibel, Alex
    [J]. MACHINE LEARNING FOR MULTIMODAL INTERACTION, 2006, 4299 : 297 - +
  • [42] A hand-held speech-to-speech translation system
    Zhou, BW
    Gao, YQ
    Sorensen, J
    Déchelotte, D
    Picheny, M
    [J]. ASRU'03: 2003 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING ASRU '03, 2003, : 664 - 669
  • [43] Speech-to-speech translation software on PDAs for travel conversation
    Isotani, Ryosuke
    Yamabana, Kiyoshi
    Ando, Shinichi
    Hanazawa, Ken
    Ishikawa, Shin-Ya
    Iso, Ken-Ichi
    [J]. NEC Research and Development, 2003, 44 (SPEC.): : 197 - 202
  • [44] Speech-to-speech translation software on PDAs for travel conversation
    Isotani, R
    Yamabana, K
    Ando, S
    Hanazawa, K
    Ishikawa, S
    Iso, K
    [J]. NEC RESEARCH & DEVELOPMENT, 2003, 44 (02): : 197 - 202
  • [45] TECNOPARLA - Speech technologies for Catalan and its application to Speech-to-speech Translation
    Schulz, Henrik
    Costa-Jussa, Marta R.
    Fonollosa, Jose A. R.
    [J]. PROCESAMIENTO DEL LENGUAJE NATURAL, 2008, (41): : 319 - 320
  • [46] NAME AWARE SPEECH-TO-SPEECH TRANSLATION FOR ENGLISH/IRAQI
    Prasad, Rohit
    Moran, Christine
    Choi, Fred
    Meermeier, Ralf
    Saleem, Shirin
    Kao, Chia-lin
    Stallard, Dave
    Natarajan, Prem
    [J]. 2008 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY: SLT 2008, PROCEEDINGS, 2008, : 249 - 252
  • [47] Real-time speech-to-speech translation for PDAs
    Prasad, R.
    Krstovski, K.
    Choi, F.
    Saleem, S.
    Natarajan, P.
    Decerbo, M.
    Stallard, D.
    [J]. 2007 IEEE INTERNATIONAL CONFERENCE ON PORTABLE INFORMATION DEVICES, 2007, : 95 - 99
  • [48] Input segmentation of spontaneous speech in JANUS: A speech-to-speech translation system
    Lavie, A
    Gates, D
    Coccaro, N
    Levin, L
    [J]. DIALOGUE PROCESSING IN SPOKEN LANGUAGE SYSTEMS, 1997, 1236 : 86 - 99
  • [49] Enriching machine-mediated speech-to-speech translation using contextual information
    Sridhar, Vivek Kumar Rangarajan
    Bangalore, Srinivas
    Narayanan, Shrikanth
    [J]. COMPUTER SPEECH AND LANGUAGE, 2013, 27 (02): : 492 - 508
  • [50] Approach toward speech-to-speech translation system by using a collection of sentences and utterances
    Sumita, E
    Nakaiwa, H
    Kikui, G
    Yamamoto, S
    [J]. ASRU'03: 2003 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING ASRU '03, 2003, : 652 - 657