Speech to speech translation: a communication boon

Cited by: 0
Authors
Karunesh Arora
Sunita Arora
Mukund Kumar Roy
Affiliations
[1] NLP Lab
[2] CDAC
[3] Anusandhan Bhawan
Keywords
Speech to speech translation; ASR; Statistical MT; TTS; U-STAR
DOI
10.1007/s40012-013-0014-4
Abstract
An average person speaks 11,000–25,000 words per day, making speech the most common way we express ourselves. Whether in conversation, dialogue, speeches, presentations, or general talk, we use speech to help others, and ourselves, understand thoughts and actions. If either side does not know the language being used, the communication cycle remains incomplete, so a system is needed to bridge this language barrier. Speech-to-speech translation is one such system: it can play an important role by facilitating communication between people who speak different languages. Efforts are under way worldwide to achieve this goal and make it practical for everyday use. This paper describes a major international, inter-institutional effort in this direction, in which a consortium project led by NICT, Japan, attempts to automate speech translation among 23 Asian, Middle Eastern, and European languages, including Hindi [1, 2]. The three key modules required for Hindi, namely speech recognition, language translation, and speech synthesis, are being designed, developed, and implemented by CDAC, Noida, as the Indian counterpart in the project. The language-specific technology, the parallel corpora, and the speech unit (segmental) database that were developed are described. The technical details of this first-ever effort, its modules, and their performance in the communication system are discussed.
Pages: 207-213
Page count: 6
Related papers
  • [1] Impacts of machine translation and speech synthesis on speech-to-speech translation
    Hashimoto, Kei
    Yamagishi, Junichi
    Byrne, William
    King, Simon
    Tokuda, Keiichi
    [J]. SPEECH COMMUNICATION, 2012, 54 (07) : 857 - 866
  • [2] AN ANALYSIS OF MACHINE TRANSLATION AND SPEECH SYNTHESIS IN SPEECH-TO-SPEECH TRANSLATION SYSTEM
    Hashimoto, Kei
    Yamagishi, Junichi
    Byrne, William
    King, Simon
    Tokuda, Keiichi
    [J]. 2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 5108 - 5111
  • [3] Prototype of speech translation system for audio effective communication
    Bello, Richard Rojas
    Araya, Erick Araya
    Vidal, Luis Vidal
    [J]. PROFESSIONAL PRACTICE IN ARTIFICIAL INTELLIGENCE, 2006, 218 : 229 - +
  • [4] SPEECH COMMUNICATION AND SPEECH AUDIOMETRY
    KAPTEYN, TS
    [J]. AUDIOLOGY, 1971, 10 (03): 191 - &
  • [5] Towards Machine Speech-to-speech Translation
    Satoshi, Nakamura
    Sudoh, Katsuhito
    Sakti, Sakriani
    [J]. TRADUMATICA-TRADUCCIO I TECNOLOGIES DE LA INFORMACIO I LA COMUNICACIO, 2019, (17): 81 - 87
  • [6] Prosody generation for speech-to-speech translation
    Aguero, Pablo Daniel
    Adell, Jordi
    Bonafonte, Antonio
    [J]. 2006 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-13, 2006, : 557 - 560
  • [7] UWSpeech: Speech to Speech Translation for Unwritten Languages
    Zhang, Chen
    Tan, Xu
    Ren, Yi
    Qin, Tao
    Zhang, Kejun
    Liu, Tie-Yan
    [J]. THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 14319 - 14327
  • [8] Hierarchical Classification for Speech-to-Speech Translation
    Ettelaie, Emil
    Georgiou, Panayiotis G.
    Narayanan, Shrikanth S.
    [J]. 11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 3 AND 4, 2010, : 2534 - 2537
  • [9] Speech translation enhanced automatic speech recognition
    Paulik, M
    Stüker, S
    Fügen, C
    Schultz, T
    Schaaf, T
    Waibel, A
    [J]. 2005 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU), 2005, : 121 - 126
  • [10] Automatic Speech Segmentation for Automatic Speech Translation
    Klosowski, Piotr
    Dustor, Adam
    [J]. COMPUTER NETWORKS, CN 2013, 2013, 370 : 466 - 475