SEQUENTIAL SYSTEM COMBINATION FOR MACHINE TRANSLATION OF SPEECH

Authors
Karakos, Damianos [1 ]
Khudanpur, Sanjeev [1 ]
Affiliations
[1] Johns Hopkins Univ, Ctr Language & Speech Proc, Baltimore, MD 21218 USA
Keywords
Machine Translation; System Combination; Confusion Networks; Alignments with reordering
DOI
10.1109/SLT.2008.4777889
Chinese Library Classification
TP18 [Artificial intelligence theory]
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
System combination is a technique which has been shown to yield significant gains in speech recognition and machine translation. Most combination schemes perform an alignment between different system outputs in order to produce lattices (or confusion networks), from which a composite hypothesis is chosen, possibly with the help of a large language model. The benefit of this approach is two-fold: (i) whenever many systems agree with each other on a set of words, the combination output contains these words with high confidence; and (ii) whenever the systems disagree, the language model resolves the ambiguity based on the (probably correct) agreed-upon context. The case of machine translation system combination is more challenging because of the different word orders of the translations: the alignment has to incorporate computationally expensive movements of word blocks. In this paper, we show how one can combine translation outputs efficiently, extending the incremental alignment procedure of [1]. A comparison between different system combination design choices is performed on an Arabic speech translation task.
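The voting idea in point (i) of the abstract can be illustrated with a minimal sketch. It is not the paper's method: it assumes the hypotheses have already been aligned slot by slot (the hard part, which the paper addresses with reordering-aware incremental alignment), it uses no language model, and the names `EPS`, `build_confusion_network`, and `consensus` are hypothetical.

```python
from collections import Counter

EPS = "<eps>"  # hypothetical marker for an empty slot in the alignment


def build_confusion_network(aligned_hyps):
    """Turn equal-length, pre-aligned hypotheses into a list of slots.

    Each slot maps a word to the fraction of systems that proposed it.
    """
    lengths = {len(h) for h in aligned_hyps}
    if len(lengths) != 1:
        raise ValueError("hypotheses must be aligned to the same length")
    n = len(aligned_hyps)
    return [
        {word: count / n for word, count in Counter(slot).items()}
        for slot in zip(*aligned_hyps)
    ]


def consensus(network):
    """Pick the highest-weight word in each slot and drop empty slots."""
    best = [max(slot, key=slot.get) for slot in network]
    return [word for word in best if word != EPS]


# Three toy system outputs, assumed to be aligned already.
aligned = [
    ["the", "meeting", "starts", "at", "noon"],
    ["the", "meeting", "begins", "at", "noon"],
    ["the", EPS, "starts", "at", "noon"],
]
network = build_confusion_network(aligned)
result = consensus(network)
print(" ".join(result))
```

Words on which all systems agree receive weight 1.0; in slots where the systems disagree, a real combination system would rescore the alternatives with a language model rather than rely on the vote alone.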
Pages: 257 / +
Page count: 2
Related papers
50 in total
  • [41] Class-based Statistical Machine Translation for Field Maintainable Speech-To-Speech Translation
    Lane, Ian R.
    Waibel, Alex
    INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 2362 - 2365
  • [42] HYPOTHESIS RANKING AND TWO-PASS APPROACHES FOR MACHINE TRANSLATION SYSTEM COMBINATION
    Karakos, Damianos
    Smith, Jason
    Khudanpur, Sanjeev
    2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 5202 - 5205
  • [43] Talking to a System and Talking to a Human: A study from a Speech-to-Speech, Machine Translation mediated Map Task
    Akira, Hayakawa
    Luz, Saturnino
    Campbell, Nick
    17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 1422 - 1426
  • [44] Integration of Speech Recognition and Machine Translation in Computer-Assisted Translation
    Khadivi, Shahram
    Ney, Hermann
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2008, 16 (08): : 1551 - 1564
  • [45] Lost in Translation: Machine Translation and Text-To-Speech in Industry 4.0
    Haslwanter, Jean D. Hallewell
    Heiml, Michael
    Wolfartsberger, Josef
    12TH ACM INTERNATIONAL CONFERENCE ON PERVASIVE TECHNOLOGIES RELATED TO ASSISTIVE ENVIRONMENTS (PETRA 2019), 2019, : 333 - 342
  • [46] An automatic machine translation system for multi-lingual speech to Indian sign language
    Amandeep Singh Dhanjal
    Williamjeet Singh
    Multimedia Tools and Applications, 2022, 81 : 4283 - 4321
  • [47] An automatic machine translation system for multi-lingual speech to Indian sign language
    Dhanjal, Amandeep Singh
    Singh, Williamjeet
    MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (03) : 4283 - 4321
  • [48] Response Generation based on Statistical Machine Translation for Speech-Oriented Guidance System
    Nishimura, Kazuma
    Kawanami, Hiromichi
    Saruwatari, Hiroshi
    Shikano, Kiyohiro
    2012 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2012,
  • [49] Attempt Towards Stress Transfer in Speech-to-Speech Machine Translation
    Akarsh, Sai C.
    Narasinga, Vamshiraghusimha
    Mondal, Anindita
    Vuppala, Anil
    2024 INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATIONS, SPCOM 2024, 2024,
  • [50] CLASS-BASED NAMED ENTITY TRANSLATION IN A SPEECH TO SPEECH TRANSLATION SYSTEM
    Maskey, Sameer R.
    Cmejrek, Martin
    Zhou, Bowen
    Gao, Yuqing
    2008 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY: SLT 2008, PROCEEDINGS, 2008, : 253 - 256