ISOMETRIC MT: NEURAL MACHINE TRANSLATION FOR AUTOMATIC DUBBING

被引:4
|
作者
Lakew, Surafel M. [1 ]
Virkar, Yogesh [1 ]
Mathur, Prashant [1 ]
Federico, Marcello [1 ]
机构
[1] Amazon AI, Seattle, WA 98109 USA
关键词
Machine Translation; Isometric Translation; Automatic Dubbing;
D O I
10.1109/ICASSP43922.2022.9747023
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Automatic dubbing (AD) is among the machine translation (MT) use cases where translations should match a given length to allow for synchronicity between source and target speech. For neural MT, generating translations of length close to the source length (e.g. within +/- 10% in character count), while preserving quality is a challenging task. Controlling MT output length comes at a cost to translation quality, which is usually mitigated with a two step approach of generating N-best hypotheses and then re-ranking based on length and quality. This work introduces a self-learning approach that allows a transformer model to directly learn to generate outputs that closely match the source length, in short Isometric MT. In particular, our approach does not require to generate multiple hypotheses nor any auxiliary ranking function. We report results on four language pairs (English -> French, Italian, German, Spanish) with a publicly available benchmark. Automatic and manual evaluations show that our method for Isometric MT outperforms more complex approaches proposed in the literature.
引用
收藏
页码:6242 / 6246
页数:5
相关论文
共 50 条
  • [1] Isochrony-Aware Neural Machine Translation for Automatic Dubbing
    Tam, Derek
    Lakew, Surafel M.
    Virkar, Yogesh
    Mathur, Prashant
    Federico, Marcello
    INTERSPEECH 2022, 2022, : 1776 - 1780
  • [2] MACHINE TRANSLATION VERBOSITY CONTROL FOR AUTOMATIC DUBBING
    Lakew, Surafel M.
    Federico, Marcello
    Wang, Yue
    Hoang, Cuong
    Virkar, Yogesh
    Barra-Chicote, Roberto
    Enyedi, Robert
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 7538 - 7542
  • [3] Integration of Dubbing Constraints into Machine Translation
    Saboo, Ashutosh
    Baumann, Timo
    FOURTH CONFERENCE ON MACHINE TRANSLATION (WMT 2019), VOL 1: RESEARCH PAPERS, 2019, : 94 - 101
  • [4] Democratizing neural machine translation with OPUS-MT
    Tiedemann, Jorg
    Aulamo, Mikko
    Bakshandaeva, Daria
    Boggia, Michele
    Gronroos, Stig-Arne
    Nieminen, Tommi
    Raganato, Alessandro
    Scherrer, Yves
    Vazquez, Raul
    Virpioja, Sami
    LANGUAGE RESOURCES AND EVALUATION, 2024, 58 (02) : 713 - 755
  • [5] The application of machine translation in automatic dubbing in China: A case study of the feature film Mulan
    Jin, Haina
    Yuan, Zichen
    LINGUISTICA ANTVERPIENSIA NEW SERIES-THEMES IN TRANSLATION STUDIES, 2023, 22 : 80 - 94
  • [6] From Speech-to-Speech Translation to Automatic Dubbing
    Federico, Marcello
    Enyedi, Robert
    Barra-Chicote, Roberto
    Giri, Ritwik
    Isik, Umut
    Krishnaswamy, Arvindh
    Sawaf, Hassan
    17TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE TRANSLATION (IWSLT 2020), 2020, : 257 - 264
  • [7] DURATION MODELING OF NEURAL TTS FOR AUTOMATIC DUBBING
    Effendi, Johanes
    Virkar, Yogesh
    Barra-Chicote, Roberto
    Federico, Marcello
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 8037 - 8041
  • [8] Automatic Long Sentence Segmentation for Neural Machine Translation
    Kuang, Shaohui
    Xiong, Deyi
    NATURAL LANGUAGE UNDERSTANDING AND INTELLIGENT APPLICATIONS (NLPCC 2016), 2016, 10102 : 162 - 174
  • [9] Automatic Evaluation and Analysis of Idioms in Neural Machine Translation
    Baziotis, Christos
    Mathur, Prashant
    Hasler, Eva
    17TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EACL 2023, 2023, : 3682 - 3700
  • [10] An Empirical Study on Automatic Post Editing for Neural Machine Translation
    Moon, Hyeonseok
    Park, Chanjun
    Eo, Sugyeong
    Seo, Jaehyung
    Lim, Heuiseok
    IEEE ACCESS, 2021, 9 : 123754 - 123763