Beyond Triplet: Leveraging the Most Data for Multimodal Machine Translation

被引:0
|
作者
Zhu, Yaoming [1 ]
Sun, Zewei [1 ]
Cheng, Shanbo [1 ]
Huang, Luyang [1 ]
Wu, Liwei [1 ]
Wang, Mingxuan [1 ]
机构
[1] ByteDance, Shenzhen, Peoples R China
来源
FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023 | 2023年
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recent work has questioned the necessity of visual information in Multimodal Machine Translation (MMT). This paper tries to answer this question and build a new benchmark in this work. As the available dataset is simple and the text input is self-sufficient, we introduce a challenging dataset called EMMT, whose test-set is deliberately designed to ensure ambiguity. More importantly, we study this problem in a real-word scenario towards making the most of multimodal training data. We propose a new framework 2/3-Triplet which can naturally make full use of large-scale image-text and parallel text-only data. Extensive experiments show that visual information is highly crucial in EMMT. The proposed 2/3-Triplet outperforms the strong text-only competitor by 3.8 BLEU score, and even bypasses a commercial translation system. (1)
引用
收藏
页码:2679 / 2697
页数:19
相关论文
共 50 条
  • [1] Multimodal Transformer for Multimodal Machine Translation
    Yao, Shaowei
    Wan, Xiaojun
    58TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2020), 2020, : 4346 - 4350
  • [2] Small Data, Big Impact: Leveraging Minimal Data for Effective Machine Translation
    Maillard, Jean
    Gao, Cynthia
    Kalbassi, Elahe
    Sadagopan, Kaushik Ram
    Goswami, Vedanuj
    Koehn, Philipp
    Fan, Angela
    Guzman, Francisco
    PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023, VOL 1, 2023, : 2740 - 2756
  • [3] Multimodal Image Translation Network Leveraging Total-Body PET Data
    Jin, Yuxi
    Li, Zhihua
    Li, Qingneng
    Zhou, Chao
    Chen, Zixiang
    Huang, Zhenxing
    Zhang, Na
    Zhang, Xu
    Fan, Wei
    Yuan, Jianmin
    He, Qiang
    Zhang, Wei-guang
    Yang, Yongfeng
    Liang, Dong
    Zheng, Hairong
    Hu, Zhanli
    JOURNAL OF NUCLEAR MEDICINE, 2024, 65
  • [4] Exploiting Target Language Data for Neural Machine Translation Beyond Back Translation
    Reheman, Abudurexiti
    Lu, Yingfeng
    Ruan, Junhao
    Ma, Anxiang
    Zhang, Chunliang
    Xiao, Tong
    Zhu, Jingbo
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: ACL 2024, 2024, : 12216 - 12228
  • [5] Leveraging Synthetic Targets for Machine Translation
    Mittal, Sarthak
    Hrinchuk, Oleksii
    Kuchaiev, Oleksii
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2023), 2023, : 9365 - 9379
  • [6] Leveraging Monolingual Data with Self-Supervision for Multilingual Neural Machine Translation
    Siddhant, Aditya
    Bapna, Ankur
    Cao, Yuan
    Firat, Orhan
    Chen, Mia
    Kudungunta, Sneha
    Arivazhagan, Naveen
    Wu, Yonghui
    58TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2020), 2020, : 2827 - 2835
  • [7] Adversarial Evaluation of Multimodal Machine Translation
    Elliott, Desmond
    2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018), 2018, : 2974 - 2978
  • [8] 5. Multimodal machine translation
    Nakayama H.
    Kyokai Joho Imeji Zasshi/Journal of the Institute of Image Information and Television Engineers, 2018, 72 (05): : 668 - 671
  • [9] Multimodal Comparable Corpora for Machine Translation
    Afli, Haithem
    Barrault, Loic
    Schwenk, Holger
    LREC 2014 - NINTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2014,
  • [10] On Vision Features in Multimodal Machine Translation
    Li, Bei
    Lv, Chuanhao
    Zhou, Zefan
    Zhou, Tao
    Xiao, Tong
    Ma, Anxiang
    Zhu, Jingbo
    PROCEEDINGS OF THE 60TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), VOL 1: (LONG PAPERS), 2022, : 6327 - 6337