Beyond Triplet: Leveraging the Most Data for Multimodal Machine Translation

被引：0

作者：

Zhu, Yaoming ^{[1
]}

Sun, Zewei ^{[1
]}

Cheng, Shanbo ^{[1
]}

Huang, Luyang ^{[1
]}

Wu, Liwei ^{[1
]}

Wang, Mingxuan ^{[1
]}

机构：

[1] ByteDance, Shenzhen, Peoples R China

来源：

FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023 | 2023年

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Recent work has questioned the necessity of visual information in Multimodal Machine Translation (MMT). This paper tries to answer this question and build a new benchmark in this work. As the available dataset is simple and the text input is self-sufficient, we introduce a challenging dataset called EMMT, whose test-set is deliberately designed to ensure ambiguity. More importantly, we study this problem in a real-word scenario towards making the most of multimodal training data. We propose a new framework 2/3-Triplet which can naturally make full use of large-scale image-text and parallel text-only data. Extensive experiments show that visual information is highly crucial in EMMT. The proposed 2/3-Triplet outperforms the strong text-only competitor by 3.8 BLEU score, and even bypasses a commercial translation system. (1)

引用

页码：2679 / 2697

页数：19

共 50 条

[1] Multimodal Transformer for Multimodal Machine Translation
Yao, Shaowei
Wan, Xiaojun
58TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2020), 2020, : 4346 - 4350
[2] Small Data, Big Impact: Leveraging Minimal Data for Effective Machine Translation
Maillard, Jean
Gao, Cynthia
Kalbassi, Elahe
Sadagopan, Kaushik Ram
Goswami, Vedanuj
Koehn, Philipp
Fan, Angela
Guzman, Francisco
PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023, VOL 1, 2023, : 2740 - 2756
[3] Multimodal Image Translation Network Leveraging Total-Body PET Data
Jin, Yuxi
Li, Zhihua
Li, Qingneng
Zhou, Chao
Chen, Zixiang
Huang, Zhenxing
Zhang, Na
Zhang, Xu
Fan, Wei
Yuan, Jianmin
He, Qiang
Zhang, Wei-guang
Yang, Yongfeng
Liang, Dong
Zheng, Hairong
Hu, Zhanli
JOURNAL OF NUCLEAR MEDICINE, 2024, 65
[4] Exploiting Target Language Data for Neural Machine Translation Beyond Back Translation
Reheman, Abudurexiti
Lu, Yingfeng
Ruan, Junhao
Ma, Anxiang
Zhang, Chunliang
Xiao, Tong
Zhu, Jingbo
FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: ACL 2024, 2024, : 12216 - 12228
[5] Leveraging Synthetic Targets for Machine Translation
Mittal, Sarthak
Hrinchuk, Oleksii
Kuchaiev, Oleksii
FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2023), 2023, : 9365 - 9379
[6] Leveraging Monolingual Data with Self-Supervision for Multilingual Neural Machine Translation
Siddhant, Aditya
Bapna, Ankur
Cao, Yuan
Firat, Orhan
Chen, Mia
Kudungunta, Sneha
Arivazhagan, Naveen
Wu, Yonghui
58TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2020), 2020, : 2827 - 2835
[7] Adversarial Evaluation of Multimodal Machine Translation
Elliott, Desmond
2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018), 2018, : 2974 - 2978
[8] 5. Multimodal machine translation
Nakayama H.
Kyokai Joho Imeji Zasshi/Journal of the Institute of Image Information and Television Engineers, 2018, 72 (05): : 668 - 671
[9] Multimodal Comparable Corpora for Machine Translation
Afli, Haithem
Barrault, Loic
Schwenk, Holger
LREC 2014 - NINTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2014,
[10] On Vision Features in Multimodal Machine Translation
Li, Bei
Lv, Chuanhao
Zhou, Zefan
Zhou, Tao
Xiao, Tong
Ma, Anxiang
Zhu, Jingbo
PROCEEDINGS OF THE 60TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), VOL 1: (LONG PAPERS), 2022, : 6327 - 6337

← 1 2 3 4 5 →