共 50 条
- [1] A Framework for Vision-Language Warm-up Tasks in Multimodal Dialogue Models 2023 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING, EMNLP 2023, 2023, : 2789 - 2799
- [4] Deep Learning for Language and Vision Tasks in Surveillance Applications COMPUTACION Y SISTEMAS, 2021, 25 (02): : 317 - 328
- [6] VLATTACK: Multimodal Adversarial Attacks on Vision-Language Tasks via Pre-trained Models ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
- [8] A Survey on Multimodal Deep Learning for Image Synthesis Applications, methods, datasets, evaluation metrics, and results comparison 2021 5TH INTERNATIONAL CONFERENCE ON INNOVATION IN ARTIFICIAL INTELLIGENCE (ICIAI 2021), 2021, : 108 - 120
- [9] A survey on deep multimodal learning for computer vision: advances, trends, applications, and datasets The Visual Computer, 2022, 38 : 2939 - 2970
- [10] A survey on deep multimodal learning for computer vision: advances, trends, applications, and datasets VISUAL COMPUTER, 2022, 38 (08): : 2939 - 2970