共 50 条
- [33] Multi-modal fusion for video understanding 30TH APPLIED IMAGERY PATTERN RECOGNITION WORKSHOP, PROCEEDINGS: ANALYSIS AND UNDERSTANDING OF TIME VARYING IMAGERY, 2001, : 103 - 108
- [34] Contextualized Keyword Representations for Multi-modal Retinal Image Captioning PROCEEDINGS OF THE 2021 INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL (ICMR '21), 2021, : 645 - 652
- [35] The CropAndWeed Dataset: a Multi-Modal Learning Approach for Efficient Crop and Weed Manipulation 2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, : 3718 - 3727
- [36] Towards Unified Multi-modal Dataset Creation for Deep Learning Utilizing Structured Reports BILDVERARBEITUNG FUR DIE MEDIZIN 2024, 2024, : 130 - 135
- [38] Automated Multi-Modal Video Editing for Ads Video PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 4823 - 4827