共 50 条
- [42] Multi-modal fusion for video understanding 30TH APPLIED IMAGERY PATTERN RECOGNITION WORKSHOP, PROCEEDINGS: ANALYSIS AND UNDERSTANDING OF TIME VARYING IMAGERY, 2001, : 103 - 108
- [43] Multi-modal Dense Video Captioning 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2020), 2020, : 4117 - 4126
- [44] RetrievalMMT: Retrieval-Constrained Multi-Modal Prompt Learning for Multi-Modal Machine Translation PROCEEDINGS OF THE 4TH ANNUAL ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, ICMR 2024, 2024, : 860 - 868
- [45] UMT: Unified Multi-modal Transformers for Joint Video Moment Retrieval and Highlight Detection 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 3032 - 3041
- [46] Tagging before Alignment: Integrating Multi-Modal Tags for Video-Text Retrieval THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 1, 2023, : 396 - 404
- [47] Temporally Multi-Modal Semantic Reasoning with Spatial Language Constraints for Video Question Answering SYMMETRY-BASEL, 2022, 14 (06):
- [49] Automated Multi-Modal Video Editing for Ads Video PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 4823 - 4827
- [50] Personalized retrieval of sports video based on multi-modal analysis and user preference acquisition Multimedia Tools and Applications, 2009, 44 : 305 - 330