共 50 条
- [3] Hierarchical Cross-Modal Graph Consistency Learning for Video-Text Retrieval [J]. SIGIR '21 - PROCEEDINGS OF THE 44TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2021, : 1114 - 1124
- [4] CMMT: Cross-Modal Meta-Transformer for Video-Text Retrieval [J]. PROCEEDINGS OF THE 2023 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, ICMR 2023, 2023, : 76 - 84
- [5] Learning Joint Embedding with Multimodal Cues for Cross-Modal Video-Text Retrieval [J]. ICMR '18: PROCEEDINGS OF THE 2018 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, 2018, : 19 - 27
- [6] Fine-Grained Cross-Modal Contrast Learning for Video-Text Retrieval [J]. ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT V, ICIC 2024, 2024, 14866 : 298 - 310
- [7] Multi-Feature Graph Attention Network for Cross-Modal Video-Text Retrieval [J]. PROCEEDINGS OF THE 2021 INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL (ICMR '21), 2021, : 135 - 143
- [9] CLIP4Hashing: Unsupervised Deep Hashing for Cross-Modal Video-Text Retrieval [J]. PROCEEDINGS OF THE 2022 INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, ICMR 2022, 2022, : 158 - 166
- [10] Video-Text as Game Players: Hierarchical Banzhaf Interaction for Cross-Modal Representation Learning [J]. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, : 2472 - 2482