50 entries in total
- [21] Masking Modalities for Cross-modal Video Retrieval 2022 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2022), 2022, : 2111 - 2120
- [22] Cross-modal Embeddings for Video and Audio Retrieval COMPUTER VISION - ECCV 2018 WORKSHOPS, PT IV, 2019, 11132 : 711 - 716
- [23] Cross-Modal and Hierarchical Modeling of Video and Text COMPUTER VISION - ECCV 2018, PT XIII, 2018, 11217 : 385 - 401
- [24] Fine-grained Cross-modal Alignment Network for Text-Video Retrieval PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 3826 - 3834
- [26] Cross-Modal Coherence for Text-to-Image Retrieval THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / TWELFTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 10427 - 10435
- [27] X-Pool: Cross-Modal Language-Video Attention for Text-Video Retrieval 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 4996 - 5005
- [29] Video-Based Cross-Modal Recipe Retrieval PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA (MM'19), 2019, : 1685 - 1693