共 50 条
- [1] CRET: Cross-Modal Retrieval Transformer for Efficient Text-Video Retrieval PROCEEDINGS OF THE 45TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '22), 2022, : 949 - 959
- [4] Efficient text-to-video retrieval via multi-modal multi-tagger derived pre-screening Visual Intelligence, 2025, 3 (1):
- [7] Tagging before Alignment: Integrating Multi-Modal Tags for Video-Text Retrieval THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 1, 2023, : 396 - 404
- [8] Fine-grained Cross-modal Alignment Network for Text-Video Retrieval PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 3826 - 3834
- [9] Predicting Micro-video Popularity via Multi-modal Retrieval Augmentation PROCEEDINGS OF THE 47TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, SIGIR 2024, 2024, : 2579 - 2583
- [10] An Intelligent Advertisement Short Video Production System via Multi-Modal Retrieval PROCEEDINGS OF THE 45TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '22), 2022, : 3368 - 3372