共 50 条
- [2] Towards Efficient and Effective Text-to-Video Retrieval with Coarse-to-Fine Visual Representation Learning THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 6, 2024, : 5207 - 5214
- [4] Visual to Text: Survey of Image and Video Captioning IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2019, 3 (04): : 297 - 312
- [5] Lightweight Attentional Feature Fusion: A New Baseline for Text-to-Video Retrieval COMPUTER VISION - ECCV 2022, PT XIV, 2022, 13674 : 444 - 461
- [6] Write What YouWant: Applying Text-to-Video Retrieval to Audiovisual Archives ACM JOURNAL ON COMPUTING AND CULTURAL HERITAGE, 2023, 16 (04):
- [7] Fighting FIRe with FIRE: Assessing the Validity of Text-to-Video Retrieval Benchmarks 17TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EACL 2023, 2023, : 47 - 68
- [8] Relation Triplet Construction for Cross-modal Text-to-Video Retrieval PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 4759 - 4767
- [9] Audio-Enhanced Text-to-Video Retrieval using Text-Conditioned Feature Alignment 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 12020 - 12030