共 50 条
- [21] Hierarchical Separable Video Transformer for Snapshot Compressive Imaging COMPUTER VISION - ECCV 2024, PT LXXXI, 2025, 15139 : 104 - 122
- [25] Multimodal Interaction Fusion Network Based on Transformer for Video Captioning ARTIFICIAL INTELLIGENCE AND ROBOTICS, ISAIR 2022, PT I, 2022, 1700 : 21 - 36
- [28] Dual-Stream Multimodal Learning for Topic-Adaptive Video Highlight Detection PROCEEDINGS OF THE 2023 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, ICMR 2023, 2023, : 272 - 279