共 50 条
- [21] Trends in Event Understanding and Caption Generation/Reconstruction in Dense Video: A Review CMC-COMPUTERS MATERIALS & CONTINUA, 2024, 78 (03): : 2941 - 2965
- [24] CLIP4Caption: CLIP for Video Caption PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 4858 - 4862
- [25] Transformer model incorporating local graph semantic attention for image caption VISUAL COMPUTER, 2024, 40 (09): : 6533 - 6544
- [29] Attention-based Visual-Audio Fusion for Video Caption Generation 2019 IEEE 4TH INTERNATIONAL CONFERENCE ON ADVANCED ROBOTICS AND MECHATRONICS (ICARM 2019), 2019, : 839 - 844