50 items in total
- [32] Hierarchical Global-Local Temporal Modeling for Video Captioning [J]. PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA (MM'19), 2019 : 774 - 783
- [33] Diverse Video Captioning by Adaptive Spatio-temporal Attention [J]. PATTERN RECOGNITION, DAGM GCPR 2022, 2022, 13485 : 409 - 425
- [34] Video Captioning Based on the Spatial-Temporal Saliency Tracing [J]. ADVANCES IN MULTIMEDIA INFORMATION PROCESSING, PT I, 2018, 11164 : 59 - 70
- [35] Exploiting long-term temporal dynamics for video captioning [J]. World Wide Web, 2019, 22 : 735 - 749
- [36] Exploiting long-term temporal dynamics for video captioning [J]. WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS, 2019, 22 (02) : 735 - 749
- [37] Exploring the Spatio-Temporal Aware Graph for video captioning [J]. IET COMPUTER VISION, 2022, 16 (05) : 456 - 467
- [38] Spatio-Temporal Attention Models for Grounded Video Captioning [J]. COMPUTER VISION - ACCV 2016, PT IV, 2017, 10114 : 104 - 119
- [39] Fused GRU with semantic-temporal attention for video captioning [J]. NEUROCOMPUTING, 2020, 395 : 222 - 228