共 50 条
- [41] Adaptively Converting Auxiliary Attributes and Textual Embedding for Video Captioning Based on BiLSTM [J]. Neural Processing Letters, 2020, 52 : 2353 - 2369
- [45] Improving Image Captioning through Visual and Semantic Mutual Promotion [J]. PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 4716 - 4724
- [46] Visual versus Textual Embedding for Video Retrieval [J]. ADVANCED CONCEPTS FOR INTELLIGENT VISION SYSTEMS (ACIVS 2017), 2017, 10617 : 386 - 395
- [47] Video Captioning via Sentence Augmentation and Spatio-Temporal Attention [J]. COMPUTER VISION - ACCV 2016 WORKSHOPS, PT I, 2017, 10116 : 269 - 286
- [50] Spatio-Temporal Ranked-Attention Networks for Video Captioning [J]. 2020 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2020, : 1606 - 1615