共 50 条
- [21] TopicDVC: Dense Video Captioning with Topic Guidance [J]. 2024 IEEE 10TH INTERNATIONAL CONFERENCE ON EDGE COMPUTING AND SCALABLE CLOUD, EDGECOM 2024, 2024, : 82 - 87
- [22] Multimodal attention-based transformer for video captioning [J]. Applied Intelligence, 2023, 53 : 23349 - 23368
- [23] Jointly Localizing and Describing Events for Dense Video Captioning [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 7492 - 7500
- [24] Multimodal graph neural network for video procedural captioning [J]. NEUROCOMPUTING, 2022, 488 : 88 - 96
- [26] Multimodal attention-based transformer for video captioning [J]. APPLIED INTELLIGENCE, 2023, 53 (20) : 23349 - 23368
- [27] Learning Multimodal Attention LSTM Networks for Video Captioning [J]. PROCEEDINGS OF THE 2017 ACM MULTIMEDIA CONFERENCE (MM'17), 2017, : 537 - 545
- [28] Step by Step: A Gradual Approach for Dense Video Captioning [J]. IEEE ACCESS, 2023, 11 : 51949 - 51959
- [29] Dense Video Captioning With Early Linguistic Information Fusion [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 2309 - 2322