共 50 条
- [41] Dense Captioning with Joint Inference and Visual Context 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 1978 - 1987
- [43] MIVCN: Multimodal interaction video captioning network based on semantic association graph Applied Intelligence, 2022, 52 : 5241 - 5260
- [44] Video captioning algorithm based on mixed training and semantic association Huazhong Keji Daxue Xuebao (Ziran Kexue Ban)/Journal of Huazhong University of Science and Technology (Natural Science Edition), 2023, 51 (11): : 67 - 74
- [46] MULTI-MODAL HIERARCHICAL ATTENTION-BASED DENSE VIDEO CAPTIONING 2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2023, : 475 - 479
- [47] Jointly Localizing and Describing Events for Dense Video Captioning 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 7492 - 7500
- [49] Context Visual Information-based Deliberation Network for Video Captioning 2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 9812 - 9818
- [50] Step by Step: A Gradual Approach for Dense Video Captioning IEEE ACCESS, 2023, 11 : 51949 - 51959