共 50 条
- [1] End-to-end Generative Pretraining for Multimodal Video Captioning [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 17938 - 17947
- [2] PWS-DVC: Enhancing Weakly Supervised Dense Video Captioning With Pretraining Approach [J]. IEEE ACCESS, 2023, 11 : 128162 - 128174
- [3] Multirate Multimodal Video Captioning [J]. PROCEEDINGS OF THE 2017 ACM MULTIMEDIA CONFERENCE (MM'17), 2017, : 1877 - 1882
- [4] Survey of Dense Video Captioning [J]. Computer Engineering and Applications, 2023, 59 (12): : 28 - 48
- [5] Streamlined Dense Video Captioning [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 3581 - +
- [6] Vid2Seq: Large-Scale Pretraining of a Visual Language Model for Dense Video Captioning [J]. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 10714 - 10726
- [7] CapDet: Unifying Dense Captioning and Open-World Detection Pretraining [J]. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 15233 - 15243
- [8] Deep multimodal embedding for video captioning [J]. Multimedia Tools and Applications, 2019, 78 : 31793 - 31805
- [10] An Efficient Framework for Dense Video Captioning [J]. THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 12039 - 12046