共 50 条
- [1] End-to-End Dense Video Captioning with Masked Transformer [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 8739 - 8748
- [2] End-to-End Dense Video Captioning with Parallel Decoding [J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 6827 - 6837
- [3] End-to-end Generative Pretraining for Multimodal Video Captioning [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 17938 - 17947
- [4] End-to-End Video Captioning with Multitask Reinforcement Learning [J]. 2019 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2019, : 339 - 348
- [5] SWINBERT: End-to-End Transformers with Sparse Attention for Video Captioning [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 17928 - 17937
- [7] An End-to-End Deep Learning Approach for Video Captioning Through Mobile Devices [J]. PROGRESS IN PATTERN RECOGNITION, IMAGE ANALYSIS, COMPUTER VISION, AND APPLICATIONS, CIARP 2023, PT I, 2024, 14469 : 715 - 729
- [8] End-to-end Concept Word Detection for Video Captioning, Retrieval, and Question Answering [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 3261 - 3269
- [9] Video Caption Based Searching Using End-to-End Dense Captioning and Sentence Embeddings [J]. SYMMETRY-BASEL, 2020, 12 (06):