共 50 条
- [1] Multi-modal Dense Video Captioning 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2020), 2020, : 4117 - 4126
- [2] Multi-modal Dependency Tree for Video Captioning ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
- [4] A comprehensive video dataset for multi-modal recognition systems Data Science Journal, 2019, 18 (01):
- [5] MULTI-MODAL HIERARCHICAL ATTENTION-BASED DENSE VIDEO CAPTIONING 2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2023, : 475 - 479
- [9] M-VAD names: a dataset for video captioning with naming Multimedia Tools and Applications, 2019, 78 : 14007 - 14027
- [10] Towards Developing a Multi-Modal Video Recommendation System 2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,