共 50 条
- [1] Multi-modal Dense Video Captioning [J]. 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2020), 2020, : 4117 - 4126
- [2] Effective deep learning-based multi-modal retrieval [J]. VLDB JOURNAL, 2016, 25 (01): : 79 - 101
- [4] Multi-modal Dependency Tree for Video Captioning [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
- [6] Applying deep learning-based multi-modal for detection of coronavirus [J]. Multimedia Systems, 2022, 28 : 1251 - 1262
- [7] Deep Learning-Based CNN Multi-Modal Camera Model Identification for Video Source Identification [J]. Informatica (Slovenia), 2023, 47 (03): : 417 - 430
- [8] ReCoAt: A Deep Learning-based Framework for Multi-Modal Motion Prediction in Autonomous Driving Application [J]. 2022 IEEE 25TH INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS (ITSC), 2022, : 988 - 993
- [9] Multi-Modal Deep Learning-Based Violin Bowing Action Recognition [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS - TAIWAN (ICCE-TAIWAN), 2020,
- [10] MULTI-MODAL HIERARCHICAL ATTENTION-BASED DENSE VIDEO CAPTIONING [J]. 2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2023, : 475 - 479