共 50 条
- [31] MDC-Net: Multimodal Detection and Captioning Network for Steel Surface Defects ROBOTICS, COMPUTER VISION AND INTELLIGENT SYSTEMS, ROBOVIS 2024, 2024, 2077 : 316 - 333
- [33] Element-Centered Multi-granularity Network for Dense Video Captioning PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2024, PT X, 2025, 15040 : 445 - 459
- [35] A news image captioning approach based on multimodal pointer-generator network CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2022, 34 (07):
- [36] Dense Receptive Field Network: A Backbone Network for Object Detection ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2019: IMAGE PROCESSING, PT III, 2019, 11729 : 105 - 118
- [37] Object Hallucination in Image Captioning 2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018), 2018, : 4035 - 4045
- [38] Deep multimodal embedding for video captioning Multimedia Tools and Applications, 2019, 78 : 31793 - 31805
- [40] Weakly Supervised Dense Video Captioning 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 5159 - 5167