共 50 条
- [1] A novel method for image captioning using multimodal feature fusion employing mask RNN and LSTM models Soft Computing, 2023, 27 : 14205 - 14218
- [5] Sieve: Multimodal Dataset Pruning Using Image Captioning Models 2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024, : 22423 - 22432
- [6] Multimodal Data Augmentation for Image Captioning using Diffusion Models PROCEEDINGS OF THE 1ST WORKSHOP ON LARGE GENERATIVE MODELS MEET MULTIMODAL APPLICATIONS, LGM3A 2023, 2023, : 23 - 33
- [9] A NOVEL METHOD FOR STEREO MATCHING USING GABOR FEATURE IMAGE AND CONFIDENCE MASK 2013 IEEE INTERNATIONAL CONFERENCE ON VISUAL COMMUNICATIONS AND IMAGE PROCESSING (IEEE VCIP 2013), 2013,
- [10] Novel concept-based image captioning models using LSTM and multi-encoder transformer architecture SCIENTIFIC REPORTS, 2024, 14 (01):