共 50 条
- [2] Parallel Image Captioning Using 2D Masked Convolution [J]. APPLIED SCIENCES-BASEL, 2019, 9 (09):
- [5] Feedback Attention Model for Image Captioning [J]. Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics, 2019, 31 (07): : 1122 - 1129
- [6] Semantic-Conditional Diffusion Networks for Image Captioning [J]. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 23359 - 23368
- [7] Masked Diffusion Transformer is a Strong Image Synthesizer [J]. 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 23107 - 23116
- [9] Multimodal Data Augmentation for Image Captioning using Diffusion Models [J]. PROCEEDINGS OF THE 1ST WORKSHOP ON LARGE GENERATIVE MODELS MEET MULTIMODAL APPLICATIONS, LGM3A 2023, 2023, : 23 - 33