共 50 条
- [44] Multi-level Visual Fusion Networks for Image Captioning 2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
- [46] A Unified Visual and Linguistic Semantics Method for Enhanced Image Captioning APPLIED SCIENCES-BASEL, 2024, 14 (06):
- [47] Aligning Linguistic Words and Visual Semantic Units for Image Captioning PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA (MM'19), 2019, : 765 - 773
- [50] VIXEN: Visual Text Comparison Network for Image Difference Captioning THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 2, 2024, : 846 - 854