共 50 条
- [3] Attention-Aligned Transformer for Image Captioning [J]. THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 607 - 615
- [4] Bridging CNN and Transformer With Cross-Attention Fusion Network for Hyperspectral Image Classification [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62
- [6] Cross modification attention-based deliberation model for image captioning [J]. Applied Intelligence, 2023, 53 : 5910 - 5933
- [9] Relational Attention with Textual Enhanced Transformer for Image Captioning [J]. PATTERN RECOGNITION AND COMPUTER VISION,, PT III, 2021, 13021 : 151 - 163
- [10] Stacked cross-modal feature consolidation attention networks for image captioning [J]. Multimedia Tools and Applications, 2024, 83 : 12209 - 12233