共 50 条
- [1] Multimodal attention-based transformer for video captioning [J]. Applied Intelligence, 2023, 53 : 23349 - 23368
- [2] Multimodal attention-based transformer for video captioning [J]. APPLIED INTELLIGENCE, 2023, 53 (20) : 23349 - 23368
- [3] Exploring adaptive attention in memory transformer applied to coherent video paragraph captioning [J]. 2022 IEEE EIGHTH INTERNATIONAL CONFERENCE ON MULTIMEDIA BIG DATA (BIGMM 2022), 2022, : 37 - 44
- [4] Captioning Transformer with Stacked Attention Modules [J]. APPLIED SCIENCES-BASEL, 2018, 8 (05):
- [5] Attention-Aligned Transformer for Image Captioning [J]. THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 607 - 615
- [7] Accelerated masked transformer for dense video captioning [J]. NEUROCOMPUTING, 2021, 445 : 72 - 80
- [8] Dense video captioning based on local attention [J]. IET IMAGE PROCESSING, 2023, 17 (09) : 2673 - 2685
- [9] Video captioning with global and local text attention [J]. The Visual Computer, 2022, 38 : 4267 - 4278
- [10] Contextual Attention Network for Emotional Video Captioning [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 1858 - 1867