共 50 条
- [1] Deep multimodal embedding for video captioning [J]. Multimedia Tools and Applications, 2019, 78 : 31793 - 31805
- [2] Deep multimodal embedding for video captioning [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2019, 78 (22) : 31793 - 31805
- [3] Position embedding fusion on transformer for dense video captioning [J]. DEVELOPMENTS OF ARTIFICIAL INTELLIGENCE TECHNOLOGIES IN COMPUTATION AND ROBOTICS, 2020, 12 : 792 - 799
- [4] Early and Late Combinations of Criteria for Reranking Distributional Thesauri [J]. PROCEEDINGS OF THE 53RD ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL) AND THE 7TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (IJCNLP), VOL 2, 2015, : 470 - 476
- [5] Improving Video Captioning with Temporal Composition of a Visual-Syntactic Embedding [J]. 2021 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION WACV 2021, 2021, : 3038 - 3048
- [6] VIDEO SEARCH RERANKING VIA ONLINE ORDINAL RERANKING [J]. 2008 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOLS 1-4, 2008, : 285 - 288
- [7] Adaptively Converting Auxiliary Attributes and Textual Embedding for Video Captioning Based on BiLSTM [J]. Neural Processing Letters, 2020, 52 : 2353 - 2369
- [10] Dense Video Captioning With Early Linguistic Information Fusion [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 2309 - 2322