共 50 条
- [2] TransFusion: Multi-Modal Fusion for Video Tag Inference via Translation-based Knowledge Embedding PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 1093 - 1101
- [3] Video Visual Relation Detection via Multi-modal Feature Fusion PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA (MM'19), 2019, : 2657 - 2661
- [4] A Chinese Multi-modal Relation Extraction Model for Internet Security of Finance 52ND ANNUAL IEEE/IFIP INTERNATIONAL CONFERENCE ON DEPENDABLE SYSTEMS AND NETWORKS WORKSHOP VOLUME (DSN-W 2022), 2022, : 123 - 128
- [5] Latent Variable Model for Multi-modal Translation 57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), 2019, : 6392 - 6405
- [6] Visual Agreement Regularized Training for Multi-Modal Machine Translation THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 9418 - 9425
- [7] Adding visual attention into encoder-decoder model for multi-modal machine translation JOURNAL OF ENGINEERING RESEARCH, 2023, 11 (02):
- [10] MUSE: MULTI-MODAL TARGET SPEAKER EXTRACTION WITH VISUAL CUES 2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 6678 - 6682