共 50 条
- [1] Attention-Based Multimodal Fusion for Video Description [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 4203 - 4212
- [3] Singing Voice Extraction with Attention-based Spectrograms Fusion [J]. INTERSPEECH 2020, 2020, : 2412 - 2416
- [4] Hierarchical attention-based multimodal fusion for video captioning [J]. NEUROCOMPUTING, 2018, 315 : 362 - 370
- [9] Multimodal Sentiment Analysis Using BiGRU and Attention-Based Hybrid Fusion Strategy [J]. INTELLIGENT AUTOMATION AND SOFT COMPUTING, 2023, 37 (02): : 1963 - 1981
- [10] Group Gated Fusion on Attention-based Bidirectional Alignment for Multimodal Emotion Recognition [J]. INTERSPEECH 2020, 2020, : 379 - 383