50 items in total
- [21] Multi-Modal Emotion Recognition Fusing Video and Audio. Applied Mathematics & Information Sciences, 2013, 7(02): 455-462
- [22] ITA: Image-Text Alignments for Multi-Modal Named Entity Recognition. NAACL 2022: The 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022: 3176-3189
- [24] Multi-modal Image Fusion Based on ROI and Laplacian Pyramid. Sixth International Conference on Graphic and Image Processing (ICGIP 2014), 2015, 9443
- [25] Fabric image retrieval based on multi-modal feature fusion. Signal, Image and Video Processing, 2024, 18: 2207-2217
- [27] On Multi-modal Fusion for Freehand Gesture Recognition. Artificial Neural Networks and Machine Learning, ICANN 2020, Pt I, 2020, 12396: 862-873
- [28] Masked Audio Text Encoders are Effective Multi-Modal Rescorers. Findings of the Association for Computational Linguistics (ACL 2023), 2023: 10718-10730
- [30] Audio-Visual Scene Classification Based on Multi-modal Graph Fusion. INTERSPEECH 2022, 2022: 4157-4161