共 50 条
- [1] Audio-Visual Fusion Network Based on Conformer for Multimodal Emotion Recognition [J]. ARTIFICIAL INTELLIGENCE, CICAI 2022, PT II, 2022, 13605 : 315 - 326
- [4] Fuzzy-Neural-Network Based Audio-Visual Fusion for Speech Recognition [J]. 2019 1ST INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE IN INFORMATION AND COMMUNICATION (ICAIIC 2019), 2019, : 210 - 214
- [5] Audio-Visual Action Recognition Using Transformer Fusion Network [J]. APPLIED SCIENCES-BASEL, 2024, 14 (03):
- [7] Multimodal Deep Convolutional Neural Network for Audio-Visual Emotion Recognition [J]. ICMR'16: PROCEEDINGS OF THE 2016 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, 2016, : 281 - 284
- [9] Continuous Phoneme Recognition based on Audio-Visual Modality Fusion [J]. 2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
- [10] Robust Audio-Visual Speech Recognition Based on Hybrid Fusion [J]. 2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 7580 - 7586