50 items in total
- [41] Multimodal Emotion Recognition using Cross-Modal Attention and 1D Convolutional Neural Networks. INTERSPEECH 2020, 2020: 4243-4247
- [44] Whispering LLaMA: A Cross-Modal Generative Error Correction Framework for Speech Recognition. 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP 2023), 2023: 10007-10016
- [45] Cross-Modal Mutual Learning for Audio-Visual Speech Recognition and Manipulation. Thirty-Sixth AAAI Conference on Artificial Intelligence / Thirty-Fourth Conference on Innovative Applications of Artificial Intelligence / The Twelfth Symposium on Educational Advances in Artificial Intelligence, 2022: 3036-3044
- [49] Cross-Modal Distillation for Speaker Recognition. Thirty-Seventh AAAI Conference on Artificial Intelligence, Vol. 37, No. 11, 2023: 12977-12985