共 50 条
- [21] Cross-Modal Attention Network for Temporal Inconsistent Audio-Visual Event Localization THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 279 - 286
- [22] Temporal and Cross-modal Attention for Audio-Visual Zero-Shot Learning COMPUTER VISION, ECCV 2022, PT XX, 2022, 13680 : 488 - 505
- [23] Multimodal Emotion Recognition using Cross-Modal Attention and 1D Convolutional Neural Networks INTERSPEECH 2020, 2020, : 4243 - 4247
- [25] Deep Cross-Modal Audio-Visual Generation PROCEEDINGS OF THE THEMATIC WORKSHOPS OF ACM MULTIMEDIA 2017 (THEMATIC WORKSHOPS'17), 2017, : 349 - 357
- [27] Cross-modal prediction in audio-visual communication 1996 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, CONFERENCE PROCEEDINGS, VOLS 1-6, 1996, : 2056 - 2059
- [28] Cross-Modal Global Interaction and Local Alignment for Audio-Visual Speech Recognition PROCEEDINGS OF THE THIRTY-SECOND INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2023, 2023, : 5076 - 5084
- [29] Temporal aggregation of audio-visual modalities for emotion recognition 2020 43RD INTERNATIONAL CONFERENCE ON TELECOMMUNICATIONS AND SIGNAL PROCESSING (TSP), 2020, : 305 - 308