50 records in total
- [32] Cross-Modal Global Interaction and Local Alignment for Audio-Visual Speech Recognition. PROCEEDINGS OF THE THIRTY-SECOND INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2023, 2023: 5076 - 5084
- [34] Cross-Modal Attention Network for Temporal Inconsistent Audio-Visual Event Localization. THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34: 279 - 286
- [35] Temporal and Cross-modal Attention for Audio-Visual Zero-Shot Learning. COMPUTER VISION, ECCV 2022, PT XX, 2022, 13680: 488 - 505
- [36] Complete Cross-triplet Loss in Label Space for Audio-visual Cross-modal Retrieval. 2022 IEEE INTERNATIONAL SYMPOSIUM ON MULTIMEDIA (ISM), 2022: 1 - 9
- [37] Audio-Visual Activity Guided Cross-Modal Identity Association for Active Speaker Detection. IEEE OPEN JOURNAL OF SIGNAL PROCESSING, 2023, 4: 225 - 232
- [38] Audio-visual Generalised Zero-shot Learning with Cross-modal Attention and Language. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022: 10543 - 10553