共 50 条
- [1] Temporal Cross-Modal Attention for Audio-Visual Event Localization [J]. Seimitsu Kogaku Kaishi/Journal of the Japan Society for Precision Engineering, 2022, 88 (03): : 263 - 268
- [2] Cross-modal Background Suppression for Audio-Visual Event Localization [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 19957 - 19966
- [3] SCLAV: Supervised Cross-modal Contrastive Learning for Audio-Visual Coding [J]. PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 261 - 270
- [4] Audio-Visual Event Localization based on Cross-Modal Interacting Guidance [J]. 2021 IEEE FOURTH INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND KNOWLEDGE ENGINEERING (AIKE 2021), 2021, : 104 - 107
- [5] Cross-Modal Relation-Aware Networks for Audio-Visual Event Localization [J]. MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, : 3893 - 3901
- [6] Cross-Modal Attention Network for Temporal Inconsistent Audio-Visual Event Localization [J]. THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 279 - 286
- [8] Effect of Uncertainty in Audio-Visual Cross-Modal Statistical Learning [J]. PERCEPTION, 2019, 48 : 109 - 109
- [9] Deep Cross-Modal Audio-Visual Generation [J]. PROCEEDINGS OF THE THEMATIC WORKSHOPS OF ACM MULTIMEDIA 2017 (THEMATIC WORKSHOPS'17), 2017, : 349 - 357
- [10] Cross-modal prediction in audio-visual communication [J]. 1996 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, CONFERENCE PROCEEDINGS, VOLS 1-6, 1996, : 2056 - 2059