共 24 条
- [2] VGGSOUND: A LARGE-SCALE AUDIO-VISUAL DATASET [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 721 - 725
- [4] ON ADVERSARIAL ROBUSTNESS OF LARGE-SCALE AUDIO VISUAL LEARNING [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 231 - 235
- [6] Audio-visual aligned saliency model for omnidirectional video with implicit neural representation learning [J]. Applied Intelligence, 2023, 53 : 22615 - 22634
- [7] A Large-scale Depth-based Multimodal Audio-Visual Corpus in Mandarin [J]. IEEE 20TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS / IEEE 16TH INTERNATIONAL CONFERENCE ON SMART CITY / IEEE 4TH INTERNATIONAL CONFERENCE ON DATA SCIENCE AND SYSTEMS (HPCC/SMARTCITY/DSS), 2018, : 881 - 885
- [9] Large Scale Audio-Visual Video Analytics Platform for Forensic Investigations of Terroristic Attacks [J]. MULTIMEDIA MODELING, MMM 2019, PT II, 2019, 11296 : 106 - 119