50 items in total
- [1] Audio-Visual Attention Networks for Emotion Recognition [J]. AVSU'18: PROCEEDINGS OF THE 2018 WORKSHOP ON AUDIO-VISUAL SCENE UNDERSTANDING FOR IMMERSIVE MULTIMEDIA, 2018: 27-32
- [2] Towards Audio-Visual Saliency Prediction for Omnidirectional Video with Spatial Audio [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON VISUAL COMMUNICATIONS AND IMAGE PROCESSING (VCIP), 2020: 355-358
- [3] TAVT: Towards Transferable Audio-Visual Text Generation [J]. PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2023): LONG PAPERS, VOL 1, 2023: 14983-14999
- [4] Towards Audio-Visual Cues for Cloud Infrastructure Monitoring [J]. PROCEEDINGS 2016 IEEE INTERNATIONAL CONFERENCE ON CLOUD ENGINEERING (IC2E), 2016: 218-219
- [5] Towards practical deployment of audio-visual speech recognition [J]. 2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL III, PROCEEDINGS: IMAGE AND MULTIDIMENSIONAL SIGNAL PROCESSING SPECIAL SESSIONS, 2004: 777-780
- [7] Dynamic Bayesian Networks for Audio-Visual Speech Recognition [J]. EURASIP Journal on Advances in Signal Processing, 2002
- [8] Dynamic Bayesian Networks for audio-visual speaker recognition [J]. ADVANCES IN BIOMETRICS, PROCEEDINGS, 2006, 3832: 539-545