50 entries in total
- [1] Combining text and audio-visual features in video indexing [J]. 2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005 : 1005 - 1008
- [2] Multimedia: A traditional subject (Audio-visual, text, libraries) [J]. DEGRES-REVUE DE SYNTHESE A ORIENTATION SEMIOLOGIQUE, 1998, (92-93) : B1 - B12
- [3] The audio-visual text: Subtitling and dubbing different genres [J]. META, 2004, 49 (01) : 25 - 38
- [4] Predicting Audio-Visual Salient Events Based on Visual, Audio and Text Modalities for Movie Summarization [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2015 : 4361 - 4365
- [5] Towards Audio-Visual Saliency Prediction for Omnidirectional Video with Spatial Audio [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON VISUAL COMMUNICATIONS AND IMAGE PROCESSING (VCIP), 2020 : 355 - 358
- [6] An audio-visual distance for audio-visual speech vector quantization [J]. 1998 IEEE SECOND WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING, 1998 : 523 - 528
- [8] Exploiting Audio-Visual Consistency with Partial Supervision for Spatial Audio Generation [J]. THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 2056 - 2063
- [9] Towards Audio-Visual Cues for Cloud Infrastructure Monitoring [J]. PROCEEDINGS 2016 IEEE INTERNATIONAL CONFERENCE ON CLOUD ENGINEERING (IC2E), 2016 : 218 - 219
- [10] Towards practical deployment of audio-visual speech recognition [J]. 2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL III, PROCEEDINGS: IMAGE AND MULTIDIMENSIONAL SIGNAL PROCESSING SPECIAL SESSIONS, 2004 : 777 - 780