共 50 条
- [21] A fusion scheme of visual and auditory modalities for event detection in sports video [J]. 2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL III, PROCEEDINGS: IMAGE & MULTIDIMENSIONAL SIGNAL PROCESSING SIGNAL, PROCESSING EDUCATION, 2003, : 189 - 192
- [22] Acoustic Event Detection Based on Feature-Level Fusion of Audio and Video Modalities [J]. EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING, 2011,
- [23] Acoustic Event Detection Based on Feature-Level Fusion of Audio and Video Modalities [J]. EURASIP Journal on Advances in Signal Processing, 2011
- [24] Information Fusion for Combining Visual and Textual Image Retrieval in ImageCLEF@ICPR [J]. RECOGNIZING PATTERNS IN SIGNALS, SPEECH, IMAGES, AND VIDEOS, 2010, 6388 : 129 - 137
- [28] Semantic analysis based on fusion of audio/visual features for soccer video [J]. PROCEEDINGS OF THE 10TH INTERNATIONAL CONFERENCE OF INFORMATION AND COMMUNICATION TECHNOLOGY, 2021, 183 : 563 - 571
- [29] Attention-Based Audio-Visual Fusion for Video Summarization [J]. NEURAL INFORMATION PROCESSING (ICONIP 2019), PT II, 2019, 11954 : 328 - 340
- [30] Video scene retrieval with symbol sequence based on integrated audio and visual features [J]. MULTIMEDIA CONTENT ANALYSIS, MANAGEMENT, AND RETRIEVAL 2006, 2006, 6073