共 50 条
- [1] Multi-Modal Multi-Action Video Recognition 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 13658 - 13667
- [2] Language-guided Multi-Modal Fusion for Video Action Recognition 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW 2021), 2021, : 3151 - 3155
- [4] Multi-modal Laughter Recognition in Video Conversations 2009 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPR WORKSHOPS 2009), VOLS 1 AND 2, 2009, : 869 - 874
- [5] On Pursuit of Designing Multi-modal Transformer for Video Grounding 2021 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2021), 2021, : 9810 - 9823
- [7] Multi-modal Transformer for Indoor Human Action Recognition 2022 22ND INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND SYSTEMS (ICCAS 2022), 2022, : 1155 - 1160
- [8] Multi-Modal Emotion Recognition Fusing Video and Audio APPLIED MATHEMATICS & INFORMATION SCIENCES, 2013, 7 (02): : 455 - 462
- [9] Everything at Once - Multi-modal Fusion Transformer for Video Retrieval 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 19988 - 19997
- [10] A comprehensive video dataset for multi-modal recognition systems Data Science Journal, 2019, 18 (01):