50 items in total
- [1] Micro-expression recognition based on contextual transformer networks. VISUAL COMPUTER, 2025, 41 (03): 1527-1541
- [2] Visual Speech Recognition in Natural Scenes Based on Spatial Transformer Networks. 2020 IEEE 14TH INTERNATIONAL CONFERENCE ON ANTI-COUNTERFEITING, SECURITY, AND IDENTIFICATION (ASID), 2020: 1-5
- [3] Speech recognition by integrating audio, visual and contextual features based on neural networks. ADVANCES IN NATURAL COMPUTATION, PT 2, PROCEEDINGS, 2005, 3611: 155-164
- [6] Visual contextual relationship augmented transformer for image captioning. Applied Intelligence, 2024, 54: 4794-4813
- [7] ResT: An Efficient Transformer for Visual Recognition. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
- [9] Contextual Debiasing for Visual Recognition with Causal Mechanisms. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022: 12745-12755