共 50 条
- [32] PlaceFormer: Transformer-Based Visual Place Recognition Using Multi-Scale Patch Selection and Fusion IEEE ROBOTICS AND AUTOMATION LETTERS, 2024, 9 (07): : 6552 - 6559
- [34] Multi-Encoder Learning and Stream Fusion for Transformer-Based End-to-End Automatic Speech Recognition INTERSPEECH 2021, 2021, : 2846 - 2850
- [36] Worker behavior recognition based on temporal and spatial self-attention of vision Transformer Zhejiang Daxue Xuebao (Gongxue Ban)/Journal of Zhejiang University (Engineering Science), 2023, 57 (03): : 446 - 454
- [39] Transformer-based Self-supervised Representation Learning for Emotion Recognition Using Bio-signal Feature Fusion 2023 11TH INTERNATIONAL CONFERENCE ON AFFECTIVE COMPUTING AND INTELLIGENT INTERACTION WORKSHOPS AND DEMOS, ACIIW, 2023,
- [40] Group Gated Fusion on Attention-based Bidirectional Alignment for Multimodal Emotion Recognition INTERSPEECH 2020, 2020, : 379 - 383