41 items in total
- [33] Multi-page Document Visual Question Answering Using Self-attention Scoring Mechanism. DOCUMENT ANALYSIS AND RECOGNITION-ICDAR 2024, PT VI, 2024, 14809: 219-232
- [40] AFT-SAM: Adaptive Fusion Transformer with a Sparse Attention Mechanism for Audio-Visual Speech Recognition. APPLIED SCIENCES-BASEL, 2025, 15 (01):