50 results in total
- [21] Transformer-based end-to-end speech recognition with residual Gaussian-based self-attention. Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, 2021, 2: 1495-1499
- [22] UNTIED POSITIONAL ENCODINGS FOR EFFICIENT TRANSFORMER-BASED SPEECH RECOGNITION. 2022 IEEE SPOKEN LANGUAGE TECHNOLOGY WORKSHOP, SLT, 2022: 108-114
- [23] The MERSA Dataset and a Transformer-Based Approach for Speech Emotion Recognition. PROCEEDINGS OF THE 62ND ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 1: LONG PAPERS, 2024: 13960-13970
- [24] AFT-SAM: Adaptive Fusion Transformer with a Sparse Attention Mechanism for Audio-Visual Speech Recognition. APPLIED SCIENCES-BASEL, 2025, 15 (01):
- [25] Intra-ensemble: A New Method for Combining Intermediate Outputs in Transformer-based Automatic Speech Recognition. INTERSPEECH 2023, 2023: 2203-2207
- [26] STRUCTURED SPARSE ATTENTION FOR END-TO-END AUTOMATIC SPEECH RECOGNITION. 2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020: 7044-7048
- [27] Layer Sparse Transformer for Speech Recognition. 2023 IEEE INTERNATIONAL CONFERENCE ON KNOWLEDGE GRAPH, ICKG, 2023: 269-273
- [28] Simulating reading mistakes for child speech Transformer-based phone recognition. INTERSPEECH 2021, 2021: 3860-3864
- [29] End to end transformer-based contextual speech recognition based on pointer network. INTERSPEECH 2021, 2021: 2087-2091
- [30] Multi-Encoder Learning and Stream Fusion for Transformer-Based End-to-End Automatic Speech Recognition. INTERSPEECH 2021, 2021: 2846-2850