50 items in total
- [31] Improving End-to-End SLU performance with Prosodic Attention and Distillation. INTERSPEECH 2023, 2023: 1114-1118
- [32] SWINBERT: End-to-End Transformers with Sparse Attention for Video Captioning. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022: 17928-17937
- [33] Self-Attention Transducers for End-to-End Speech Recognition. INTERSPEECH 2019, 2019: 4395-4399
- [34] End-to-End Chinese Image Text Recognition with Attention Model. NEURAL INFORMATION PROCESSING (ICONIP 2017), PT III, 2017, 10636: 180-189
- [35] STRUCTURED SPARSE ATTENTION FOR END-TO-END AUTOMATIC SPEECH RECOGNITION. 2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020: 7044-7048
- [36] END-TO-END NEURAL SPEAKER DIARIZATION WITH SELF-ATTENTION. 2019 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU 2019), 2019: 296-303
- [37] A Novel End-to-End Image Caption Based on Multimodal Attention. Dianzi Keji Daxue Xuebao/Journal of the University of Electronic Science and Technology of China, 2020, 49(06): 867-874
- [38] Improved training of end-to-end attention models for speech recognition. 19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018: 7-11