50 items in total
- [1] Self-Attention Transducers for End-to-End Speech Recognition [J]. INTERSPEECH 2019, 2019: 4395-4399
- [2] Insights on Neural Representations for End-to-End Speech Recognition [J]. INTERSPEECH 2021, 2021: 4079-4083
- [4] Minimum Latency Training of Sequence Transducers for Streaming End-to-End Speech Recognition [J]. INTERSPEECH 2022, 2022: 2098-2102
- [5] Segmental Recurrent Neural Networks for End-to-end Speech Recognition [J]. 17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016: 385-389
- [6] End-to-End Speech Emotion Recognition Based on Neural Network [J]. 2017 17TH IEEE INTERNATIONAL CONFERENCE ON COMMUNICATION TECHNOLOGY (ICCT 2017), 2017: 1634-1638
- [7] Towards End-to-End Speech Recognition with Recurrent Neural Networks [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 32 (CYCLE 2), 2014, 32: 1764-1772
- [8] ESPRESSO: A FAST END-TO-END NEURAL SPEECH RECOGNITION TOOLKIT [J]. 2019 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU 2019), 2019: 136-143
- [9] Large-Scale Streaming End-to-End Speech Translation with Neural Transducers [J]. INTERSPEECH 2022, 2022: 3263-3267
- [10] Exploring end-to-end framework towards Khasi speech recognition system [J]. International Journal of Speech Technology, 2021, 24: 419-424