共 50 条
- [42] Compact and Efficient Multitask Learning in Vision, Language and Speech [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2019, : 2933 - 2942
- [43] Multitask Learning with Low-Level Auxiliary Tasks for Encoder-Decoder Based Speech Recognition [J]. 18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 3532 - 3536
- [44] End-to-end Tibetan Ando dialect speech recognition based on hybrid CTC/attention architecture [J]. 2019 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2019, : 628 - 632
- [45] Deep Feature Learning for Tibetan Speech Recognition using Sparse Auto-encoder [J]. PROCEEDINGS OF THE 2015 INTERNATIONAL CONFERENCE ON ELECTRICAL, AUTOMATION AND MECHANICAL ENGINEERING (EAME 2015), 2015, 13 : 342 - 345
- [46] A language model for Amdo Tibetan speech recognition [J]. 2020 2ND INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE COMMUNICATION AND NETWORK SECURITY (CSCNS2020), 2021, 336
- [47] Sequence-to-Sequence Learning via Attention Transfer for Incremental Speech Recognition [J]. INTERSPEECH 2019, 2019, : 3835 - 3839
- [48] Noise-robust Attention Learning for End-to-End Speech Recognition [J]. 28TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2020), 2021, : 311 - 315
- [49] REPRESENTATION LEARNING WITH SPECTRO-TEMPORAL-CHANNEL ATTENTION FOR SPEECH EMOTION RECOGNITION [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 6304 - 6308
- [50] Advancing Multiple Instance Learning with Attention Modeling for Categorical Speech Emotion Recognition [J]. INTERSPEECH 2020, 2020, : 2357 - 2361