共 50 条
- [21] DeepVANet: A Deep End-to-End Network for Multi-modal Emotion Recognition [J]. HUMAN-COMPUTER INTERACTION, INTERACT 2021, PT III, 2021, 12934 : 227 - 237
- [22] CONVOLUTIONAL DROPOUT AND WORDPIECE AUGMENTATION FOR END-TO-END SPEECH RECOGNITION [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 5984 - 5988
- [24] DEEP CONTEXT: END-TO-END CONTEXTUAL SPEECH RECOGNITION [J]. 2018 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY (SLT 2018), 2018, : 418 - 425
- [26] Improvement of Speech Emotion Recognition by Deep Convolutional Neural Network and Speech Features [J]. THIRD CONGRESS ON INTELLIGENT SYSTEMS, CIS 2022, VOL 1, 2023, 608 : 117 - 129
- [28] End-to-End Speech Command Recognition with Capsule Network [J]. 19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 776 - 780
- [30] End-to-end speech emotion recognition using a novel context-stacking dilated convolution neural network [J]. EURASIP Journal on Audio, Speech, and Music Processing, 2021