50 records in total
- [1] Emotion Recognition from Speech Using Wav2vec 2.0 Embeddings INTERSPEECH 2021, 2021, : 3400 - 3404
- [2] WavFusion: Towards Wav2vec 2.0 Multimodal Speech Emotion Recognition MULTIMEDIA MODELING, MMM 2025, PT IV, 2025, 15523 : 325 - 336
- [3] Brazilian Portuguese Speech Recognition Using Wav2vec 2.0 COMPUTATIONAL PROCESSING OF THE PORTUGUESE LANGUAGE, PROPOR 2022, 2022, 13208 : 333 - 343
- [4] Multi-level Fusion of Wav2vec 2.0 and BERT for Multimodal Emotion Recognition INTERSPEECH 2022, 2022, : 4725 - 4729
- [6] Speech Emotion Recognition Based on Shallow Structure of Wav2vec 2.0 and Attention Mechanism 2024 IEEE 14TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING, ISCSLP 2024, 2024, : 398 - 402
- [8] Using Speaker-Specific Emotion Representations in Wav2vec 2.0-Based Modules for Speech Emotion Recognition CMC-COMPUTERS MATERIALS & CONTINUA, 2023, 77 (01): : 1009 - 1030
- [9] Detection of Prosodic Boundaries in Speech Using Wav2Vec 2.0 TEXT, SPEECH, AND DIALOGUE (TSD 2022), 2022, 13502 : 377 - 388
- [10] SERAB: A MULTI-LINGUAL BENCHMARK FOR SPEECH EMOTION RECOGNITION 2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 7697 - 7701