共 19 条
- [1] Exploring wav2vec 2.0 on speaker verification and language identification INTERSPEECH 2021, 2021, : 1509 - 1513
- [2] Low Resource Comparison of Attention-based and Hybrid ASR Exploiting wav2vec 2.0 INTERSPEECH 2022, 2022, : 3543 - 3547
- [3] wav2vec 2.0 ASR for Cantonese-Speaking Older Adults in a Clinical Setting INTERSPEECH 2023, 2023, : 4958 - 4962
- [4] SYNTHETIC SPEECH DETECTION WITH WAV2VEC 2.0 IN VARIOUS LANGUAGE SETTINGS 2024 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING WORKSHOPS, ICASSPW 2024, 2024, : 585 - 589
- [5] On the robustness of wav2vec 2.0 based speaker recognition systems INTERSPEECH 2023, 2023, : 3177 - 3181
- [7] wav2vec 2.0: A Framework for Self-Supervised Learning of Speech Representations ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
- [8] Multi-level Fusion of Fisher Vector Encoded BERT and Wav2vec 2.0 Embeddings for Native Language Identification SPEECH AND COMPUTER, SPECOM 2022, 2022, 13721 : 391 - 403
- [9] Speech Emotion Recognition Based on Shallow Structure of Wav2vec 2.0 and Attention Mechanism 2024 IEEE 14TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING, ISCSLP 2024, 2024, : 398 - 402
- [10] Exploring Aggregated wav2vec 2.0 Features and Dual-Stream TDNN for Efficient Spoken Dialect Identification IEEE ACCESS, 2025, 13 : 3115 - 3129