共 50 条
- [31] UNSUPERVISED PRE-TRAINING OF BIDIRECTIONAL SPEECH ENCODERS VIA MASKED RECONSTRUCTION 2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 6889 - 6893
- [32] Reducing Domain mismatch in Self-supervised speech pre-training INTERSPEECH 2022, 2022, : 3028 - 3032
- [33] SPEECH ENHANCEMENT WITH MIXTURE OF DEEP EXPERTS WITH CLEAN CLUSTERING PRE-TRAINING 2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 716 - 720
- [34] wav2vec: Unsupervised Pre-training for Speech Recognition INTERSPEECH 2019, 2019, : 3465 - 3469
- [35] TWO-STAGE PRE-TRAINING FOR SEQUENCE TO SEQUENCE SPEECH RECOGNITION 2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
- [36] Efficient Pre-training for Localized Instruction Generation of Procedural Videos COMPUTER VISION - ECCV 2024, PT XXXIX, 2025, 15097 : 347 - 363
- [38] Efficient Image Pre-training with Siamese Cropped Masked Autoencoders COMPUTER VISION - ECCV 2024, PT XXIII, 2025, 15081 : 348 - 366
- [39] SENTIMENT-AWARE AUTOMATIC SPEECH RECOGNITION PRE-TRAINING FOR ENHANCED SPEECH EMOTION RECOGNITION 2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 7347 - 7351
- [40] POSPAN: Position-Constrained Span Masking for Language Model Pre-training PROCEEDINGS OF THE 32ND ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2023, 2023, : 4420 - 4424