50 entries in total
- [21] CPT: Cross-Modal Prefix-Tuning for Speech-to-Text Translation [J]. 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2022: 6217 - 6221
- [22] ComSL: A Composite Speech-Language Model for End-to-End Speech-to-Text Translation [J]. Advances in Neural Information Processing Systems 36 (NeurIPS 2023), 2023
- [23] Streaming Simultaneous Speech Translation with Augmented Memory Transformer [J]. 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2021), 2021: 7523 - 7527
- [24] Transcribing paralinguistic acoustic cues to target language text in transformer-based speech-to-text translation [J]. Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, 2021, 5 : 3976 - 3980
- [25] Transcribing Paralinguistic Acoustic Cues to Target Language Text in Transformer-based Speech-to-Text Translation [J]. INTERSPEECH 2021, 2021: 2262 - 2266
- [26] Comparative Analysis of Models for Neural Machine Speech-to-Text Translation for Turkic State Languages [J]. Intelligent Information and Database Systems, Pt II, ACIIDS 2024, 2024, 14796 : 360 - 371
- [28] M-Adapter: Modality Adaptation for End-to-End Speech-to-Text Translation [J]. INTERSPEECH 2022, 2022: 111 - 115
- [29] Leveraging Weakly Supervised Data to Improve End-to-End Speech-to-Text Translation [J]. 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2019: 7180 - 7184
- [30] Modular Speech-to-Text Translation for Zero-Shot Cross-Modal Transfer [J]. INTERSPEECH 2023, 2023: 32 - 36