共 50 条
- [1] Leveraging Pseudo-labeled Data to Improve Direct Speech-to-Speech Translation [J]. INTERSPEECH 2022, 2022, : 1781 - 1785
- [2] LEVERAGING WEAKLY SUPERVISED DATA TO IMPROVE END-TO-END SPEECH-TO-TEXT TRANSLATION [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 7180 - 7184
- [3] Direct Speech-to-Speech Translation With Discrete Units [J]. PROCEEDINGS OF THE 60TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), VOL 1: (LONG PAPERS), 2022, : 3327 - 3339
- [4] Unsupervised training for Farsi-English speech-to-speech translation [J]. 2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 4977 - 4980
- [5] Enhanced Direct Speech-to-Speech Translation Using Self-supervised Pre-training and Data Augmentation [J]. INTERSPEECH 2022, 2022, : 5195 - 5199
- [6] Textless Speech-to-Speech Translation on Real Data [J]. NAACL 2022: THE 2022 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES, 2022, : 860 - 872
- [7] Unsupervised features from text for speech synthesis in a speech-to-speech translation system [J]. 12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 2164 - 2167
- [8] TRANSFORMER-BASED DIRECT SPEECH-TO-SPEECH TRANSLATION WITH TRANSCODER [J]. 2021 IEEE SPOKEN LANGUAGE TECHNOLOGY WORKSHOP (SLT), 2021, : 958 - 965
- [9] Direct speech-to-speech translation with a sequence-to-sequence model [J]. INTERSPEECH 2019, 2019, : 1123 - 1127
- [10] Direct Vs Cascaded Speech-to-Speech Translation Using Transformer [J]. SPEECH AND COMPUTER, SPECOM 2023, PT II, 2023, 14339 : 258 - 270