共 50 条
- [1] INTEGRATION OF PRE-TRAINED NETWORKS WITH CONTINUOUS TOKEN INTERFACE FOR END-TO-END SPOKEN LANGUAGE UNDERSTANDING [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 7152 - 7156
- [3] Pre-trained multimodal end-to-end network for spoken language assessment incorporating prompts [J]. PROCEEDINGS OF 2022 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2022, : 1394 - 1398
- [5] End-to-End Neural Transformer Based Spoken Language Understanding [J]. INTERSPEECH 2020, 2020, : 866 - 870
- [6] Unsupervised Visual Anomaly Detection Using Self-Supervised Pre-Trained Transformer [J]. IEEE ACCESS, 2024, 12 : 127604 - 127613
- [7] Three-Module Modeling For End-to-End Spoken Language Understanding Using Pre-trained DNN-HMM-Based Acoustic-Phonetic Model [J]. INTERSPEECH 2021, 2021, : 4718 - 4722
- [8] TOWARDS END-TO-END SPOKEN LANGUAGE UNDERSTANDING [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 5754 - 5758
- [9] Low resource end-to-end spoken language understanding with capsule networks [J]. COMPUTER SPEECH AND LANGUAGE, 2021, 66
- [10] Adapting Transformer to End-to-end Spoken Language Translation [J]. INTERSPEECH 2019, 2019, : 1133 - 1137