共 50 条
- [11] Data Augmentation for End-to-End Optical Music Recognition [J]. DOCUMENT ANALYSIS AND RECOGNITION, ICDAR 2021 WORKSHOPS, PT I, 2021, 12916 : 59 - 73
- [12] Semantic Mask for Transformer based End-to-End Speech Recognition [J]. INTERSPEECH 2020, 2020, : 971 - 975
- [13] You Do Not Need More Data: Improving End-To-End Speech Recognition by Text-To-Speech Data Augmentation [J]. 2020 13TH INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING, BIOMEDICAL ENGINEERING AND INFORMATICS (CISP-BMEI 2020), 2020, : 439 - 444
- [14] Multi-Level Modeling Units for End-to-End Mandarin Speech Recognition [J]. 2022 13TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2022, : 175 - 179
- [15] Tibetan-Mandarin Bilingual Speech Recognition Based on End-to-End Framework [J]. 2017 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC 2017), 2017, : 1214 - 1217
- [16] SUBBAND TEMPORAL ENVELOPE FEATURES AND DATA AUGMENTATION FOR END-TO-END RECOGNITION OF DISTANT CONVERSATIONAL SPEECH [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 6251 - 6255
- [17] AN ANALYSIS OF DECODING FOR ATTENTION-BASED END-TO-END MANDARIN SPEECH RECOGNITION [J]. 2018 11TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2018, : 384 - 388
- [18] Extending Recurrent Neural Aligner for Streaming End-to-End Speech Recognition in Mandarin [J]. 19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 816 - 820
- [19] On the End-to-End Solution to Mandarin-English Code-switching Speech Recognition [J]. INTERSPEECH 2019, 2019, : 2165 - 2169