50 entries in total
- [31] Regarding Topology and Adaptability in Differentiable WFST-Based E2E ASR. 2024 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING WORKSHOPS, ICASSPW 2024, 2024: 843-847
- [32] Language model adaptation using WFST-based speaking-style translation. 2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING I, 2003: 228-231
- [33] Streaming Decoder-Only Automatic Speech Recognition with Discrete Speech Units: A Pilot Study. INTERSPEECH 2024, 2024: 4468-4472
- [34] WFST-based Ground Truth Alignment for Difficult Historical Documents with Text Modification and Layout Variations. DOCUMENT RECOGNITION AND RETRIEVAL XX, 2013, 8658
- [35] Robust Automatic Speech Recognition with Decoder Oriented Ideal Binary Mask Estimation. 11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 3 AND 4, 2010: 2066-2069
- [36] Segmental Encoder-Decoder Models for Large Vocabulary Automatic Speech Recognition. 19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018: 766-770
- [37] Non-Autoregressive Transformer with Unified Bidirectional Decoder for Automatic Speech Recognition. 2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022: 6527-6531
- [38] A GPU-based WFST Decoder with Exact Lattice Generation. 19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018: 2212-2216
- [39] Automatic speech recognition based on diphones. MELECON '98 - 9TH MEDITERRANEAN ELECTROTECHNICAL CONFERENCE, VOLS 1 AND 2, 1998: 6-10