共 50 条
- [41] A BETTER AND FASTER END-TO-END MODEL FOR STREAMING ASR [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 5634 - 5638
- [42] INDEPENDENT LANGUAGE MODELING ARCHITECTURE FOR END-TO-END ASR [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 7059 - 7063
- [43] End-to-end Multi-modal Video Temporal Grounding [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
- [44] SPEAKER ADAPTATION FOR END-TO-END CTC MODELS [J]. 2018 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY (SLT 2018), 2018, : 542 - 549
- [45] GENERALIZED END-TO-END LOSS FOR SPEAKER VERIFICATION [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 4879 - 4883
- [46] A study on end-to-end speaker diarization system using single-label classification [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA, 2023, 42 (06): : 536 - 543
- [47] End-to-end Keywords Spotting Based on Connectionist Temporal Classification for Mandarin [J]. 2016 10TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2016,
- [48] TOWARDS END-TO-END SPEAKER DIARIZATION WITH GENERALIZED NEURAL SPEAKER CLUSTERING [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 8372 - 8376
- [49] End-to-end Learning for Graph Decomposition [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 10092 - 10101
- [50] End-To-End Graph-Based Deep Semi-Supervised Learning with Extended Graph Laplacian [J]. 2020 CHINESE AUTOMATION CONGRESS (CAC 2020), 2020, : 5948 - 5953