共 50 条
- [1] End-to-end recurrent denoising autoencoder embeddings for speaker identification [J]. NEURAL COMPUTING & APPLICATIONS, 2021, 33 (21): : 14429 - 14439
- [2] End-to-End Chinese Speaker Identification [J]. NAACL 2022: THE 2022 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES, 2022, : 2274 - 2285
- [3] Shortcut Connections based Deep Speaker Embeddings for End-to-End Speaker Verification System [J]. INTERSPEECH 2019, 2019, : 2928 - 2932
- [4] DEEP NEURAL NETWORK-BASED SPEAKER EMBEDDINGS FOR END-TO-END SPEAKER VERIFICATION [J]. 2016 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY (SLT 2016), 2016, : 165 - 170
- [5] End-to-End Multi-Speaker Speech Recognition using Speaker Embeddings and Transfer Learning [J]. INTERSPEECH 2019, 2019, : 4425 - 4429
- [6] Improved Relation Networks for End-to-End Speaker Verification and Identification [J]. INTERSPEECH 2022, 2022, : 5085 - 5089
- [7] FRAME-LEVEL SPEAKER EMBEDDINGS FOR TEXT-INDEPENDENT SPEAKER RECOGNITION AND ANALYSIS OF END-TO-END MODEL [J]. 2018 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY (SLT 2018), 2018, : 1007 - 1013
- [8] End-to-end Convolutional Semantic Embeddings [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 5735 - 5744
- [9] Hybrid Network For End-To-End Text-Independent Speaker Identification [J]. 2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 2352 - 2359
- [10] End-to-End Active Speaker Detection [J]. COMPUTER VISION, ECCV 2022, PT XXXVII, 2022, 13697 : 126 - 143