50 items in total
- [1] END-TO-END AUDIO-VISUAL SPEECH RECOGNITION WITH CONFORMERS [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021: 7613-7617
- [2] MODALITY ATTENTION FOR END-TO-END AUDIO-VISUAL SPEECH RECOGNITION [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019: 6565-6569
- [3] FUSING INFORMATION STREAMS IN END-TO-END AUDIO-VISUAL SPEECH RECOGNITION [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021: 3430-3434
- [4] Investigating the Lombard Effect Influence on End-to-End Audio-Visual Speech Recognition [J]. INTERSPEECH 2019, 2019: 4090-4094
- [6] Visual Context-driven Audio Feature Enhancement for Robust End-to-End Audio-Visual Speech Recognition [J]. INTERSPEECH 2022, 2022: 2838-2842
- [7] END-TO-END MULTI-PERSON AUDIO/VISUAL AUTOMATIC SPEECH RECOGNITION [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020: 6994-6998
- [8] END-TO-END VISUAL SPEECH RECOGNITION WITH LSTMS [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017: 2592-2596
- [9] END-TO-END MULTI-TALKER OVERLAPPING SPEECH RECOGNITION [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020: 6129-6133
- [10] Using Twin-HMM-Based Audio-Visual Speech Enhancement as a Front-End for Robust Audio-Visual Speech Recognition [J]. 14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013: 867-871