50 items in total
- [41] Using Twin-HMM-Based Audio-Visual Speech Enhancement as a Front-End for Robust Audio-Visual Speech Recognition. 14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013: 867-871
- [42] Visual Context-driven Audio Feature Enhancement for Robust End-to-End Audio-Visual Speech Recognition. INTERSPEECH 2022, 2022: 2838-2842
- [43] Audio-Visual Speech Recognition in Noisy Audio Environments. 2013 36TH INTERNATIONAL CONFERENCE ON TELECOMMUNICATIONS AND SIGNAL PROCESSING (TSP), 2013: 484-487
- [45] Watch or Listen: Robust Audio-Visual Speech Recognition with Visual Corruption Modeling and Reliability Scoring. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023: 18783-18794
- [46] A robust visual feature extraction based BTSM-LDA for audio-visual speech recognition. 2007 SECOND INTERNATIONAL CONFERENCE IN COMMUNICATIONS AND NETWORKING IN CHINA, VOLS 1 AND 2, 2007: 1044-+
- [47] Fuzzy-Neural-Network Based Audio-Visual Fusion for Speech Recognition. 2019 1ST INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE IN INFORMATION AND COMMUNICATION (ICAIIC 2019), 2019: 210-214
- [50] Multipose Audio-Visual Speech Recognition. 19TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO-2011), 2011: 1065-1069