共 44 条
- [21] Exploring Spatio-Temporal Representations by Integrating Attention-based Bidirectional-LSTM-RNNs and FCNs for Speech Emotion Recognition [J]. 19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 272 - 276
- [22] Audio2DiffuGesture: Generating a diverse co-speech gesture based on a diffusion model [J]. ELECTRONIC RESEARCH ARCHIVE, 2024, 32 (09): : 5392 - 5408
- [23] Audio-visual speech translation with automatic lip synchronization and face tracking based on 3-D read model [J]. 2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 2117 - 2120
- [24] FACE LANDMARK-BASED SPEAKER-INDEPENDENT AUDIO-VISUAL SPEECH ENHANCEMENT IN MULTI-TALKER ENVIRONMENTS [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 6900 - 6904
- [25] Audio-visual speech translation with automatic LIP synchronization and face tracking based on 3-D head model [J]. ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, 2002, 2
- [26] One-Shot Talking Face Generation from Single-Speaker Audio-Visual Correlation Learning [J]. THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 2531 - 2539
- [27] Predicting Group-Level Skin Attention to Short Movies from Audio-Based LSTM-Mixture of Experts Models [J]. INTERSPEECH 2019, 2019, : 61 - 65
- [29] An Attention-based Bidirectional LSTM Model for Continuous Cross-Subject Estimation of Knee Joint Angle during Running from sEMG Signals [J]. 2023 45TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE & BIOLOGY SOCIETY, EMBC, 2023,