共 3 条
- [1] Multihead Attention-based Audio Image Generation with Cross-Modal Shared Weight Classifier 2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN, 2023,
- [2] Attention-based cross-modal fusion for audio-visual voice activity detection in musical video streams INTERSPEECH 2021, 2021, : 321 - 325