共 50 条
- [22] Multi-level Alignment Network for Domain Adaptive Cross-modal Retrieval [J]. NEUROCOMPUTING, 2021, 440 : 207 - 219
- [23] Cross-modal collaborative representation and multi-level supervision for crowd counting [J]. Signal, Image and Video Processing, 2023, 17 : 601 - 608
- [25] Semi-supervised Multi-modal Emotion Recognition with Cross-Modal Distribution Matching [J]. MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, : 2852 - 2861
- [26] Low-Order Multi-Level Features for Speech Emotion Recognition [J]. BALTIC JOURNAL OF MODERN COMPUTING, 2015, 3 (04): : 234 - 247
- [27] Speech to Text Adaptation: Towards an Efficient Cross-Modal Distillation [J]. INTERSPEECH 2020, 2020, : 896 - 900
- [28] Image Emotion Recognition via Fusion Multi-Level Representations [J]. Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics, 2023, 35 (10): : 1566 - 1576
- [30] Speech2Video: Cross-Modal Distillation for Speech to Video Generation [J]. INTERSPEECH 2021, 2021, : 1629 - 1633