共 50 条
- [41] Cross-Modal Matching of Audio-Visual German and French Fluent Speech in Infancy PLOS ONE, 2014, 9 (02):
- [42] Cross-Modal Global Interaction and Local Alignment for Audio-Visual Speech Recognition PROCEEDINGS OF THE THIRTY-SECOND INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2023, 2023, : 5076 - 5084
- [44] Audio-to-Image Cross-Modal Generation 2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
- [45] Complete Cross-triplet Loss in Label Space for Audio-visual Cross-modal Retrieval 2022 IEEE INTERNATIONAL SYMPOSIUM ON MULTIMEDIA (ISM), 2022, : 1 - 9
- [46] Cross-Modal Denoising: A Novel Training Paradigm for Enhancing Speech-Image Retrieval INTERSPEECH 2024, 2024, : 4064 - 4068
- [48] Cross-Modal Remote Sensing Image-Audio Retrieval With Adaptive Learning for Aligning Correlation IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62
- [49] Improving Audio-Text Retrieval via Hierarchical Cross-Modal Interaction and Auxiliary Captions INTERSPEECH 2023, 2023, : 341 - 345