共 50 条
- [1] Seeing voices and hearing voices: Learning discriminative embeddings using cross-modal self-supervision [J]. INTERSPEECH 2020, 2020, : 3486 - 3490
- [3] Disentangled Self-Supervision in Sequential Recommenders [J]. KDD '20: PROCEEDINGS OF THE 26TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2020, : 483 - 491
- [5] SymforNet: application of cross-modal information correspondences based on self-supervision in symbolic music generation [J]. Applied Intelligence, 2024, 54 : 4140 - 4152
- [7] Diachronic Cross-modal Embeddings [J]. PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA (MM'19), 2019, : 2061 - 2069
- [8] Unsupervised Graph Neural Architecture Search with Disentangled Self-supervision [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
- [9] Probabilistic Embeddings for Cross-Modal Retrieval [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 8411 - 8420
- [10] Face-Voice Matching using Cross-modal Embeddings [J]. PROCEEDINGS OF THE 2018 ACM MULTIMEDIA CONFERENCE (MM'18), 2018, : 1011 - 1019