共 50 条
- [1] AUTOMATED AUDIO CAPTIONING USING TRANSFER LEARNING AND RECONSTRUCTION LATENT SPACE SIMILARITY REGULARIZATION [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 7722 - 7726
- [2] Feature-informed Embedding Space Regularization For Audio Classification [J]. 2022 30TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2022), 2022, : 419 - 423
- [4] Generating Accurate Caption Units for Figure Captioning [J]. PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE 2021 (WWW 2021), 2021, : 2792 - 2804
- [5] AUDIO CAPTION: LISTEN AND TELL [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 830 - 834
- [6] BEYOND CAPTION TO NARRATIVE: VIDEO CAPTIONING WITH MULTIPLE SENTENCES [J]. 2016 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2016, : 3364 - 3368
- [8] Caption TLSTMs: combining transformer with LSTMs for image captioning [J]. International Journal of Multimedia Information Retrieval, 2022, 11 : 111 - 121
- [9] CLOTHO: AN AUDIO CAPTIONING DATASET [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 736 - 740
- [10] Audio Captioning Based on Combined Audio and Semantic Embeddings [J]. 2020 IEEE INTERNATIONAL SYMPOSIUM ON MULTIMEDIA (ISM 2020), 2020, : 41 - 48