共 50 条
- [2] Show and Tell: A Neural Image Caption Generator [J]. 2015 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2015, : 3156 - 3164
- [3] A novel framework for automatic caption and audio generation [J]. MATERIALS TODAY-PROCEEDINGS, 2022, 65 : 3248 - 3252
- [4] Fast Caption Alignment for Automatic Indexing of Audio [J]. INTERNATIONAL JOURNAL OF MULTIMEDIA DATA ENGINEERING & MANAGEMENT, 2010, 1 (02): : 1 - 17
- [5] Listen to the data - They have a story to tell [J]. SYMPOSIUM ON ENVIRONMENTAL APPLICATIONS, 1996, : 15 - 21
- [6] Learn and Tell: Learning Priors for Image Caption Generation [J]. APPLIED SCIENCES-BASEL, 2020, 10 (19): : 1 - 17
- [7] Speech Evaluation Based on Deep Learning Audio Caption [J]. ADVANCES IN E-BUSINESS ENGINEERING FOR UBIQUITOUS COMPUTING, 2020, 41 : 51 - 66
- [8] CAN AUDIO CAPTIONS BE EVALUATED WITH IMAGE CAPTION METRICS? [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 981 - 985
- [10] LISTEN AND LEARN FROM NARRATIVES THAT TELL A STORY [J]. RELIGIOUS EDUCATION, 1990, 85 (04) : 617 - 630