共 50 条
- [31] Enhancing text summarization and audio generation using hybrid model ENGINEERING RESEARCH EXPRESS, 2025, 7 (01):
- [32] Mining Audio, Text and Visual Information for Talking Face Generation 2019 19TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM 2019), 2019, : 787 - 795
- [33] Open Domain Event Text Generation THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 7748 - 7755
- [34] AUDIO-BASED NONLINEAR VIDEO DIFFUSION 2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 2486 - 2489
- [35] Sound to Visual Scene Generation by Audio-to-Visual Latent Alignment 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, : 6430 - 6440
- [36] Audio Watermarking Based on Quantization in Wavelet Domain INFORMATION SYSTEMS SECURITY, PROCEEDINGS, 2008, 5352 : 235 - 242
- [38] Audio Dequantization for High Fidelity Audio Generation in Flow-based Neural Vocoder INTERSPEECH 2020, 2020, : 3545 - 3549
- [39] MUGEN: A Playground for Video-Audio-Text Multimodal Understanding and GENeration COMPUTER VISION, ECCV 2022, PT VIII, 2022, 13668 : 431 - 449
- [40] HMM-based audio keyword generation ADVANCES IN MULTIMEDIA INFORMATION PROCESSING - PCM 2004, PT 3, PROCEEDINGS, 2004, 3333 : 566 - 574