共 50 条
- [2] TEXT2VIDEO: TEXT-DRIVEN TALKING-HEAD VIDEO SYNTHESIS WITH PERSONALIZED PHONEME - POSE DICTIONARY [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 2659 - 2663
- [3] Text2Video: Text-driven facial animation using MPEG-4 [J]. VISUAL COMMUNICATIONS AND IMAGE PROCESSING 2005, PTS 1-4, 2005, 5960 : 492 - 500
- [4] Automatic text segmentation and text recognition for video indexing [J]. Multimedia Systems, 2000, 8 : 69 - 81
- [6] Video Generation from Text [J]. THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 7065 - 7072
- [7] Spatial-Temporal Graphs for Cross-Modal Text2Video Retrieval [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 24 : 2914 - 2923
- [8] Automatic video text localization and recognition [J]. PROCEEDINGS OF THE FOURTH INTERNATIONAL CONFERENCE ON IMAGE AND GRAPHICS, 2007, : 484 - +
- [9] Automatic video text localization and recognition [J]. 2006 IEEE 14TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS, VOLS 1 AND 2, 2006, : 964 - 967