50 items in total
- [31] StrucTexT: Structured Text Understanding with Multi-Modal Transformers. PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021: 1912 - 1920
- [32] Image and Encoded Text Fusion for Multi-Modal Classification. 2018 INTERNATIONAL CONFERENCE ON DIGITAL IMAGE COMPUTING: TECHNIQUES AND APPLICATIONS (DICTA), 2018: 203 - 209
- [33] Multi-modal Visualization and Search for Text and Prosody Annotations. PROCEEDINGS OF THE 53RD ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 7TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (ACL-IJCNLP 2015): SYSTEM DEMONSTRATIONS, 2015: 25 - 30
- [34] VTLayout: A Multi-Modal Approach for Video Text Layout. PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023: 2775 - 2784
- [35] Image Sense Classification in Text-Based Image Retrieval. INFORMATION RETRIEVAL TECHNOLOGY, PROCEEDINGS, 2009, 5839: 124 - 135
- [36] External query reformulation for text-based image retrieval. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2011, 7024 LNCS: 249 - 260
- [37] External Query Reformulation for Text-Based Image Retrieval. STRING PROCESSING AND INFORMATION RETRIEVAL, 2011, 7024: 249 - 260
- [39] Imagic: Text-Based Real Image Editing with Diffusion Models. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023: 6007 - 6017
- [40] Multi-scale Multi-modal Dictionary BERT For Effective Text-image Retrieval in Multimedia Advertising. PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2022, 2022: 4655 - 4660