Visual text reader for virtual image communication on networks

被引：0

作者：

Yamada, A

Ohta, M

机构：

来源：

1997 IEEE FIRST WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING | 1997年

关键词：

D O I：

暂无

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

This paper presents a media conversion system from text to video, which can be used as a virtual image communication tool over narrow-band networks. The proposed system analyzes a plain text, such as e-mail, and generates a video sequence for a human's bust shot which includes actions and facial expressions related to the contents. The voice sounds are also generated using text-to-speech system. Both video and voice are synchronized, so that video-phone like communication is available without any additional information out of the original text.

引用

页码：495 / 500

页数：6

共 50 条

[41] Image Captioning with Text-Based Visual Attention
Chen He
Haifeng Hu
Neural Processing Letters, 2019, 49 : 177 - 185
[42] Visual Semantic Reasoning for Image-Text Matching
Li, Kunpeng
Zhang, Yulun
Li, Kai
Li, Yuanyuan
Fu, Yun
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 4653 - 4661
[43] Text and Image: A Critical Introduction to the Visual/Verbal Divide
Wen, Suijun
FUNCTIONS OF LANGUAGE, 2015, 22 (01) : 142 - 149
[44] Text and Image: A Critical Introduction to the Visual/Verbal Divide
Phung Tien Nguyen
VISUAL COMMUNICATION, 2016, 15 (03) : 393 - 397
[45] Text and image: a critical introduction to the visual/verbal divide
Hardy-Vallee, Michel
VISUAL STUDIES, 2016, 31 (04) : 366 - 368
[46] Shapley visual transformers for image-to-text generation
Belhadi, Asma
Djenouri, Youcef
Belbachir, Ahmed Nabil
Michalak, Tomasz
Srivastava, Gautam
APPLIED SOFT COMPUTING, 2024, 166
[47] Mobile Visual Search Using Image and Text Features
Tsai, Sam S.
Chen, Huizhong
Chen, David
Vedantham, Ramakrishna
Grzeszczuk, Radek
Girod, Bernd
2011 CONFERENCE RECORD OF THE FORTY-FIFTH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS & COMPUTERS (ASILOMAR), 2011, : 845 - 849
[48] Text, image and visual rhetoric in Egyptian funerary art
Lemos, Rennan
TOPOI-REVISTA DE HISTORIA, 2022, 23 (49): : 340 - 343
[49] Image Captioning with Text-Based Visual Attention
He, Chen
Hu, Haifeng
NEURAL PROCESSING LETTERS, 2019, 49 (01) : 177 - 185
[50] Visual Programming for Text-to-Image Generation and Evaluation
Cho, Jaemin
Zala, Abhay
Bansal, Mohit
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,

← 1 2 3 4 5 →