Visual text reader for virtual image communication on networks

被引:0
|
作者
Yamada, A
Ohta, M
机构
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper presents a media conversion system from text to video, which can be used as a virtual image communication tool over narrow-band networks. The proposed system analyzes a plain text, such as e-mail, and generates a video sequence for a human's bust shot which includes actions and facial expressions related to the contents. The voice sounds are also generated using text-to-speech system. Both video and voice are synchronized, so that video-phone like communication is available without any additional information out of the original text.
引用
收藏
页码:495 / 500
页数:6
相关论文
共 50 条
  • [41] Image Captioning with Text-Based Visual Attention
    Chen He
    Haifeng Hu
    Neural Processing Letters, 2019, 49 : 177 - 185
  • [42] Visual Semantic Reasoning for Image-Text Matching
    Li, Kunpeng
    Zhang, Yulun
    Li, Kai
    Li, Yuanyuan
    Fu, Yun
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 4653 - 4661
  • [43] Text and Image: A Critical Introduction to the Visual/Verbal Divide
    Wen, Suijun
    FUNCTIONS OF LANGUAGE, 2015, 22 (01) : 142 - 149
  • [44] Text and Image: A Critical Introduction to the Visual/Verbal Divide
    Phung Tien Nguyen
    VISUAL COMMUNICATION, 2016, 15 (03) : 393 - 397
  • [45] Text and image: a critical introduction to the visual/verbal divide
    Hardy-Vallee, Michel
    VISUAL STUDIES, 2016, 31 (04) : 366 - 368
  • [46] Shapley visual transformers for image-to-text generation
    Belhadi, Asma
    Djenouri, Youcef
    Belbachir, Ahmed Nabil
    Michalak, Tomasz
    Srivastava, Gautam
    APPLIED SOFT COMPUTING, 2024, 166
  • [47] Mobile Visual Search Using Image and Text Features
    Tsai, Sam S.
    Chen, Huizhong
    Chen, David
    Vedantham, Ramakrishna
    Grzeszczuk, Radek
    Girod, Bernd
    2011 CONFERENCE RECORD OF THE FORTY-FIFTH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS & COMPUTERS (ASILOMAR), 2011, : 845 - 849
  • [48] Text, image and visual rhetoric in Egyptian funerary art
    Lemos, Rennan
    TOPOI-REVISTA DE HISTORIA, 2022, 23 (49): : 340 - 343
  • [49] Image Captioning with Text-Based Visual Attention
    He, Chen
    Hu, Haifeng
    NEURAL PROCESSING LETTERS, 2019, 49 (01) : 177 - 185
  • [50] Visual Programming for Text-to-Image Generation and Evaluation
    Cho, Jaemin
    Zala, Abhay
    Bansal, Mohit
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,