VTalk: A system for generating text-to-audio-visual speech

被引:0
|
作者
Kalra, Prem [1 ]
Kapoor, Ashish [1 ]
Kumar Goyal, Udit [1 ]
机构
[1] Dept. of Computer Science and Eng., Indian Institute of Technology, New Delhi 110 016, India
关键词
D O I
暂无
中图分类号
学科分类号
摘要
20
引用
收藏
页码:307 / 314
相关论文
共 50 条
  • [41] Enhancing Quality and Accuracy of Speech Recognition System by Using Multimodal Audio-Visual Speech signal
    El Maghraby, Eslam E.
    Gody, Amr M.
    Farouk, M. Hesham
    ICENCO 2016 - 2016 12TH INTERNATIONAL COMPUTER ENGINEERING CONFERENCE (ICENCO) - BOUNDLESS SMART SOCIETIES, 2016, : 219 - 229
  • [42] Remote control generating dynamic user interface corresponding to audio visual system configuration
    Nonaka, Takako
    Kimura, Shuya
    Hase, Tomohiro
    2008 DIGEST OF TECHNICAL PAPERS INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS, 2008, : 516 - 517
  • [43] MULTIMODAL SPEECH EMOTION RECOGNITION USING AUDIO AND TEXT
    Yoon, Seunghyun
    Byun, Seokhyun
    Jung, Kyomin
    2018 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY (SLT 2018), 2018, : 112 - 118
  • [44] Audio-visual integration for speech recognition
    Kober, R
    Harz, U
    NEUROLOGY PSYCHIATRY AND BRAIN RESEARCH, 1996, 4 (04) : 179 - 184
  • [45] Audio-Visual Speech Cue Combination
    Arnold, Derek H.
    Tear, Morgan
    Schindel, Ryan
    Roseboom, Warrick
    PLOS ONE, 2010, 5 (04):
  • [46] RAVSSNet: Recurrent Audio Visual Speech Separation
    Shankar, M. Chandan
    Nag, Hemanth
    Tripathi, Shikha
    INTELLIGENT SYSTEMS AND APPLICATIONS, VOL 2, 2023, 543 : 557 - 567
  • [47] Deep Audio-Visual Speech Recognition
    Afouras, Triantafyllos
    Chung, Joon Son
    Senior, Andrew
    Vinyals, Oriol
    Zisserman, Andrew
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (12) : 8717 - 8727
  • [48] MULTIPOSE AUDIO-VISUAL SPEECH RECOGNITION
    Estellers, Virginia
    Thiran, Jean-Philippe
    19TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO-2011), 2011, : 1065 - 1069
  • [49] Audio-visual speech perception is special
    Tuomainen, J
    Andersen, TS
    Tiippana, K
    Sams, M
    COGNITION, 2005, 96 (01) : B13 - B22
  • [50] Comparing audio and visual information for speech processing
    Dean, D
    Lucey, P
    Sridharan, S
    Wark, T
    ISSPA 2005: THE 8TH INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND ITS APPLICATIONS, VOLS 1 AND 2, PROCEEDINGS, 2005, : 58 - 61