VTalk: A system for generating text-to-audio-visual speech

Cited by: 0
Authors
Kalra, Prem [1]
Kapoor, Ashish [1]
Kumar Goyal, Udit [1]
Affiliation
[1] Dept. of Computer Science and Eng., Indian Institute of Technology, New Delhi 110 016, India
DOI: not available
Pages: 307-314