VTalk: A system for generating text-to-audio-visual speech

被引:0
|
作者
Kalra, Prem [1 ]
Kapoor, Ashish [1 ]
Kumar Goyal, Udit [1 ]
机构
[1] Dept. of Computer Science and Eng., Indian Institute of Technology, New Delhi 110 016, India
关键词
D O I
暂无
中图分类号
学科分类号
摘要
20
引用
收藏
页码:307 / 314
相关论文
共 50 条
  • [21] Robust front-end for audio, visual and audio–visual speech classification
    Terissi L.D.
    Sad G.D.
    Gómez J.C.
    International Journal of Speech Technology, 2018, 21 (2) : 293 - 307
  • [22] Audio-visual speech recognition based on joint training with audio-visual speech enhancement for robust speech recognition
    Hwang, Jung-Wook
    Park, Jeongkyun
    Park, Rae-Hong
    Park, Hyung-Min
    APPLIED ACOUSTICS, 2023, 211
  • [23] Audio-Visual Speech Codecs: Rethinking Audio-Visual Speech Enhancement by Re-Synthesis
    Yang, Karren
    Markovic, Dejan
    Krenn, Steven
    Agrawal, Vasu
    Richard, Alexander
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 8217 - 8227
  • [24] Effects of aging on audio-visual speech integration Effects of aging on audio-visual speech integration
    Huyse, Aurelie
    Leybaert, Jacqueline
    Berthommier, Frederic
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2014, 136 (04): : 1918 - 1931
  • [25] From audio-only to audio and video Text-to-Speech
    Cosatto, E
    Graf, HP
    Ostermann, J
    Schroeter, J
    ACTA ACUSTICA UNITED WITH ACUSTICA, 2004, 90 (06) : 1084 - 1095
  • [26] Audio-Visual Speech Recognition in Noisy Audio Environments
    Palecek, Karel
    Chaloupka, Josef
    2013 36TH INTERNATIONAL CONFERENCE ON TELECOMMUNICATIONS AND SIGNAL PROCESSING (TSP), 2013, : 484 - 487
  • [27] An Audio-visual 3D Virtual Articulation System for Visual Speech Synthesis
    Li, Rui
    Yu, Jun
    2017 15TH IEEE INTERNATIONAL SYMPOSIUM ON HAPTIC, AUDIO AND VISUAL ENVIRONMENTS AND GAMES (HAVE), 2017, : 25 - 30
  • [28] An audio-visual speech recognition with a new mandarin audio-visual database
    Liao, Wen-Yuan
    Pao, Tsang-Long
    Chen, Yu-Te
    Chang, Tsun-Wei
    INT CONF ON CYBERNETICS AND INFORMATION TECHNOLOGIES, SYSTEMS AND APPLICATIONS/INT CONF ON COMPUTING, COMMUNICATIONS AND CONTROL TECHNOLOGIES, VOL 1, 2007, : 19 - +
  • [29] Audio Visual Integration with Competing Sources in the Framework of Audio Visual Speech Scene Analysis
    Ganesh, Attigodu Chandrashekara
    Berthommier, Frederic
    Schwartz, Jean-Luc
    PHYSIOLOGY, PSYCHOACOUSTICS AND COGNITION IN NORMAL AND IMPAIRED HEARING, 2016, 894 : 399 - 408
  • [30] Separation of audio-visual speech sources: A new approach exploiting the audio-visual coherence of speech stimuli
    Sodoyer, D
    Schwartz, JL
    Girin, L
    Klinkisch, J
    Jutten, C
    EURASIP JOURNAL ON APPLIED SIGNAL PROCESSING, 2002, 2002 (11) : 1165 - 1173