VTalk: A system for generating text-to-audio-visual speech

被引：0

作者：

Kalra, Prem ^{[1
]}

Kapoor, Ashish ^{[1
]}

Kumar Goyal, Udit ^{[1
]}

机构：

[1] Dept. of Computer Science and Eng., Indian Institute of Technology, New Delhi 110 016, India

来源：

IETE Technical Review (Institution of Electronics and Telecommunication Engineers, India) | 2001年 / 18卷 / 04期

关键词：

D O I：

暂无

中图分类号：

学科分类号：

摘要：

引用

页码：307 / 314

共 50 条

[41] Enhancing Quality and Accuracy of Speech Recognition System by Using Multimodal Audio-Visual Speech signal
El Maghraby, Eslam E.
Gody, Amr M.
Farouk, M. Hesham
ICENCO 2016 - 2016 12TH INTERNATIONAL COMPUTER ENGINEERING CONFERENCE (ICENCO) - BOUNDLESS SMART SOCIETIES, 2016, : 219 - 229
[42] Remote control generating dynamic user interface corresponding to audio visual system configuration
Nonaka, Takako
Kimura, Shuya
Hase, Tomohiro
2008 DIGEST OF TECHNICAL PAPERS INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS, 2008, : 516 - 517
[43] MULTIMODAL SPEECH EMOTION RECOGNITION USING AUDIO AND TEXT
Yoon, Seunghyun
Byun, Seokhyun
Jung, Kyomin
2018 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY (SLT 2018), 2018, : 112 - 118
[44] Audio-visual integration for speech recognition
Kober, R
Harz, U
NEUROLOGY PSYCHIATRY AND BRAIN RESEARCH, 1996, 4 (04) : 179 - 184
[45] Audio-Visual Speech Cue Combination
Arnold, Derek H.
Tear, Morgan
Schindel, Ryan
Roseboom, Warrick
PLOS ONE, 2010, 5 (04):
[46] RAVSSNet: Recurrent Audio Visual Speech Separation
Shankar, M. Chandan
Nag, Hemanth
Tripathi, Shikha
INTELLIGENT SYSTEMS AND APPLICATIONS, VOL 2, 2023, 543 : 557 - 567
[47] Deep Audio-Visual Speech Recognition
Afouras, Triantafyllos
Chung, Joon Son
Senior, Andrew
Vinyals, Oriol
Zisserman, Andrew
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (12) : 8717 - 8727
[48] MULTIPOSE AUDIO-VISUAL SPEECH RECOGNITION
Estellers, Virginia
Thiran, Jean-Philippe
19TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO-2011), 2011, : 1065 - 1069
[49] Audio-visual speech perception is special
Tuomainen, J
Andersen, TS
Tiippana, K
Sams, M
COGNITION, 2005, 96 (01) : B13 - B22
[50] Comparing audio and visual information for speech processing
Dean, D
Lucey, P
Sridharan, S
Wark, T
ISSPA 2005: THE 8TH INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND ITS APPLICATIONS, VOLS 1 AND 2, PROCEEDINGS, 2005, : 58 - 61

← 1 2 3 4 5 →