共 50 条
- [45] Human interaction categorization by using audio-visual cues Machine Vision and Applications, 2014, 25 : 71 - 84
- [46] An audio-visual speech recognition with a new mandarin audio-visual database INT CONF ON CYBERNETICS AND INFORMATION TECHNOLOGIES, SYSTEMS AND APPLICATIONS/INT CONF ON COMPUTING, COMMUNICATIONS AND CONTROL TECHNOLOGIES, VOL 1, 2007, : 19 - +
- [47] Auxiliary Loss Multimodal GRU Model in Audio-Visual Speech Recognition IEEE ACCESS, 2018, 6 : 5573 - 5583
- [48] Indonesian Audio-Visual Speech Corpus for Multimodal Automatic Speech Recognition 2017 INTERNATIONAL CONFERENCE ON ADVANCED COMPUTER SCIENCE AND INFORMATION SYSTEMS (ICACSIS), 2017, : 381 - 385
- [49] LUMINA: Linguistic unified multimodal Indonesian natural audio-visual dataset DATA IN BRIEF, 2024, 54
- [50] Non-invasive extraction of audio-visual cues for multimodal applications HYBRID IMAGE AND SIGNAL PROCESSING VI, 1998, 3389 : 133 - 138