共 50 条
- [31] Multimodal Speech Emotion Recognition using Cross Attention with Aligned Audio and Text INTERSPEECH 2020, 2020, : 2717 - 2721
- [32] Learning Relationships between Text, Audio, and Video via Deep Canonical Correlation for Multimodal Language Analysis THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 8992 - 8999
- [35] Improved Multimodal Sentiment Detection Using Stressed Regions of Audio PROCEEDINGS OF THE 2016 IEEE REGION 10 CONFERENCE (TENCON), 2016, : 2834 - 2837
- [36] Multimodal approach by embedding text and graphs for the detection of abusive messages TRAITEMENT AUTOMATIQUE DES LANGUES, 2021, 62 (02): : 13 - 38
- [37] A Robust Approach for Scene Text Detection and Tracking in Video ADVANCES IN MULTIMEDIA INFORMATION PROCESSING, PT III, 2018, 11166 : 303 - 314
- [38] An Adaptive Text Detection Approach in Images and Video Frames 2008 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-8, 2008, : 72 - 77
- [39] An Automatic Video Text Detection, Localization and Extraction Approach ADVANCED INTERNET BASED SYSTEMS AND APPLICATIONS, 2009, 4879 : 1 - 9