50 entries in total
- [1] MDETR - Modulated Detection for End-to-End Multi-Modal Understanding [J]. 2021 IEEE/CVF International Conference on Computer Vision (ICCV 2021), 2021: 1760-1770
- [3] Multi-Modal Data Augmentation for End-to-End ASR [J]. 19th Annual Conference of the International Speech Communication Association (INTERSPEECH 2018), Vols 1-6: Speech Research for Emerging Markets in Multilingual Societies, 2018: 2394-2398
- [4] End-to-end Knowledge Retrieval with Multi-modal Queries [J]. Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (ACL 2023): Long Papers, Vol 1, 2023: 8573-8589
- [5] End-to-end Multi-modal Video Temporal Grounding [J]. Advances in Neural Information Processing Systems 34 (NeurIPS 2021), 2021, 34
- [6] End-to-End Deep Multi-Modal Physiological Authentication With Smartbands [J]. IEEE Sensors Journal, 2021, 21 (13): 14977-14986
- [7] Multi-Modal Fusion Transformer for End-to-End Autonomous Driving [J]. 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2021, 2021: 7073-7083
- [9] MMBench: Benchmarking End-to-End Multi-modal DNNs and Understanding Their Hardware-Software Implications [J]. 2023 IEEE International Symposium on Workload Characterization, IISWC, 2023: 154-166
- [10] DeepVANet: A Deep End-to-End Network for Multi-modal Emotion Recognition [J]. Human-Computer Interaction, INTERACT 2021, Pt III, 2021, 12934: 227-237