共 50 条
- [32] Advancing Video Question Answering with a Multi-modal and Multi-layer Question Enhancement Network [J]. PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 3985 - 3993
- [35] Multi-modal Factorized Bilinear Pooling with Co-Attention Learning for Visual Question Answering [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 1839 - 1848
- [36] Answer-checking in Context: A Multi-modal Fully Attention Network for Visual Question Answering [J]. 2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 1173 - 1180
- [38] NuScenes-QA: A Multi-Modal Visual Question Answering Benchmark for Autonomous Driving Scenario [J]. THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 5, 2024, : 4542 - 4550
- [40] A Multi-scale and Multi-modal Transportation GIS for the City of Guangzhou [J]. INFORMATION FUSION AND GEOGRAPHIC INFORMATION SYSTEMS, PROCEEDINGS, 2009, : 95 - 111