50 items in total
- [21] COCA: COllaborative CAusal Regularization for Audio-Visual Question Answering THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 11, 2023, : 12995 - 13003
- [22] Visual Question Answering 2024 INTERNATIONAL CONFERENCE ON COMPUTING, NETWORKING AND COMMUNICATIONS, ICNC, 2024, : 6 - 10
- [23] Transductive Cross-Lingual Scene-Text Visual Question Answering NEURAL INFORMATION PROCESSING, ICONIP 2023, PT VI, 2024, 14452 : 452 - 467
- [24] Graphhopper: Multi-hop Scene Graph Reasoning for Visual Question Answering SEMANTIC WEB - ISWC 2021, 2021, 12922 : 111 - 127
- [27] Knowledge enhancement and scene understanding for knowledge-based visual question answering Knowledge and Information Systems, 2024, 66 : 2193 - 2208
- [29] Question Modifiers in Visual Question Answering LREC 2022: THIRTEENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2022, : 1472 - 1479
- [30] Language-Guided Visual Aggregation Network for Video Question Answering PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 5195 - 5203