50 entries in total
- [2] Interpretable Visual Question Answering by Visual Grounding from Attention Supervision Mining [J]. 2019 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2019: 349-357
- [4] Countering Language Drift via Visual Grounding [J]. 2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019): PROCEEDINGS OF THE CONFERENCE, 2019: 4385-4395
- [5] A Visual Attention Grounding Neural Model for Multimodal Machine Translation [J]. 2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018), 2018: 3643-3653
- [6] Hierarchical cross-modal contextual attention network for visual grounding [J]. Multimedia Systems, 2023, 29: 2073-2083
- [7] Attention-Based Keyword Localisation in Speech using Visual Grounding [J]. INTERSPEECH 2021, 2021: 2991-2995
- [9] Diagram Visual Grounding: Learning to See with Gestalt-Perceptual Attention [J]. PROCEEDINGS OF THE THIRTY-SECOND INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2023, 2023: 837-845