RESCUENET-VQA: A LARGE-SCALE VISUAL QUESTION ANSWERING BENCHMARK FOR DAMAGE ASSESSMENT

被引:0
|
作者
Sarkar, Argho [1 ]
Rahnemoonfar, Maryam [2 ]
机构
[1] Univ Maryland Baltimore Cty, Baltimore, MD USA
[2] Lehigh Univ, Bethlehem, PA 18015 USA
关键词
Visual Question Answering; Remote-Sensing; Damage Assessment; Natural Disaster; Multi-modal;
D O I
10.1109/IGARSS52108.2023.10281747
中图分类号
P [天文学、地球科学];
学科分类号
07 ;
摘要
In order to advance the research on AI-assisted efficient damage assessment during a natural disaster, we present in this study a large-scale visual question answering (VQA) dataset on remote sensing images, namely RescueNet-VQA. Visual question answering is the task of getting query-based scene information from images. The main advantage of this approach is that it can provide high-level scene information while interacting with users. For this merit, VQA has the potential to be considered in the decision-making processes for rapid response and recovery during any disaster. To conduct substantial research in this context, we present a novel VQA dataset for damage assessment on remote sensing imagery. Images in our dataset were collected after hurricane Michael. We have generated 1, 03, 192 image-question-answer triplets from 4, 375 images. This dataset is the only large-scale remote-sensed imagery-based visual question-answering dataset for damage assessment purposes. We have presented image collection and question generation procedures along with dataset statistics in this work.
引用
收藏
页码:1150 / 1153
页数:4
相关论文
共 50 条
  • [31] CFO: Conditional Focused Neural Question Answering with Large-scale Knowledge Bases
    Dai, Zihang
    Li, Lei
    Xu, Wei
    [J]. PROCEEDINGS OF THE 54TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 1, 2016, : 800 - 810
  • [32] An overview of the BIOASQ large-scale biomedical semantic indexing and question answering competition
    Tsatsaronis, George
    Balikas, Georgios
    Malakasiotis, Prodromos
    Partalas, Ioannis
    Zschunke, Matthias
    Alvers, Michael R.
    Weissenborn, Dirk
    Krithara, Anastasia
    Petridis, Sergios
    Polychronopoulos, Dimitris
    Almirantis, Yannis
    Pavlopoulos, John
    Baskiotis, Nicolas
    Gallinari, Patrick
    Artieres, Thierry
    Ngomo, Axel-Cyrille Ngonga
    Heino, Norman
    Gaussier, Eric
    Barrio-Alvers, Liliana
    Schroeder, Michael
    Androutsopoulos, Ion
    Paliouras, Georgios
    [J]. BMC BIOINFORMATICS, 2015, 16
  • [33] MATINF: A Jointly Labeled Large-Scale Dataset for Classification, Question Answering and Summarization
    Xu, Canwen
    Pei, Jiaxin
    Wu, Hongtao
    Liu, Yiyu
    Li, Chenliang
    [J]. 58TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2020), 2020, : 3586 - 3596
  • [34] An overview of the BIOASQ large-scale biomedical semantic indexing and question answering competition
    George Tsatsaronis
    Georgios Balikas
    Prodromos Malakasiotis
    Ioannis Partalas
    Matthias Zschunke
    Michael R Alvers
    Dirk Weissenborn
    Anastasia Krithara
    Sergios Petridis
    Dimitris Polychronopoulos
    Yannis Almirantis
    John Pavlopoulos
    Nicolas Baskiotis
    Patrick Gallinari
    Thierry Artiéres
    Axel-Cyrille Ngonga Ngomo
    Norman Heino
    Eric Gaussier
    Liliana Barrio-Alvers
    Michael Schroeder
    Ion Androutsopoulos
    Georgios Paliouras
    [J]. BMC Bioinformatics, 16
  • [35] Arabic Question Answering System for Information Retrieval on Large-scale Image Objects
    Al-Zubi, Sawsan
    Awaysheh, Feras M.
    Al-Shboul, Bashar Awad
    [J]. 2021 SECOND INTERNATIONAL CONFERENCE ON INTELLIGENT DATA SCIENCE TECHNOLOGIES AND APPLICATIONS (IDSTA), 2021, : 162 - 170
  • [36] A Large-Scale Homography Benchmark
    Barath, Daniel
    Mishkin, Dmytro
    Polic, Michal
    Forstner, Wolfgang
    Matas, Jiri
    [J]. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 21360 - 21370
  • [37] From image to language: A critical analysis of Visual Question Answering (VQA) approaches, challenges, and opportunities
    Ishmam, Md. Farhan
    Shovon, Md. Sakib Hossain
    Mridha, M. F.
    Dey, Nilanjan
    [J]. INFORMATION FUSION, 2024, 106
  • [38] ChartQA: A Benchmark for Question Answering about Charts with Visual and Logical Reasoning
    Masry, Ahmed
    Long, Do Xuan
    Tan, Jia Qing
    Joty, Shafiq
    Hogue, Enamul
    [J]. FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), 2022, : 2263 - 2279
  • [39] Regulating Balance Degree for More Reasonable Visual Question Answering Benchmark
    Lin, Ken
    Mao, Aihua
    Liu, Jiangfeng
    [J]. 2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
  • [40] A-OKVQA: A Benchmark for Visual Question Answering Using World Knowledge
    Schwenk, Dustin
    Khandelwal, Apoorv
    Clark, Christopher
    Marino, Kenneth
    Mottaghi, Roozbeh
    [J]. COMPUTER VISION, ECCV 2022, PT VIII, 2022, 13668 : 146 - 162