Characterizing Datasets for Social Visual Question Answering, and the New TinySocial Dataset

被引:0
|
作者
Chen, Zhanwen [1 ]
Li, Shiyao [1 ]
Rashedi, Roxanne [1 ]
Zi, Xiaoman [1 ]
Elrod-Erickson, Morgan [2 ]
Hollis, Bryan [2 ]
Maliakal, Angela [1 ]
Shen, Xinyu [1 ]
Zhao, Simeng [1 ]
Kunda, Maithilee [1 ]
机构
[1] Vanderbilt Univ, Dept Elect Engn & Comp Sci, Creat Writing Program, 221 Kirkland Hall, Nashville, TN 37235 USA
[2] Vanderbilt Univ, Dept English, Creat Writing Program, 221 Kirkland Hall, Nashville, TN 37235 USA
关键词
MEDIATION;
D O I
10.1109/icdl-epirob48136.2020.9278057
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Modern social intelligence includes the ability to watch videos and answer questions about social and theory-of-mind-related content, e.g., for a scene in Harry Potter, "Is the father really upset about the boys flying the car?" Social visual question answering (social VQA) is emerging as a valuable methodology for studying social reasoning in both humans (e.g., children with autism) and AI agents. However, this problem space spans enormous variations in both videos and questions. We discuss methods for creating and characterizing social VQA datasets, including 1) crowdsourcing versus in-house authoring, including sample comparisons of two new datasets that we created (TinySocial-Crowd and TinySocial-InHouse) and the previously existing Social-IQ dataset; 2) a new rubric for characterizing the difficulty and content of a given video; and 3) a new rubric for characterizing question types. We close by describing how having well-characterized social VQA datasets will enhance the explainability of AI agents and can also inform assessments and educational interventions for people.
引用
收藏
页数:6
相关论文
共 50 条
  • [1] Visual question answering: A survey of methods and datasets
    Wu, Qi
    Teney, Damien
    Wang, Peng
    Shen, Chunhua
    Dick, Anthony
    van den Hengel, Anton
    [J]. COMPUTER VISION AND IMAGE UNDERSTANDING, 2017, 163 : 21 - 40
  • [2] Visual question answering: Datasets, algorithms, and future challenges
    Kafle, Kushal
    Kanan, Christopher
    [J]. COMPUTER VISION AND IMAGE UNDERSTANDING, 2017, 163 : 3 - 20
  • [3] Dataset bias: A case study for visual question answering
    Das, Anubrata
    Anjum, Samreen
    Gurari, Danna
    [J]. Proceedings of the Association for Information Science and Technology, 2019, 56 (01): : 58 - 67
  • [4] Improvisation of Dataset Efficiency in Visual Question Answering Domain
    Mohamed, Sheerin Sitara Noor
    Srinivasan, Kavitha
    [J]. STATISTICS AND APPLICATIONS, 2022, 20 (02): : 279 - 289
  • [5] OVQA: A Clinically Generated Visual Question Answering Dataset
    Huang, Yefan
    Wang, Xiaoli
    Liu, Feiyan
    Huang, Guofeng
    [J]. PROCEEDINGS OF THE 45TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '22), 2022, : 2924 - 2938
  • [6] A Large Visual Question Answering Dataset for Cultural Heritage
    Asprino, Luigi
    Bulla, Luana
    Marinucci, Ludovica
    Mongiovi, Misael
    Presutti, Valentina
    [J]. MACHINE LEARNING, OPTIMIZATION, AND DATA SCIENCE (LOD 2021), PT II, 2022, 13164 : 193 - 197
  • [7] Cross-Dataset Adaptation for Visual Question Answering
    Chao, Wei-Lun
    Hu, Hexiang
    Sha, Fei
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 5716 - 5725
  • [8] Robust Visual Question Answering: Datasets, Methods, and Future Challenges
    Ma, Jie
    Wang, Pinghui
    Kong, Dechen
    Wang, Zewei
    Liu, Jun
    Pei, Hongbin
    Zhao, Junzhou
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (08) : 5575 - 5594
  • [9] A survey of methods, datasets and evaluation metrics for visual question answering
    Sharma, Himanshu
    Jalal, Anand Singh
    [J]. IMAGE AND VISION COMPUTING, 2021, 116
  • [10] TwEETQA: A Social Media Focused Question Answering Dataset
    Xiong, Wenhan
    Wu, Jiawei
    Wang, Hong
    Kulkarni, Vivek
    Yu, Mo
    Chang, Shiyu
    Guo, Xiaoxiao
    Wang, William Yang
    [J]. 57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), 2019, : 5020 - 5031