Are You Smarter Than A Sixth Grader? Textbook Question Answering for Multimodal Machine Comprehension

Cited by: 125
Authors
Kembhavi, Aniruddha [1 ]
Seo, Minjoon [1 ,2 ]
Schwenk, Dustin [1 ]
Choi, Jonghyun [1 ]
Farhadi, Ali [1 ,2 ]
Hajishirzi, Hannaneh [2 ]
Affiliations
[1] Allen Inst Artificial Intelligence, Seattle, WA 98013 USA
[2] Univ Washington, Seattle, WA 98195 USA
Funding
National Science Foundation (USA)
DOI
10.1109/CVPR.2017.571
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
We introduce the task of Multi-Modal Machine Comprehension (M³C), which aims at answering multimodal questions given a context of text, diagrams and images. We present the Textbook Question Answering (TQA) dataset, which includes 1,076 lessons and 26,260 multimodal questions taken from middle school science curricula. Our analysis shows that a significant portion of the questions require complex parsing of the text and diagrams as well as reasoning, indicating that our dataset is more complex than previous machine comprehension and visual question answering datasets. We extend state-of-the-art methods for textual machine comprehension and visual question answering to the TQA dataset. Our experiments show that these models do not perform well on TQA. The presented dataset opens new challenges for research in question answering and reasoning across multiple modalities.
Pages: 5376-5384
Page count: 9