Transfer Learning via Unsupervised Task Discovery for Visual Question Answering

被引:10
|
作者
Noh, Hyeonwoo [1 ,3 ]
Kim, Taehoon [2 ,4 ]
Mun, Jonghwan [1 ,3 ]
Han, Bohyung [3 ]
机构
[1] POSTECH, Comp Vis Lab, Pohang, South Korea
[2] OpenAI, San Francisco, CA USA
[3] Seoul Natl Univ, Comp Vis Lab, ECE & ASRI, Seoul, South Korea
[4] Devsisters, Seoul, South Korea
关键词
D O I
10.1109/CVPR.2019.00858
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We study how to leverage off-the-shelf visual and linguistic data to cope with out-of-vocabulary answers in visual question answering task. Existing large-scale visual datasets with annotations such as image class labels, bounding boxes and region descriptions are good sources for learning rich and diverse visual concepts. However, it is not straightforward how the visual concepts can be captured and transferred to visual question answering models due to missing link between question dependent answering models and visual data without question. We tackle this problem in two steps: 1) learning a task conditional visual classifier, which is capable of solving diverse question-specific visual recognition tasks, based on unsupervised task discovery and 2) transferring the task conditional visual classifier to visual question answering models. Specifically, we employ linguistic knowledge sources such as structured lexical database (e.g. WordNet) and visual descriptions for unsupervised task discovery, and transfer a learned task conditional visual classifier as an answering unit in a visual question answering model. We empirically show that the proposed algorithm generalizes to out-of-vocabulary answers successfully using the knowledge transferred from the visual dataset.
引用
收藏
页码:8377 / 8386
页数:10
相关论文
共 50 条
  • [1] Visual Question Answering as a Meta Learning Task
    Teney, Damien
    van den Hengel, Anton
    [J]. COMPUTER VISION - ECCV 2018, PT 15, 2018, 11219 : 229 - 245
  • [2] Visual Question Generation as Dual Task of Visual Question Answering
    Li, Yikang
    Duan, Nan
    Zhou, Bolei
    Chu, Xiao
    Ouyang, Wanli
    Wang, Xiaogang
    Zhou, Ming
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 6116 - 6124
  • [3] Toward Unsupervised Realistic Visual Question Answering
    Zhang, Yuwei
    Ho, Chih-Hui
    Vasconcelos, Nuno
    [J]. 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 15567 - 15578
  • [4] Parameter-Efficient Transfer Learning for Medical Visual Question Answering
    Liu, Jiaxiang
    Hu, Tianxiang
    Zhang, Yan
    Feng, Yang
    Hao, Jin
    Lv, Junhui
    Liu, Zuozhu
    [J]. IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2024, 8 (04): : 2816 - 2826
  • [5] Multitask Learning for Visual Question Answering
    Ma, Jie
    Liu, Jun
    Lin, Qika
    Wu, Bei
    Wang, Yaxian
    You, Yang
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (03) : 1380 - 1394
  • [6] Medical Visual Question Answering via Conditional Reasoning and Contrastive Learning
    Liu, Bo
    Zhan, Li-Ming
    Xu, Li
    Wu, Xiao-Ming
    [J]. IEEE TRANSACTIONS ON MEDICAL IMAGING, 2023, 42 (05) : 1532 - 1545
  • [7] Multi-Question Learning for Visual Question Answering
    Lei, Chenyi
    Wu, Lei
    Liu, Dong
    Li, Zhao
    Wang, Guoxin
    Tang, Haihong
    Li, Houqiang
    [J]. THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 11328 - 11335
  • [8] Complex Question Answering: Unsupervised Learning Approaches and Experiments
    Chali, Yllias
    Joty, Shafiq R.
    Hasan, Sadid A.
    [J]. JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2009, 35 : 1 - 47
  • [9] Explicit Bias Discovery in Visual Question Answering Models
    Manjunatha, Varun
    Saini, Nirat
    Davis, Larry S.
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 9554 - 9563
  • [10] Enhancement of Question Answering System Accuracy via Transfer Learning and BERT
    Duan, Kai
    Du, Shiyu
    Zhang, Yiming
    Lin, Yanru
    Wu, Hongzhuo
    Zhang, Quan
    [J]. APPLIED SCIENCES-BASEL, 2022, 12 (22):