Learning Unsupervised Visual Grounding Through Semantic Self-Supervision

Cited by: 0
Authors
Javed, Syed Ashar [1]
Saxena, Shreyas
Gandhi, Vineet [2]
Affiliations
[1] Carnegie Mellon Univ, Robot Inst, Pittsburgh, PA 15213 USA
[2] IIIT Hyderabad, CVIT, Kohli Ctr Intelligent Syst KCIS, Hyderabad, India
Keywords
DOI
Not available
Chinese Library Classification
TP18 [Artificial Intelligence Theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Localizing natural language phrases in images is a challenging problem that requires joint understanding of both the textual and visual modalities. In the unsupervised setting, the lack of supervisory signals exacerbates this difficulty. In this paper, we propose a novel framework for unsupervised visual grounding which uses concept learning as a proxy task to obtain self-supervision. The intuition behind this idea is to encourage the model to localize to regions which can explain some semantic property in the data; in our case, the property is the presence of a concept in a set of images. We present thorough quantitative and qualitative experiments to demonstrate the efficacy of our approach and show a 5.6% improvement over the current state of the art on the Visual Genome dataset, a 5.8% improvement on the ReferItGame dataset, and performance comparable to the state of the art on the Flickr30k dataset.
Pages: 796 - 802
Number of pages: 7