Learning Unsupervised Visual Grounding Through Semantic Self-Supervision

Cited by: 0
Authors
Javed, Syed Ashar [1]
Saxena, Shreyas
Gandhi, Vineet [2]
Affiliations
[1] Carnegie Mellon Univ, Robot Inst, Pittsburgh, PA 15213 USA
[2] IIIT Hyderabad, CVIT, Kohli Ctr Intelligent Syst KCIS, Hyderabad, India
Keywords
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Localizing natural language phrases in images is a challenging problem that requires joint understanding of both the textual and visual modalities. In the unsupervised setting, the lack of supervisory signals exacerbates this difficulty. In this paper, we propose a novel framework for unsupervised visual grounding which uses concept learning as a proxy task to obtain self-supervision. The intuition behind this idea is to encourage the model to localize to regions which can explain some semantic property in the data; in our case, the property is the presence of a concept in a set of images. We present thorough quantitative and qualitative experiments to demonstrate the efficacy of our approach and show a 5.6% improvement over the current state of the art on the Visual Genome dataset, a 5.8% improvement on the ReferItGame dataset, and performance comparable to the state of the art on the Flickr30k dataset.
Pages: 796-802
Page count: 7