Learning Unsupervised Visual Grounding Through Semantic Self-Supervision

Cited by: 0

Authors:

Javed, Syed Ashar [1]

Saxena, Shreyas

Gandhi, Vineet [2]

Affiliations:

[1] Carnegie Mellon Univ, Robot Inst, Pittsburgh, PA 15213 USA

[2] IIIT Hyderabad, CVIT, Kohli Ctr Intelligent Syst KCIS, Hyderabad, India

Source:

PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE | 2019

Keywords:

DOI:

Not available

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Localizing natural language phrases in images is a challenging problem that requires joint understanding of both the textual and visual modalities. In the unsupervised setting, the lack of supervisory signals exacerbates this difficulty. In this paper, we propose a novel framework for unsupervised visual grounding which uses concept learning as a proxy task to obtain self-supervision. The intuition behind this idea is to encourage the model to localize to regions which can explain some semantic property in the data, in our case, the property being the presence of a concept in a set of images. We present thorough quantitative and qualitative experiments to demonstrate the efficacy of our approach and show a 5.6% improvement over the current state of the art on the Visual Genome dataset, a 5.8% improvement on the ReferItGame dataset, and performance comparable to the state of the art on the Flickr30k dataset.


Pages: 796-802

Page count: 7
