Distilling Localization for Self-Supervised Representation Learning

Cited by: 0

Authors:

Zhao, Nanxuan [1]

Wu, Zhirong [2]

Lau, Rynson W. H. [1]

Lin, Stephen [2]

Affiliations:

[1] City Univ Hong Kong, Hong Kong, Peoples R China

[2] Microsoft Res Asia, Beijing, Peoples R China

Source:

THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE | 2021 / Vol. 35

Keywords:

DOI:

Not available

Chinese Library Classification number:

TP18 [Artificial Intelligence Theory];

Subject classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Recent progress in contrastive learning has revolutionized unsupervised representation learning. Concretely, multiple views (augmentations) of the same image are encouraged to map to similar embeddings, while views from different images are pushed apart. In this paper, through visualizing and diagnosing classification errors, we observe that current contrastive models are ineffective at localizing the foreground object, limiting their ability to extract discriminative high-level features. This is because the view generation process treats all pixels in an image uniformly. To address this problem, we propose a data-driven approach for learning invariance to backgrounds. It first estimates foreground saliency in images and then creates augmentations by copying and pasting the foreground onto a variety of backgrounds. The learning still follows the instance discrimination pretext task, so that the representation is trained to disregard background content and focus on the foreground. We study a variety of saliency estimation methods, and find that most methods lead to improvements for contrastive learning. With this approach (DiLo), significant performance is achieved for self-supervised learning on ImageNet classification, and also for object detection on PASCAL VOC and MSCOCO.
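
The copy-and-paste augmentation described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not specify the saliency estimator, the thresholding, or the blending, so the hard threshold, the function names `paste_foreground` and `make_views`, and all parameters below are assumptions chosen for illustration. The saliency map is taken as given.

```python
import numpy as np


def paste_foreground(image, saliency, background, threshold=0.5):
    """Composite the salient region of `image` onto `background`.

    image, background: float arrays of shape (H, W, 3).
    saliency: float array of shape (H, W) with values in [0, 1].
    threshold: assumed hard cut-off for foreground pixels (hypothetical).
    """
    if image.shape != background.shape or image.shape[:2] != saliency.shape:
        raise ValueError("image, background and saliency must agree in size")
    # Binary foreground mask, broadcast over the colour channels.
    mask = (saliency >= threshold).astype(image.dtype)[..., None]
    # Foreground pixels come from the image, the rest from the background.
    return mask * image + (1.0 - mask) * background


def make_views(image, saliency, backgrounds, rng, n_views=2):
    """Build `n_views` views that share a foreground but differ in background."""
    idx = rng.choice(len(backgrounds), size=n_views, replace=False)
    return [paste_foreground(image, saliency, backgrounds[i]) for i in idx]
```

Under these assumptions, the views returned by `make_views` would serve as positives for the same instance in a contrastive objective, so that only the shared foreground is consistent across them.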


Pages: 10990-10998

Number of pages: 9

