Visual Recognition by Learning from Web Data: A Weakly Supervised Domain Generalization Approach

被引：0

作者：

Niu, Li ^{[1
]}

Li, Wen ^{[1
]}

Xu, Dong ^{[1
]}

机构：

[1] Nanyang Technol Univ, Sch Comp Engn, Singapore, Singapore

来源：

2015 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) | 2015年

关键词：

EVENT RECOGNITION; ADAPTATION; KERNEL; IMAGES; VIDEOS;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In this work, we formulate a new weakly supervised domain generalization approach for visual recognition by using loosely labeled web images/videos as training data. Specifically, we aim to address two challenging issues when learning robust classifiers: 1) coping with noise in the labels of training web images/videos in the source domain; and 2) enhancing generalization capability of learnt classifiers to any unseen target domain. To address the first issue, we partition the training samples in each class into multiple clusters. By treating each cluster as a "bag" and the samples in each cluster as "instances", we formulate a multi-instance learning (MIL) problem by selecting a subset of training samples from each training bag and simultaneously learning the optimal classifiers based on the selected samples. To address the second issue, we assume the training web images/videos may come from multiple hidden domains with different data distributions. We then extend our MIL formulation to learn one classifier for each class and each latent domain such that multiple classifiers from each class can be effectively integrated to achieve better generalization capability. Extensive experiments on three benchmark datasets demonstrate the effectiveness of our new approach for visual recognition by learning from web data.

引用

页码：2774 / 2783

页数：10

共 50 条

[41] Unlearning From Weakly Supervised Learning
Tang, Yi
Gao, Yi
Luo, Yong-Gang
Yang, Ju-Cheng
Xu, Miao
Zhang, Min-Ling
PROCEEDINGS OF THE THIRTY-THIRD INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2024, 2024, : 5000 - 5008
[42] Seeing Through Darkness: Visual Localization at Night via Weakly Supervised Learning of Domain Invariant Features
Fan, Bin
Yang, Yuzhu
Feng, Wensen
Wu, Fuchao
Lu, Jiwen
Liu, Hongmin
IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 1713 - 1726
[43] Learning Facial Action Units from Web Images with Scalable Weakly Supervised Clustering
Zhao, Kaili
Chu, Wen-Sheng
Martinez, Aleix M.
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 2090 - 2099
[44] Relation Extraction from Chinese News Web Documents Based on Weakly Supervised Learning
Qiu, Jing
Liao, Lejian
Li, Peng
2009 INTERNATIONAL CONFERENCE ON INTELLIGENT NETWORKING AND COLLABORATIVE SYSTEMS (INCOS 2009), 2009, : 219 - 225
[45] Semi-Supervised Learning for Named Entity Recognition Using Weakly Labeled Training Data
Zafarian, Atefeh
Rokni, Ali
Khadivi, Shahram
Ghiasifard, Sonia
2015 INTERNATIONAL SYMPOSIUM ON ARTIFICIAL INTELLIGENCE AND SIGNAL PROCESSING (AISP), 2015, : 129 - 135
[46] Weakly Supervised Visual Dictionary Learning by Harnessing Image Attributes
Gao, Yue
Ji, Rongrong
Liu, Wei
Dai, Qionghai
Hua, Gang
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2014, 23 (12) : 5400 - 5411
[47] GearNet: Stepwise Dual Learning for Weakly Supervised Domain Adaptation
Xie, Renchunzi
Wei, Hongxin
Feng, Lei
An, Bo
THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 8717 - 8725
[48] Learning from the Web: Webly Supervised Meta-Learning for Masked Face Recognition
Zheng, Wenbo
Yan, Lan
Wang, Fei-Yue
Gou, Chao
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2021, 2021, : 4299 - 4308
[49] Weakly Supervised Deep Learning for the Detection of Domain Generation Algorithms
Yu, Bin
Pan, Jie
Gray, Daniel
Hu, Jiaming
Choudhary, Chhaya
Nascimento, Anderson C. A.
De Cock, Martine
IEEE ACCESS, 2019, 7 : 51542 - 51556
[50] Weakly-Supervised Learning of Visual Relations in Multimodal Pretraining
Bugliarello, Emanuele
Nematzadeh, Aida
Hendricks, Lisa Anne
2023 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING, EMNLP 2023, 2023, : 3052 - 3071

← 1 2 3 4 5 →