Visual Recognition by Learning from Web Data: A Weakly Supervised Domain Generalization Approach

被引：0

作者：

Niu, Li ^{[1
]}

Li, Wen ^{[1
]}

Xu, Dong ^{[1
]}

机构：

[1] Nanyang Technol Univ, Sch Comp Engn, Singapore, Singapore

来源：

2015 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) | 2015年

关键词：

EVENT RECOGNITION; ADAPTATION; KERNEL; IMAGES; VIDEOS;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In this work, we formulate a new weakly supervised domain generalization approach for visual recognition by using loosely labeled web images/videos as training data. Specifically, we aim to address two challenging issues when learning robust classifiers: 1) coping with noise in the labels of training web images/videos in the source domain; and 2) enhancing generalization capability of learnt classifiers to any unseen target domain. To address the first issue, we partition the training samples in each class into multiple clusters. By treating each cluster as a "bag" and the samples in each cluster as "instances", we formulate a multi-instance learning (MIL) problem by selecting a subset of training samples from each training bag and simultaneously learning the optimal classifiers based on the selected samples. To address the second issue, we assume the training web images/videos may come from multiple hidden domains with different data distributions. We then extend our MIL formulation to learn one classifier for each class and each latent domain such that multiple classifiers from each class can be effectively integrated to achieve better generalization capability. Extensive experiments on three benchmark datasets demonstrate the effectiveness of our new approach for visual recognition by learning from web data.

引用

页码：2774 / 2783

页数：10

共 50 条

[31] Weakly Supervised Learning: Application to Fish School Recognition
Lefort, Riwal
Fablet, Ronan
Boucher, Jean-Marc
NEW ADVANCES IN INTELLIGENT SIGNAL PROCESSING, 2011, 372 : 203 - +
[32] Weakly-Supervised Recognition, Localization, and Explanation of Visual Entities
Mettes, Pascal
MM'16: PROCEEDINGS OF THE 2016 ACM MULTIMEDIA CONFERENCE, 2016, : 1459 - 1463
[33] Cross-domain structure learning for visual data recognition
Lu, Yuwu
Luo, Xingping
Wen, Jiajun
Lai, Zhihui
Li, Xuelong
PATTERN RECOGNITION, 2022, 134
[34] Learning Domain-specific Semantic Representation from Weakly Supervised Data to Improve Research Dataset Retrieval
Luo P.
Hong L.
Wang J.
Wang S.
Guo X.
Gao Z.
Cho S.W.
Proceedings of the Association for Information Science and Technology, 2022, 59 (01) : 205 - 214
[35] Visual tracking via weakly supervised learning from multiple imperfect oracles
Zhong, Bineng
Yao, Hongxun
Chen, Sheng
Ji, Rongrong
Chin, Tat-Jun
Wang, Hanzi
PATTERN RECOGNITION, 2014, 47 (03) : 1395 - 1410
[36] Visual Tracking via Weakly Supervised Learning from Multiple Imperfect Oracles
Zhong, Bineng
Yao, Hongxun
Chen, Sheng
Ji, Rongrong
Yuan, Xiaotong
Liu, Shaohui
Gao, Wen
2010 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2010, : 1323 - 1330
[37] Learning from the Web: Language Drives Weakly-Supervised Incremental Learning for Semantic Segmentation
Liu, Chang
Rizzoli, Giulia
Zanuttigh, Pietro
Li, Fu
Niu, Yi
COMPUTER VISION - ECCV 2024, PT XVII, 2025, 15075 : 352 - 369
[38] Multi-view Domain Generalization for Visual Recognition
Niu, Li
Li, Wen
Xu, Dong
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 4193 - 4201
[39] Weakly supervised learning of biomedical information extraction from curated data
Jain, Suvir
Kashyap, R.
Kuo, Tsung-Ting
Bhargava, Shitij
Lin, Gordon
Hsu, Chun-Nan
BMC BIOINFORMATICS, 2016, 17
[40] Weakly supervised learning of biomedical information extraction from curated data
Suvir Jain
Kashyap R.
Tsung-Ting Kuo
Shitij Bhargava
Gordon Lin
Chun-Nan Hsu
BMC Bioinformatics, 17

← 1 2 3 4 5 →