Visual Recognition by Learning from Web Data: A Weakly Supervised Domain Generalization Approach

被引：0

作者：

Niu, Li ^{[1
]}

Li, Wen ^{[1
]}

Xu, Dong ^{[1
]}

机构：

[1] Nanyang Technol Univ, Sch Comp Engn, Singapore, Singapore

来源：

2015 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) | 2015年

关键词：

EVENT RECOGNITION; ADAPTATION; KERNEL; IMAGES; VIDEOS;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In this work, we formulate a new weakly supervised domain generalization approach for visual recognition by using loosely labeled web images/videos as training data. Specifically, we aim to address two challenging issues when learning robust classifiers: 1) coping with noise in the labels of training web images/videos in the source domain; and 2) enhancing generalization capability of learnt classifiers to any unseen target domain. To address the first issue, we partition the training samples in each class into multiple clusters. By treating each cluster as a "bag" and the samples in each cluster as "instances", we formulate a multi-instance learning (MIL) problem by selecting a subset of training samples from each training bag and simultaneously learning the optimal classifiers based on the selected samples. To address the second issue, we assume the training web images/videos may come from multiple hidden domains with different data distributions. We then extend our MIL formulation to learn one classifier for each class and each latent domain such that multiple classifiers from each class can be effectively integrated to achieve better generalization capability. Extensive experiments on three benchmark datasets demonstrate the effectiveness of our new approach for visual recognition by learning from web data.

引用

页码：2774 / 2783

页数：10

共 50 条

[1] Visual Recognition by Learning From Web Data via Weakly Supervised Domain Generalization
Niu, Li
Li, Wen
Xu, Dong
Cai, Jianfei
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2017, 28 (09) : 1985 - 1999
[2] Weakly-Supervised Cross-Domain Dictionary Learning for Visual Recognition
Zhu, Fan
Shao, Ling
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2014, 109 (1-2) : 42 - 59
[3] Weakly-Supervised Cross-Domain Dictionary Learning for Visual Recognition
Fan Zhu
Ling Shao
International Journal of Computer Vision, 2014, 109 : 42 - 59
[4] Learning Visual Features from Large Weakly Supervised Data
Joulin, Armand
van der Maaten, Laurens
Jabri, Allan
Vasilache, Nicolas
COMPUTER VISION - ECCV 2016, PT VII, 2016, 9911 : 67 - 84
[5] Learning Signs from Subtitles: A Weakly Supervised Approach to Sign Language Recognition
Cooper, Helen
Bowden, Richard
CVPR: 2009 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOLS 1-4, 2009, : 2560 - 2566
[6] Attend in groups: a weakly-supervised deep learning framework for learning from web data
Zhuang, Bohan
Liu, Lingqiao
Li, Yao
Shen, Chunhua
Reid, Ian
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 2915 - 2924
[7] Weakly Supervised Scale-Invariant Learning of Models for Visual Recognition
R. Fergus
P. Perona
A. Zisserman
International Journal of Computer Vision, 2007, 71 : 273 - 303
[8] Weakly supervised scale-invariant learning of models for visual recognition
Fergus, R.
Perona, P.
Zisserman, A.
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2007, 71 (03) : 273 - 303
[9] Visual Event Recognition in Videos by Learning from Web Data
Duan, Lixin
Xu, Dong
Tsang, Ivor Wai-Hung
Luo, Jiebo
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2012, 34 (09) : 1667 - 1680
[10] Visual Event Recognition in Videos by Learning from Web Data
Duan, Lixin
Xu, Dong
Tsang, Ivor W.
Luo, Jiebo
2010 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2010, : 1959 - 1966

← 1 2 3 4 5 →