Visual Recognition by Learning from Web Data: A Weakly Supervised Domain Generalization Approach

被引:0
|
作者
Niu, Li [1 ]
Li, Wen [1 ]
Xu, Dong [1 ]
机构
[1] Nanyang Technol Univ, Sch Comp Engn, Singapore, Singapore
关键词
EVENT RECOGNITION; ADAPTATION; KERNEL; IMAGES; VIDEOS;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this work, we formulate a new weakly supervised domain generalization approach for visual recognition by using loosely labeled web images/videos as training data. Specifically, we aim to address two challenging issues when learning robust classifiers: 1) coping with noise in the labels of training web images/videos in the source domain; and 2) enhancing generalization capability of learnt classifiers to any unseen target domain. To address the first issue, we partition the training samples in each class into multiple clusters. By treating each cluster as a "bag" and the samples in each cluster as "instances", we formulate a multi-instance learning (MIL) problem by selecting a subset of training samples from each training bag and simultaneously learning the optimal classifiers based on the selected samples. To address the second issue, we assume the training web images/videos may come from multiple hidden domains with different data distributions. We then extend our MIL formulation to learn one classifier for each class and each latent domain such that multiple classifiers from each class can be effectively integrated to achieve better generalization capability. Extensive experiments on three benchmark datasets demonstrate the effectiveness of our new approach for visual recognition by learning from web data.
引用
收藏
页码:2774 / 2783
页数:10
相关论文
共 50 条
  • [1] Visual Recognition by Learning From Web Data via Weakly Supervised Domain Generalization
    Niu, Li
    Li, Wen
    Xu, Dong
    Cai, Jianfei
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2017, 28 (09) : 1985 - 1999
  • [2] Weakly-Supervised Cross-Domain Dictionary Learning for Visual Recognition
    Zhu, Fan
    Shao, Ling
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2014, 109 (1-2) : 42 - 59
  • [3] Weakly-Supervised Cross-Domain Dictionary Learning for Visual Recognition
    Fan Zhu
    Ling Shao
    International Journal of Computer Vision, 2014, 109 : 42 - 59
  • [4] Learning Visual Features from Large Weakly Supervised Data
    Joulin, Armand
    van der Maaten, Laurens
    Jabri, Allan
    Vasilache, Nicolas
    COMPUTER VISION - ECCV 2016, PT VII, 2016, 9911 : 67 - 84
  • [5] Learning Signs from Subtitles: A Weakly Supervised Approach to Sign Language Recognition
    Cooper, Helen
    Bowden, Richard
    CVPR: 2009 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOLS 1-4, 2009, : 2560 - 2566
  • [6] Attend in groups: a weakly-supervised deep learning framework for learning from web data
    Zhuang, Bohan
    Liu, Lingqiao
    Li, Yao
    Shen, Chunhua
    Reid, Ian
    30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 2915 - 2924
  • [7] Weakly Supervised Scale-Invariant Learning of Models for Visual Recognition
    R. Fergus
    P. Perona
    A. Zisserman
    International Journal of Computer Vision, 2007, 71 : 273 - 303
  • [8] Weakly supervised scale-invariant learning of models for visual recognition
    Fergus, R.
    Perona, P.
    Zisserman, A.
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2007, 71 (03) : 273 - 303
  • [9] Visual Event Recognition in Videos by Learning from Web Data
    Duan, Lixin
    Xu, Dong
    Tsang, Ivor Wai-Hung
    Luo, Jiebo
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2012, 34 (09) : 1667 - 1680
  • [10] Visual Event Recognition in Videos by Learning from Web Data
    Duan, Lixin
    Xu, Dong
    Tsang, Ivor W.
    Luo, Jiebo
    2010 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2010, : 1959 - 1966