Visual Recognition by Learning from Web Data: A Weakly Supervised Domain Generalization Approach

Cited by: 0
Authors
Niu, Li [1]
Li, Wen [1]
Xu, Dong [1]
Affiliations
[1] Nanyang Technol Univ, Sch Comp Engn, Singapore, Singapore
Keywords
EVENT RECOGNITION; ADAPTATION; KERNEL; IMAGES; VIDEOS;
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
In this work, we formulate a new weakly supervised domain generalization approach for visual recognition by using loosely labeled web images/videos as training data. Specifically, we aim to address two challenging issues when learning robust classifiers: 1) coping with noise in the labels of training web images/videos in the source domain; and 2) enhancing generalization capability of learnt classifiers to any unseen target domain. To address the first issue, we partition the training samples in each class into multiple clusters. By treating each cluster as a "bag" and the samples in each cluster as "instances", we formulate a multi-instance learning (MIL) problem by selecting a subset of training samples from each training bag and simultaneously learning the optimal classifiers based on the selected samples. To address the second issue, we assume the training web images/videos may come from multiple hidden domains with different data distributions. We then extend our MIL formulation to learn one classifier for each class and each latent domain such that multiple classifiers from each class can be effectively integrated to achieve better generalization capability. Extensive experiments on three benchmark datasets demonstrate the effectiveness of our new approach for visual recognition by learning from web data.
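The bag construction described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not name a clustering algorithm, a scoring function, or a selection ratio, so plain k-means, a fixed score list, and the `ratio` parameter are all assumptions made here for illustration. In the paper the sample selection and the classifiers are learned jointly, and the latent-domain extension is not covered; this sketch only shows the "cluster as bag, sample as instance" idea with a one-off selection step.

```python
import math
import random
from typing import List, Sequence

Vector = Sequence[float]


def sq_dist(a: Vector, b: Vector) -> float:
    """Squared Euclidean distance between two feature vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def mean(points: List[Vector]) -> List[float]:
    """Coordinate-wise mean of a non-empty list of vectors."""
    return [sum(col) / len(points) for col in zip(*points)]


def kmeans_bags(samples: List[Vector], k: int, iters: int = 20, seed: int = 0) -> List[List[int]]:
    """Partition one class's samples into clusters ("bags") of sample indices.

    Plain k-means is an assumption; the abstract only says "clusters".
    """
    rng = random.Random(seed)
    centers = [list(c) for c in rng.sample(samples, k)]
    assign = [0] * len(samples)
    for _ in range(iters):
        # Assign every sample to its nearest center.
        assign = [min(range(k), key=lambda j: sq_dist(s, centers[j])) for s in samples]
        # Recompute each center from its members (empty clusters keep their center).
        for j in range(k):
            members = [s for s, a in zip(samples, assign) if a == j]
            if members:
                centers[j] = mean(members)
    bags = [[i for i, a in enumerate(assign) if a == j] for j in range(k)]
    return [b for b in bags if b]


def select_instances(bags: List[List[int]], scores: Sequence[float], ratio: float) -> List[List[int]]:
    """Keep the highest-scoring fraction `ratio` of each bag (at least one instance).

    `scores` stands in for classifier outputs; `ratio` is a hypothetical parameter.
    """
    selected = []
    for bag in bags:
        keep = max(1, math.ceil(ratio * len(bag)))
        ranked = sorted(bag, key=lambda i: scores[i], reverse=True)
        selected.append(sorted(ranked[:keep]))
    return selected


if __name__ == "__main__":
    # Toy 2-D features for one class: two well-separated groups.
    samples = [(0, 0), (0, 1), (1, 0), (10, 10), (10, 11), (11, 10)]
    scores = [0.9, 0.1, 0.5, 0.2, 0.8, 0.3]  # made-up relevance scores
    bags = kmeans_bags(samples, k=2)
    print(select_instances(bags, scores, ratio=0.5))
```

In a full MIL formulation the scores would come from the classifier being trained, and selection and training would alternate or be optimized together rather than run once as above.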
Pages: 2774-2783
Page count: 10