Visual Recognition by Learning from Web Data: A Weakly Supervised Domain Generalization Approach

被引:0
|
作者
Niu, Li [1 ]
Li, Wen [1 ]
Xu, Dong [1 ]
机构
[1] Nanyang Technol Univ, Sch Comp Engn, Singapore, Singapore
关键词
EVENT RECOGNITION; ADAPTATION; KERNEL; IMAGES; VIDEOS;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this work, we formulate a new weakly supervised domain generalization approach for visual recognition by using loosely labeled web images/videos as training data. Specifically, we aim to address two challenging issues when learning robust classifiers: 1) coping with noise in the labels of training web images/videos in the source domain; and 2) enhancing generalization capability of learnt classifiers to any unseen target domain. To address the first issue, we partition the training samples in each class into multiple clusters. By treating each cluster as a "bag" and the samples in each cluster as "instances", we formulate a multi-instance learning (MIL) problem by selecting a subset of training samples from each training bag and simultaneously learning the optimal classifiers based on the selected samples. To address the second issue, we assume the training web images/videos may come from multiple hidden domains with different data distributions. We then extend our MIL formulation to learn one classifier for each class and each latent domain such that multiple classifiers from each class can be effectively integrated to achieve better generalization capability. Extensive experiments on three benchmark datasets demonstrate the effectiveness of our new approach for visual recognition by learning from web data.
引用
收藏
页码:2774 / 2783
页数:10
相关论文
共 50 条
  • [41] Unlearning From Weakly Supervised Learning
    Tang, Yi
    Gao, Yi
    Luo, Yong-Gang
    Yang, Ju-Cheng
    Xu, Miao
    Zhang, Min-Ling
    PROCEEDINGS OF THE THIRTY-THIRD INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2024, 2024, : 5000 - 5008
  • [42] Seeing Through Darkness: Visual Localization at Night via Weakly Supervised Learning of Domain Invariant Features
    Fan, Bin
    Yang, Yuzhu
    Feng, Wensen
    Wu, Fuchao
    Lu, Jiwen
    Liu, Hongmin
    IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 1713 - 1726
  • [43] Learning Facial Action Units from Web Images with Scalable Weakly Supervised Clustering
    Zhao, Kaili
    Chu, Wen-Sheng
    Martinez, Aleix M.
    2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 2090 - 2099
  • [44] Relation Extraction from Chinese News Web Documents Based on Weakly Supervised Learning
    Qiu, Jing
    Liao, Lejian
    Li, Peng
    2009 INTERNATIONAL CONFERENCE ON INTELLIGENT NETWORKING AND COLLABORATIVE SYSTEMS (INCOS 2009), 2009, : 219 - 225
  • [45] Semi-Supervised Learning for Named Entity Recognition Using Weakly Labeled Training Data
    Zafarian, Atefeh
    Rokni, Ali
    Khadivi, Shahram
    Ghiasifard, Sonia
    2015 INTERNATIONAL SYMPOSIUM ON ARTIFICIAL INTELLIGENCE AND SIGNAL PROCESSING (AISP), 2015, : 129 - 135
  • [46] Weakly Supervised Visual Dictionary Learning by Harnessing Image Attributes
    Gao, Yue
    Ji, Rongrong
    Liu, Wei
    Dai, Qionghai
    Hua, Gang
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2014, 23 (12) : 5400 - 5411
  • [47] GearNet: Stepwise Dual Learning for Weakly Supervised Domain Adaptation
    Xie, Renchunzi
    Wei, Hongxin
    Feng, Lei
    An, Bo
    THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 8717 - 8725
  • [48] Learning from the Web: Webly Supervised Meta-Learning for Masked Face Recognition
    Zheng, Wenbo
    Yan, Lan
    Wang, Fei-Yue
    Gou, Chao
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2021, 2021, : 4299 - 4308
  • [49] Weakly Supervised Deep Learning for the Detection of Domain Generation Algorithms
    Yu, Bin
    Pan, Jie
    Gray, Daniel
    Hu, Jiaming
    Choudhary, Chhaya
    Nascimento, Anderson C. A.
    De Cock, Martine
    IEEE ACCESS, 2019, 7 : 51542 - 51556
  • [50] Weakly-Supervised Learning of Visual Relations in Multimodal Pretraining
    Bugliarello, Emanuele
    Nematzadeh, Aida
    Hendricks, Lisa Anne
    2023 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING, EMNLP 2023, 2023, : 3052 - 3071