Exploiting Privileged Information from Web Data for Action and Event Recognition

被引:21
|
作者
Niu, Li [1 ]
Li, Wen [2 ]
Xu, Dong [3 ]
机构
[1] Nanyang Technol Univ, Interdisciplinary Grad Sch, 50 Nanyang Ave, Singapore 639798, Singapore
[2] ETH, Comp Vis Lab, Sternwartstr 7, CH-8092 Zurich, Switzerland
[3] Univ Sydney, Sch Elect & Informat Engn, Sydney, NSW 2006, Australia
关键词
Learning using privileged information; Multi-instance learning; Domain adaptation; Action recognition; Event recognition; DOMAIN ADAPTATION; KERNEL; IMAGES; KNOWLEDGE; OBJECTS; VIDEOS;
D O I
10.1007/s11263-015-0862-5
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In the conventional approaches for action and event recognition, sufficient labelled training videos are generally required to learn robust classifiers with good generalization capability on new testing videos. However, collecting labelled training videos is often time consuming and expensive. In this work, we propose new learning frameworks to train robust classifiers for action and event recognition by using freely available web videos as training data. We aim to address three challenging issues: (1) the training web videos are generally associated with rich textual descriptions, which are not available in test videos; (2) the labels of training web videos are noisy and may be inaccurate; (3) the data distributions between training and test videos are often considerably different. To address the first two issues, we propose a new framework called multi-instance learning with privileged information (MIL-PI) together with three new MIL methods, in which we not only take advantage of the additional textual descriptions of training web videos as privileged information, but also explicitly cope with noise in the loose labels of training web videos. When the training and test videos come from different data distributions, we further extend our MIL-PI as a new framework called domain adaptive MIL-PI. We also propose another three new domain adaptation methods, which can additionally reduce the data distribution mismatch between training and test videos. Comprehensive experiments for action and event recognition demonstrate the effectiveness of our proposed approaches.
引用
收藏
页码:130 / 150
页数:21
相关论文
共 50 条
  • [1] Exploiting Privileged Information from Web Data for Action and Event Recognition
    Li Niu
    Wen Li
    Dong Xu
    [J]. International Journal of Computer Vision, 2016, 118 : 130 - 150
  • [2] Exploiting Privileged Information from Web Data for Image Categorization
    Li, Wen
    Niu, Li
    Xu, Dong
    [J]. COMPUTER VISION - ECCV 2014, PT V, 2014, 8693 : 437 - 452
  • [3] Exploiting Privileged Information for Facial Expression Recognition
    Vrigkas, Michalis
    Nikou, Christophoros
    Kakadiaris, Ioannis A.
    [J]. 2016 INTERNATIONAL CONFERENCE ON BIOMETRICS (ICB), 2016,
  • [4] Exploring privileged information from simple actions for complex action recognition
    Liu, Fang
    Xu, Xiangmin
    Zhang, Tong
    Guo, Kailing
    Wang, Lin
    [J]. NEUROCOMPUTING, 2020, 380 : 236 - 245
  • [5] Symbiotic Attention with Privileged Information for Egocentric Action Recognition
    Wang, Xiaohan
    Wu, Yu
    Zhu, Linchao
    Yang, Yi
    [J]. THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 12249 - 12256
  • [6] Action and Event Recognition in Videos by Learning From Heterogeneous Web Sources
    Niu, Li
    Xu, Xinxing
    Chen, Lin
    Duan, Lixin
    Xu, Dong
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2017, 28 (06) : 1290 - 1304
  • [7] Visual Event Recognition in Videos by Learning from Web Data
    Duan, Lixin
    Xu, Dong
    Tsang, Ivor W.
    Luo, Jiebo
    [J]. 2010 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2010, : 1959 - 1966
  • [8] Visual Event Recognition in Videos by Learning from Web Data
    Duan, Lixin
    Xu, Dong
    Tsang, Ivor Wai-Hung
    Luo, Jiebo
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2012, 34 (09) : 1667 - 1680
  • [9] Exploiting Motion Information from Unlabeled Videos for Static Image Action Recognition
    Zhang, Yiyi
    Li Niu
    Pan, Ziqi
    Luo, Meichao
    Zhang, Jianfu
    Cheng, Dawei
    Zhang, Liqing
    [J]. THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 12918 - 12925
  • [10] Learning Using Privileged Information for Zero-Shot Action Recognition
    Gao, Zhiyi
    Hou, Yonghong
    Li, Wanqing
    Guo, Zihui
    Yu, Bin
    [J]. COMPUTER VISION - ACCV 2022, PT IV, 2023, 13844 : 347 - 362