Exploiting Web Images for Event Recognition in Consumer Videos: A Multiple Source Domain Adaptation Approach

Cited by: 0
Authors
Duan, Lixin [1]
Xu, Dong [1]
Chang, Shih-Fu [2]
Affiliations
[1] Nanyang Technol Univ, 50 Nanyang Ave, Singapore, Singapore
[2] Columbia Univ, New York, NY 10027 USA
Funding
National Research Foundation, Singapore;
Keywords
DOI
Not available
Chinese Library Classification
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Recent work has demonstrated the effectiveness of domain adaptation methods for computer vision applications. In this work, we propose a new multiple source domain adaptation method called Domain Selection Machine (DSM) for event recognition in consumer videos. DSM leverages a large number of loosely labeled web images from different sources (e.g., Flickr.com and Photosig.com) in a setting where no labeled consumer videos are available. Specifically, we first train a set of SVM classifiers (referred to as source classifiers) using the SIFT features of web images from different source domains. We then propose a new parametric target decision function to effectively integrate the static SIFT features from web images/video keyframes and the space-time (ST) features from consumer videos. In order to select the most relevant source domains, we further introduce a new data-dependent regularizer into the objective of Support Vector Regression (SVR) with the epsilon-insensitive loss, which enforces that the target classifier shares similar decision values on the unlabeled consumer videos with the selected source classifiers. Moreover, we develop an alternating optimization algorithm to iteratively solve for the target decision function and a domain selection vector that indicates the most relevant source domains. Extensive experiments on three real-world datasets demonstrate the effectiveness of our proposed method DSM over the state of the art, with a performance gain of up to 46.41%.
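The alternating idea described in the abstract can be illustrated with a deliberately simplified sketch. It is not the DSM formulation: it assumes precomputed decision values instead of SIFT/ST features, replaces the SVR objective and epsilon-insensitive loss with a plain squared disagreement, and uses a fixed threshold to pick domains. The names `select_domains`, `source_scores`, and `threshold` are hypothetical and do not come from the paper.

```python
from typing import List, Tuple


def select_domains(
    source_scores: List[List[float]],
    threshold: float = 0.5,
    max_iter: int = 20,
) -> Tuple[List[int], List[float]]:
    """Toy alternating scheme (an assumption-laden stand-in, not the DSM solver).

    source_scores[s][i] is the decision value of source classifier s on
    unlabeled target sample i. Returns (selection vector, target scores).
    """
    n_sources = len(source_scores)
    n_samples = len(source_scores[0])
    selection = [1] * n_sources  # start with every source domain selected
    target_scores = [0.0] * n_samples

    for _ in range(max_iter):
        # Step 1: with the selection fixed, the target decision values that
        # minimise the squared disagreement are the mean of the selected sources.
        chosen = [s for s in range(n_sources) if selection[s]]
        target_scores = [
            sum(source_scores[s][i] for s in chosen) / len(chosen)
            for i in range(n_samples)
        ]

        # Step 2: with the target fixed, keep the sources whose mean squared
        # disagreement with the target does not exceed the threshold.
        gaps = [
            sum((source_scores[s][i] - target_scores[i]) ** 2
                for i in range(n_samples)) / n_samples
            for s in range(n_sources)
        ]
        new_selection = [1 if g <= threshold else 0 for g in gaps]
        if not any(new_selection):
            # Always keep at least the closest source domain.
            new_selection[gaps.index(min(gaps))] = 1

        if new_selection == selection:
            break  # the selection vector no longer changes
        selection = new_selection

    return selection, target_scores


# Two source domains that agree with each other and one that disagrees.
scores = [
    [1.0, -1.0, 1.0, -1.0],
    [0.8, -0.9, 1.1, -1.2],
    [-1.0, 1.0, -1.0, 1.0],
]
selection, target = select_domains(scores)
print(selection)
```

In this toy run the third source is dropped because its decision values contradict the other two; in the actual method the selection would be driven by the regularized SVR objective rather than a hand-set threshold.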
Pages: 1338 - 1345
Number of pages: 8
Related papers
50 in total
  • [1] Heterogeneous Multi-group Adaptation for Event Recognition in Consumer Videos
    Yao, Mingyu
    Wu, Xinxiao
    Chen, Mei
    Jia, Yunde
    [J]. IMAGE AND GRAPHICS (ICIG 2017), PT I, 2017, 10666 : 577 - 589
  • [2] Exploiting Web Images for Dataset Construction: A Domain Robust Approach
    Yao, Yazhou
    Zhang, Jian
    Shen, Fumin
    Hua, Xiansheng
    Xu, Jingsong
    Tang, Zhenmin
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2017, 19 (08) : 1771 - 1784
  • [3] Multiple source domain adaptation in micro-expression recognition
    Zhang, Xiaorui
    Xu, Tong
    Sun, Wei
    Song, Aiguo
    [J]. JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2021, 12 (08) : 8371 - 8386
  • [4] Multiple source domain adaptation in micro-expression recognition
    Xiaorui Zhang
    Tong Xu
    Wei Sun
    Aiguo Song
    [J]. Journal of Ambient Intelligence and Humanized Computing, 2021, 12 : 8371 - 8386
  • [5] Unsupervised Domain Adaptation for Face Recognition in Unlabeled Videos
    Sohn, Kihyuk
    Liu, Sifei
    Zhong, Guangyu
    Yu, Xiang
    Yang, Ming-Hsuan
    Chandraker, Manmohan
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017 : 5917 - 5925
  • [6] Recognition of Adult Images, Videos, and Web Page Bags
    Hu, Weiming
    Zuo, Haiqiang
    Wu, Ou
    Chen, Yunfei
    Zhang, Zhongfei
    Suter, David
    [J]. ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2011, 7 (01)
  • [7] Multi-Group Adaptation for Event Recognition from Videos
    Feng, Yang
    Wu, Xinxiao
    Wang, Han
    Liu, Jing
    [J]. 2014 22ND INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2014 : 3915 - 3920
  • [8] Visual Event Recognition in Videos by Learning from Web Data
    Duan, Lixin
    Xu, Dong
    Tsang, Ivor Wai-Hung
    Luo, Jiebo
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2012, 34 (09) : 1667 - 1680
  • [9] Event Recognition in Videos by Learning from Heterogeneous Web Sources
    Chen, Lin
    Duan, Lixin
    Xu, Dong
    [J]. 2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2013 : 2666 - 2673
  • [10] Visual Event Recognition in Videos by Learning from Web Data
    Duan, Lixin
    Xu, Dong
    Tsang, Ivor W.
    Luo, Jiebo
    [J]. 2010 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2010 : 1959 - 1966