Watch, Listen & Learn: Co-training on Captioned Images and Videos

Cited by: 0
Authors
Gupta, Sonal [1]
Kim, Joohyun [1]
Grauman, Kristen [1]
Mooney, Raymond [1]
Affiliation
[1] Univ Texas Austin, Dept Comp Sci, Austin, TX 78712 USA
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Recognizing visual scenes and activities is challenging: visual cues alone are often ambiguous, and manually labeled training examples are expensive to obtain. To cope with these constraints, we propose to leverage the text that often accompanies visual data to learn robust models of scenes and actions from partially labeled collections. Our approach uses co-training, a semi-supervised learning method that accommodates multi-modal views of data. To classify images, our method learns from captioned images of natural scenes; to recognize human actions, it learns from videos of athletic events with commentary. We show that by exploiting both multi-modal representations and unlabeled data, our approach learns more accurate image and video classifiers than standard baseline algorithms.
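The co-training idea named in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not say which base classifiers or features were used, so the nearest-centroid classifier, the toy two-dimensional "visual" and "text" features, the class names, and every function name below are assumptions chosen only to keep the example self-contained. Each example has two views; in each round, a classifier trained on one view labels the unlabeled example it is most confident about, and that example joins the shared labeled pool used by both views.

```python
from math import dist

def centroids(points, labels):
    """Mean feature vector per class for a single view."""
    sums, counts = {}, {}
    for p, y in zip(points, labels):
        acc = sums.setdefault(y, [0.0] * len(p))
        for i, v in enumerate(p):
            acc[i] += v
        counts[y] = counts.get(y, 0) + 1
    return {y: [v / counts[y] for v in acc] for y, acc in sums.items()}

def predict(cents, p):
    """Return (label, confidence); confidence is the distance margin
    between the two nearest class centroids."""
    ranked = sorted((dist(c, p), y) for y, c in cents.items())
    margin = ranked[1][0] - ranked[0][0] if len(ranked) > 1 else 0.0
    return ranked[0][1], margin

def co_train(labeled, unlabeled, rounds=5, per_round=1):
    """labeled: list of ((view_a, view_b), label); unlabeled: list of (view_a, view_b)."""
    labeled, pool = list(labeled), list(unlabeled)
    for _ in range(rounds):
        for view in (0, 1):
            if not pool:
                break
            # Train this view's classifier on everything labeled so far.
            cents = centroids([x[view] for x, _ in labeled], [y for _, y in labeled])
            # Rank unlabeled examples by this view's confidence, highest first.
            scored = sorted(
                ((predict(cents, x[view]), i) for i, x in enumerate(pool)),
                key=lambda t: t[0][1], reverse=True)
            picked = scored[:per_round]
            # Self-labeled examples join the pool shared by both views.
            for (label, _), i in picked:
                labeled.append((pool[i], label))
            chosen = {i for _, i in picked}
            pool = [x for i, x in enumerate(pool) if i not in chosen]
    models = [centroids([x[v] for x, _ in labeled], [y for _, y in labeled])
              for v in (0, 1)]
    return models, labeled

# Toy data (hypothetical): view 0 stands in for visual features, view 1 for text.
labeled = [(([0.0, 0.0], [1.0, 0.0]), "class_0"),
           (([5.0, 5.0], [0.0, 1.0]), "class_1")]
unlabeled = [([0.5, 0.2], [0.9, 0.1]),
             ([4.8, 5.1], [0.1, 0.9]),
             ([2.4, 2.6], [0.95, 0.0]),   # visually ambiguous, clear text
             ([2.6, 2.4], [0.0, 1.0])]    # visually ambiguous, clear text
models, grown = co_train(labeled, unlabeled)
```

In this toy setup, the two examples whose visual features sit midway between the classes are labeled by the text view, which mirrors the abstract's motivation that visual cues alone can be ambiguous. A real system would substitute stronger per-view classifiers and a calibrated confidence estimate.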
Pages: 457-472
Page count: 16