Cross-task weakly supervised learning from instructional videos

被引:78
|
作者
Zhukov, Dimitri [1 ,2 ]
Alayrac, Jean-Baptiste [1 ,3 ]
Cinbis, Ramazan Gokberk [4 ]
Fouhey, David [5 ]
Laptev, Ivan [1 ,2 ]
Sivic, Josef [1 ,2 ,6 ]
机构
[1] Inria, Rocquencourt, France
[2] PSL Res Univ, Ecole Normale Super, Dept Informat, Paris, France
[3] DeepMind, London, England
[4] Middle East Tech Univ, Ankara, Turkey
[5] Univ Michigan, Ann Arbor, MI 48109 USA
[6] Czech Tech Univ, CIIRC Czech Inst Informat Robot & Cybernet, Prague, Czech Republic
基金
欧洲研究理事会;
关键词
D O I
10.1109/CVPR.2019.00365
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper we investigate learning visual models for the steps of ordinary tasks using weak supervision via instructional narrations and an ordered list of steps instead of strong supervision via temporal annotations. At the heart of our approach is the observation that weakly supervised learning may be easier if a model shares components while learning different steps: "pour egg" should be trained jointly with other tasks involving "pour" and "egg". We formalize this in a component model for recognizing steps and a weakly supervised learning framework that can learn this model under temporal constraints from narration and the list of steps. Past data does not permit systematic studying of sharing and so we also gather a new dataset, CrossTask, aimed at assessing cross-task sharing. Our experiments demonstrate that sharing across tasks improves performance, especially when done at the component level and that our component model can parse previously unseen tasks by virtue of its compositionality.
引用
下载
收藏
页码:3532 / 3540
页数:9
相关论文
共 50 条
  • [31] CROSS-TASK VALIDATION OF FUNCTIONAL MEASUREMENT
    ANDERSON, NH
    PERCEPTION & PSYCHOPHYSICS, 1972, 12 (05): : 389 - &
  • [32] The costs and benefits of cross-task priming
    Florian Waszak
    Bernhard Hommel
    Memory & Cognition, 2007, 35 : 1175 - 1186
  • [33] Weakly Supervised Multi-task Learning for Semantic Parsing
    Shao, Bo
    Gong, Yeyun
    Bao, Junwei
    Ji, Jianshu
    Cao, Guihong
    Lin, Xiaola
    Duan, Nan
    PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2019, : 3375 - 3381
  • [34] Combining Cross-lingual and Cross-task Supervision for Zero-Shot Learning
    Pikuliak, Matus
    Simko, Marian
    TEXT, SPEECH, AND DIALOGUE (TSD 2020), 2020, 12284 : 162 - 170
  • [35] Learning to Initialize: Can Meta Learning Improve Cross-task Generalization in Prompt Tuning?
    Qin, Chengwei
    Joty, Shafiq
    Li, Qian
    Zhao, Ruochen
    PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2023): LONG PAPERS, VOL 1, 2023, : 11802 - 11832
  • [36] SAR Target Recognition Based on Cross-Domain and Cross-Task Transfer Learning
    Wang, Ke
    Zhang, Gong
    Leung, Henry
    IEEE ACCESS, 2019, 7 : 153391 - 153399
  • [37] Prime Label Learning From Multilabel Aerial Image: A Novel Weakly Supervised Task
    Zhao, Kun
    Zeng, Shiwen
    Zhou, Lijian
    Nie, Tingyuan
    Hao, Siyuan
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2024, 21 : 1 - 5
  • [38] Age Differences in Cross-Task Bleeding
    Nicosia, Jessica
    Balota, David
    PSYCHOLOGY AND AGING, 2020, 35 (06) : 881 - 893
  • [39] The costs and benefits of cross-task priming
    Waszak, Florian
    Hommel, Bernhard
    MEMORY & COGNITION, 2007, 35 (05) : 1175 - 1186
  • [40] CROSS-TASK FACILITATION IN SEMANTIC MEMORY
    MACLEOD, CM
    VOUMVAKIS, S
    BULLETIN OF THE PSYCHONOMIC SOCIETY, 1980, 16 (03) : 153 - 153