Cross-task weakly supervised learning from instructional videos

被引:97
|
作者
Zhukov, Dimitri [1 ,2 ]
Alayrac, Jean-Baptiste [1 ,3 ]
Cinbis, Ramazan Gokberk [4 ]
Fouhey, David [5 ]
Laptev, Ivan [1 ,2 ]
Sivic, Josef [1 ,2 ,6 ]
机构
[1] Inria, Rocquencourt, France
[2] PSL Res Univ, Ecole Normale Super, Dept Informat, Paris, France
[3] DeepMind, London, England
[4] Middle East Tech Univ, Ankara, Turkey
[5] Univ Michigan, Ann Arbor, MI 48109 USA
[6] Czech Tech Univ, CIIRC Czech Inst Informat Robot & Cybernet, Prague, Czech Republic
基金
欧洲研究理事会;
关键词
D O I
10.1109/CVPR.2019.00365
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper we investigate learning visual models for the steps of ordinary tasks using weak supervision via instructional narrations and an ordered list of steps instead of strong supervision via temporal annotations. At the heart of our approach is the observation that weakly supervised learning may be easier if a model shares components while learning different steps: "pour egg" should be trained jointly with other tasks involving "pour" and "egg". We formalize this in a component model for recognizing steps and a weakly supervised learning framework that can learn this model under temporal constraints from narration and the list of steps. Past data does not permit systematic studying of sharing and so we also gather a new dataset, CrossTask, aimed at assessing cross-task sharing. Our experiments demonstrate that sharing across tasks improves performance, especially when done at the component level and that our component model can parse previously unseen tasks by virtue of its compositionality.
引用
收藏
页码:3532 / 3540
页数:9
相关论文
共 50 条
  • [21] Hierarchical Modeling for Task Recognition and Action Segmentation in Weakly-Labeled Instructional Videos
    Ghoddoosian, Reza
    Sayed, Saif
    Athitsos, Vassilis
    2022 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2022), 2022, : 120 - 130
  • [22] Cross-task strategic effects
    Kathleen Rastle
    Sachiko Kinoshita
    Stephen J. Lupker
    Max Coltheart
    Memory & Cognition, 2003, 31 : 867 - 876
  • [23] Cross-task strategic effects
    Rastle, K
    Kinoshita, S
    Lupker, SJ
    Coltheart, M
    MEMORY & COGNITION, 2003, 31 (06) : 867 - 876
  • [24] Transferability-Guided Cross-Domain Cross-Task Transfer Learning
    Tan, Yang
    Zhang, Enming
    Li, Yang
    Huang, Shao-Lun
    Zhang, Xiao-Ping
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2025, 36 (02) : 2423 - 2436
  • [25] Weakly Supervised Video Representation Learning with Unaligned Text for Sequential Videos
    Dong, Sixun
    Hu, Huazhang
    Lian, Dongze
    Luo, Weixin
    Qian, Yicheng
    Gao, Shenghua
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, : 2437 - 2447
  • [26] Semi-Supervised Image Classification With Self-Paced Cross-Task Networks
    Wu, Si
    Ji, Qiujia
    Wang, Shufeng
    Wong, Hau-San
    Yu, Zhiwen
    Xu, Yong
    IEEE TRANSACTIONS ON MULTIMEDIA, 2018, 20 (04) : 851 - 865
  • [27] Weakly Supervised Summarization of Web Videos
    Panda, Rameswar
    Das, Abir
    Wu, Ziyan
    Ernst, Jan
    Roy-Chowdhury, Amit K.
    2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 3677 - 3686
  • [28] Cross-Task Inconsistency Based Active Learning (CTIAL) for Emotion Recognition
    Xu, Yifan
    Jiang, Xue
    Wu, Dongrui
    IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2024, 15 (03) : 1659 - 1668
  • [29] Meta-Modulation: A General Learning Framework for Cross-Task Adaptation
    Lu, Jiang
    Xiao, Changming
    Zhang, Changshui
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, : 1 - 15
  • [30] CROSS-TASK CODE REUSE IN GENETIC PROGRAMMING APPLIED TO VISUAL LEARNING
    Jaskowski, Wojciech
    Krawiec, Krzysztof
    Wieloch, Bartosz
    INTERNATIONAL JOURNAL OF APPLIED MATHEMATICS AND COMPUTER SCIENCE, 2014, 24 (01) : 183 - 197