Efficient Computation Sharing for Multi-Task Visual Scene Understanding

Authors
Shoouri, Sara [1]
Yang, Mingyu [1]
Fan, Zichen [1]
Kim, Hun-Seok [1]
Affiliations
[1] Univ Michigan, Ann Arbor, MI 48109 USA
Keywords
MODEL;
DOI
10.1109/ICCV51070.2023.01571
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104; 0812; 0835; 1405;
Abstract
Solving multiple visual tasks using individual models can be resource-intensive, while multi-task learning can conserve resources by sharing knowledge across different tasks. Despite the benefits of multi-task learning, such techniques can struggle with balancing the loss for each task, leading to potential performance degradation. We present a novel computation- and parameter-sharing framework that balances efficiency and accuracy to perform multiple visual tasks utilizing individually-trained single-task transformers. Our method is motivated by transfer learning schemes to reduce computational and parameter storage costs while maintaining the desired performance. Our approach involves splitting the tasks into a base task and the other sub-tasks, and sharing a significant portion of activations and parameters/weights between the base and sub-tasks to decrease inter-task redundancies and enhance knowledge sharing. The evaluation conducted on the NYUD-v2 and PASCAL-Context datasets shows that our method outperforms state-of-the-art transformer-based multi-task learning techniques, achieving higher accuracy with reduced computational resources. Moreover, our method is extended to video stream inputs, further reducing computational costs by efficiently sharing information across the temporal domain as well as the task domain. Our code is available at https://github.com/sarashoouri/EfficientMTL.
Pages: 17084-17095
Page count: 12