MTFormer: Multi-task Learning via Transformer and Cross-Task Reasoning

被引:8
|
作者
Xu, Xiaogang [1 ]
Zhao, Hengshuang [2 ,3 ]
Vineet, Vibhav [4 ]
Lim, Ser-Nam [5 ]
Torralba, Antonio [2 ]
机构
[1] CUHK, Hong Kong, Peoples R China
[2] MIT, 77 Massachusetts Ave, Cambridge, MA 02139 USA
[3] HKU, Hong Kong, Peoples R China
[4] Microsoft Res, Redmond, WA USA
[5] Meta AI, New York, NY USA
来源
关键词
Multi-task learning; Transformer; Cross-task reasoning;
D O I
10.1007/978-3-031-19812-0_18
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we explore the advantages of utilizing transformer structures for addressing multi-task learning (MTL). Specifically, we demonstrate that models with transformer structures are more appropriate for MTL than convolutional neural networks (CNNs), and we propose a novel transformer-based architecture named MTFormer for MTL. In the framework, multiple tasks share the same transformer encoder and transformer decoder, and lightweight branches are introduced to harvest task-specific outputs, which increases the MTL performance and reduces the time-space complexity. Furthermore, information from different task domains can benefit each other, and we conduct cross-task reasoning. We propose a cross-task attention mechanism for further boosting the MTL results. The cross-task attention mechanism brings little parameters and computations while introducing extra performance improvements. Besides, we design a self-supervised cross-task contrastive learning algorithm for further boosting the MTL performance. Extensive experiments are conducted on two multi-task learning datasets, on which MTFormer achieves state-of-the-art results with limited network parameters and computations. It also demonstrates significant superiorities for few-shot learning and zero-shot learning.
引用
收藏
页码:304 / 321
页数:18
相关论文
共 50 条
  • [1] Cross-task Attention Mechanism for Dense Multi-task Learning
    Lopes, Ivan
    Tuan-Hung Vu
    de Charette, Raoul
    2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, : 2328 - 2337
  • [2] Cross-Task Knowledge Distillation in Multi-Task Recommendation
    Yang, Chenxiao
    Pan, Junwei
    Gao, Xiaofeng
    Jiang, Tingyu
    Liu, Dapeng
    Chen, Guihai
    THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 4318 - 4326
  • [3] Multi-task learning with cross-task consistency for improved depth estimation in colonoscopy
    Chavarrias Solano, Pedro Esteban
    Bulpitt, Andrew
    Subramanian, Venkataraman
    Ali, Sharib
    Medical Image Analysis, 2025, 99
  • [4] Cross-Task Attention Network: Improving Multi-task Learning for Medical Imaging Applications
    Kim, Sangwook
    Purdie, Thomas G.
    McIntosh, Chris
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2023 WORKSHOPS, 2023, 14393 : 119 - 128
  • [5] Cross-task feature enhancement strategy in multi-task learning for harvesting Sichuan pepper
    Wang, Yihan
    Deng, Xinglong
    Luo, Jianqiao
    Li, Bailin
    Xiao, Shide
    COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2023, 207
  • [6] Learning Cross-Task Attribute - Attribute Similarity for Multi-task Attribute-Value Extraction
    Jain, Mayank
    Bhattacharya, Sourangshu
    Jain, Harshit
    Shaik, Karimulla
    Chelliah, Muthusamy
    ECNLP 4: THE FOURTH WORKSHOP ON E-COMMERCE AND NLP, 2021, : 79 - 87
  • [7] Multi-task Learning with Selective Cross-Task Transfer for Predicting Bleeding and other Important Patient Outcomes
    Ngufor, Che
    Upadhyaya, Sudhindra
    Murphree, Dennis
    Kor, Daryl
    Pathak, Jyotishman
    PROCEEDINGS OF THE 2015 IEEE INTERNATIONAL CONFERENCE ON DATA SCIENCE AND ADVANCED ANALYTICS (IEEE DSAA 2015), 2015, : 511 - 518
  • [8] Hotspot Detection via Multi-task Learning and Transformer Encoder
    Zhu, Binwu
    Chen, Ran
    Zhang, Xinyun
    Yang, Fan
    Zeng, Xuan
    Yu, Bei
    Wong, Martin D. F.
    2021 IEEE/ACM INTERNATIONAL CONFERENCE ON COMPUTER AIDED DESIGN (ICCAD), 2021,
  • [9] Multi-task Supervised Learning via Cross-learning
    Cervino, Juan
    Andres Bazerque, Juan
    Calvo-Fullana, Miguel
    Ribeiro, Alejandro
    29TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2021), 2021, : 1381 - 1385
  • [10] Calibration of cine MRI segmentation probability for uncertainty estimation using a multi-task cross-task learning architecture
    Hasan, S. M. Kamrul
    Linte, Cristian A.
    MEDICAL IMAGING 2022: IMAGE-GUIDED PROCEDURES, ROBOTIC INTERVENTIONS, AND MODELING, 2022, 12034