Multi-Task Learning as Multi-Objective Optimization

被引:0
|
作者
Sener, Ozan [1 ]
Koltun, Vladlen [1 ]
机构
[1] Intel Labs, Santa Clara, CA 95054 USA
关键词
MINIMUM NORM POINT; ALGORITHM;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In multi-task learning, multiple tasks are solved jointly, sharing inductive bias between them. Multi-task learning is inherently a multi-objective problem because different tasks may conflict, necessitating a trade-off. A common compromise is to optimize a proxy objective that minimizes a weighted linear combination of per-task losses. However, this workaround is only valid when the tasks do not compete, which is rarely the case. In this paper, we explicitly cast multi-task learning as multi-objective optimization, with the overall objective of finding a Pareto optimal solution. To this end, we use algorithms developed in the gradient-based multi-objective optimization literature. These algorithms are not directly applicable to large-scale learning problems since they scale poorly with the dimensionality of the gradients and the number of tasks. We therefore propose an upper bound for the multi-objective loss and show that it can be optimized efficiently. We further prove that optimizing this upper bound yields a Pareto optimal solution under realistic assumptions. We apply our method to a variety of multi-task deep learning problems including digit classification, scene understanding (joint semantic segmentation, instance segmentation, and depth estimation), and multi-label classification. Our method produces higher-performing models than recent multi-task learning formulations or per-task training.
引用
收藏
页数:12
相关论文
共 50 条
  • [31] Multi-task gradient descent for multi-task learning
    Lu Bai
    Yew-Soon Ong
    Tiantian He
    Abhishek Gupta
    Memetic Computing, 2020, 12 : 355 - 369
  • [32] Multi-objective no-wait multi-task scheduling problem of batch process
    Gu, X. (xsgu@ecust.edu.cn), 1600, Materials China (64):
  • [33] Multi-Objective Optimization in Learning to Rank
    Dai, Na
    Shokouhi, Milad
    Davison, Brian D.
    PROCEEDINGS OF THE 34TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR'11), 2011, : 1241 - 1242
  • [34] Multi-objective optimization by learning automata
    Liao, H. L.
    Wu, Q. H.
    JOURNAL OF GLOBAL OPTIMIZATION, 2013, 55 (02) : 459 - 487
  • [35] Multi-objective optimization by learning automata
    H. L. Liao
    Q. H. Wu
    Journal of Global Optimization, 2013, 55 : 459 - 487
  • [36] A novel multi-objective generative design approach for sustainable building using multi-task learning (ANN) integration
    Li, Mingchen
    Wang, Zhe
    Chang, Hao
    Wang, Zhoupeng
    Guo, Juanli
    APPLIED ENERGY, 2024, 376
  • [37] Deep learning and optimization enabled multi-objective for task scheduling in cloud computing
    Komarasamy, Dinesh
    Ramaganthan, Siva Malar
    Kandaswamy, Dharani Molapalayam
    Mony, Gokuldhev
    NETWORK-COMPUTATION IN NEURAL SYSTEMS, 2025, 36 (01) : 79 - 108
  • [38] Multi-Objective Software Defect Prediction via Multi-Source Uncertain Information Fusion and Multi-Task Multi-View Learning
    Yang, Minghao
    Yang, Shunkun
    Wong, W. Eric
    IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 2024, 50 (08) : 2054 - 2076
  • [39] Choosing Chunk Trees in the Task of Multi-Objective Optimization
    Anisimov, Ivan
    Makarova, Elena
    Polyakov, Vladimir
    2017 COMPUTING CONFERENCE, 2017, : 96 - 100
  • [40] A Multi-objective Optimization Algorithm of Task Scheduling in WSN
    Dai, L.
    Xu, H. K.
    Chen, T.
    Qian, C.
    Xie, L. J.
    INTERNATIONAL JOURNAL OF COMPUTERS COMMUNICATIONS & CONTROL, 2014, 9 (02) : 160 - 171