Multi-Task Learning as Multi-Objective Optimization

被引:0
|
作者
Sener, Ozan [1 ]
Koltun, Vladlen [1 ]
机构
[1] Intel Labs, Santa Clara, CA 95054 USA
关键词
MINIMUM NORM POINT; ALGORITHM;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In multi-task learning, multiple tasks are solved jointly, sharing inductive bias between them. Multi-task learning is inherently a multi-objective problem because different tasks may conflict, necessitating a trade-off. A common compromise is to optimize a proxy objective that minimizes a weighted linear combination of per-task losses. However, this workaround is only valid when the tasks do not compete, which is rarely the case. In this paper, we explicitly cast multi-task learning as multi-objective optimization, with the overall objective of finding a Pareto optimal solution. To this end, we use algorithms developed in the gradient-based multi-objective optimization literature. These algorithms are not directly applicable to large-scale learning problems since they scale poorly with the dimensionality of the gradients and the number of tasks. We therefore propose an upper bound for the multi-objective loss and show that it can be optimized efficiently. We further prove that optimizing this upper bound yields a Pareto optimal solution under realistic assumptions. We apply our method to a variety of multi-task deep learning problems including digit classification, scene understanding (joint semantic segmentation, instance segmentation, and depth estimation), and multi-label classification. Our method produces higher-performing models than recent multi-task learning formulations or per-task training.
引用
收藏
页数:12
相关论文
共 50 条
  • [21] A Multi-task Multi-view based Multi-objective Clustering Algorithm
    Mitra, Sayantan
    Saha, Sriparna
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 4720 - 4727
  • [22] Learning-guided coevolution multi-objective particle swarm optimization for heterogeneous UAV cooperative multi-task reallocation problem
    Wang F.
    Fu Q.-P.
    Han M.-C.
    Xing L.-N.
    Wu H.-S.
    Kongzhi Lilun Yu Yingyong/Control Theory and Applications, 2024, 41 (06): : 1009 - 1017
  • [23] Content and concentration of rare earth element components based on multi-task learning multi-objective optimization multidimensional soft measurement
    Zhang S.-P.
    Zhang Q.-H.
    Wang B.
    Kongzhi Lilun Yu Yingyong/Control Theory and Applications, 2024, 41 (03): : 454 - 467
  • [24] A Multiple Gradient Descent Design for Multi-Task Learning on Edge Computing: Multi-Objective Machine Learning Approach
    Zhou, Xiaojun
    Gao, Yuan
    Li, Chaojie
    Huang, Zhaoke
    IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING, 2022, 9 (01): : 121 - 133
  • [25] A Multi-Workflow Scheduling Approach With Explicit Evolutionary Multi-Objective Multi-Task Optimization Algorithm in Cloud Environment
    Zhang, Qiqi
    Li, Bohui
    Geng, Shaojin
    Cai, Xingjuan
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2025, 37 (01):
  • [26] Multi-task Learning with Cartesian Product-Based Multi-objective Combination for Dangerous Object Detection
    Chen, Yaran
    Zhao, Dongbin
    ADVANCES IN NEURAL NETWORKS, PT I, 2017, 10261 : 28 - 35
  • [27] End to end multi-task learning with attention for multi-objective fault diagnosis under small sample
    Xie, Zongliang
    Chen, Jinglong
    Feng, Yong
    Zhang, Kaiyu
    Zhou, Zitong
    JOURNAL OF MANUFACTURING SYSTEMS, 2022, 62 : 301 - 316
  • [28] Multi-objective Bayesian alloy design using multi-task Gaussian processes
    Khatamsaz, Danial
    Vela, Brent
    Arroyave, Raymundo
    MATERIALS LETTERS, 2023, 351
  • [29] Multi-task multi-objective evolutionary network for hyperspectral image classification and pansharpening
    Wu, Xiande
    Feng, Jie
    Shang, Ronghua
    Wu, Jinjian
    Zhang, Xiangrong
    Jiao, Licheng
    Gamba, Paolo
    INFORMATION FUSION, 2024, 108
  • [30] Multi-task gradient descent for multi-task learning
    Bai, Lu
    Ong, Yew-Soon
    He, Tiantian
    Gupta, Abhishek
    MEMETIC COMPUTING, 2020, 12 (04) : 355 - 369