Multi-Task Learning as Multi-Objective Optimization

被引:0
|
作者
Sener, Ozan [1 ]
Koltun, Vladlen [1 ]
机构
[1] Intel Labs, Santa Clara, CA 95054 USA
关键词
MINIMUM NORM POINT; ALGORITHM;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In multi-task learning, multiple tasks are solved jointly, sharing inductive bias between them. Multi-task learning is inherently a multi-objective problem because different tasks may conflict, necessitating a trade-off. A common compromise is to optimize a proxy objective that minimizes a weighted linear combination of per-task losses. However, this workaround is only valid when the tasks do not compete, which is rarely the case. In this paper, we explicitly cast multi-task learning as multi-objective optimization, with the overall objective of finding a Pareto optimal solution. To this end, we use algorithms developed in the gradient-based multi-objective optimization literature. These algorithms are not directly applicable to large-scale learning problems since they scale poorly with the dimensionality of the gradients and the number of tasks. We therefore propose an upper bound for the multi-objective loss and show that it can be optimized efficiently. We further prove that optimizing this upper bound yields a Pareto optimal solution under realistic assumptions. We apply our method to a variety of multi-task deep learning problems including digit classification, scene understanding (joint semantic segmentation, instance segmentation, and depth estimation), and multi-label classification. Our method produces higher-performing models than recent multi-task learning formulations or per-task training.
引用
收藏
页数:12
相关论文
共 50 条
  • [1] Multi-Task Multi-View Learning Based on Cooperative Multi-Objective Optimization
    Zhou, Di
    Wang, Jun
    Jiang, Bin
    Guo, Hua
    Li, Yajun
    IEEE ACCESS, 2018, 6 : 19465 - 19477
  • [2] Multi-objective multi-criteria evolutionary algorithm for multi-objective multi-task optimization
    Du, Ke-Jing
    Li, Jian-Yu
    Wang, Hua
    Zhang, Jun
    COMPLEX & INTELLIGENT SYSTEMS, 2023, 9 (02) : 1211 - 1228
  • [3] Multi-objective multi-criteria evolutionary algorithm for multi-objective multi-task optimization
    Ke-Jing Du
    Jian-Yu Li
    Hua Wang
    Jun Zhang
    Complex & Intelligent Systems, 2023, 9 : 1211 - 1228
  • [4] MULTI-OBJECTIVE MULTI-TASK LEARNING ON RNNLM FOR SPEECH RECOGNITION
    Song, Minguang
    Zhao, Yunxin
    Wang, Shaojun
    2018 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY (SLT 2018), 2018, : 197 - 203
  • [5] Multi-objective Optimization for Multi-task Allocation in Mobile Crowd Sensing
    Li, Mingchu
    Gao, Yuan
    Wang, Mingliang
    Guo, Cheng
    Tan, Xing
    16TH INTERNATIONAL CONFERENCE ON MOBILE SYSTEMS AND PERVASIVE COMPUTING (MOBISPC 2019),THE 14TH INTERNATIONAL CONFERENCE ON FUTURE NETWORKS AND COMMUNICATIONS (FNC-2019),THE 9TH INTERNATIONAL CONFERENCE ON SUSTAINABLE ENERGY INFORMATION TECHNOLOGY, 2019, 155 : 360 - 368
  • [6] A Q-learning-based multi-task multi-objective particle swarm optimization algorithm
    Han H.-G.
    Xu Z.-A.
    Wang J.-J.
    Kongzhi yu Juece/Control and Decision, 2023, 38 (11): : 3039 - 3047
  • [7] Multi-Task Learning for Multi-Objective Evolutionary Neural Architecture Search
    Cai, Ronghong
    Luo, Jianping
    2021 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC 2021), 2021, : 1680 - 1687
  • [8] A Multi-objective / Multi-task Learning Framework Induced by Pareto Stationarity
    Momma, Michinari
    Dong, Chaosheng
    Liu, Jia
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
  • [9] Multi-objective optimization based multi-task learning for end-to-end license plates recognition
    Zhou X.-J.
    Gao Y.
    Li C.-J.
    Yang C.-H.
    Kongzhi Lilun Yu Yingyong/Control Theory and Applications, 2021, 38 (05): : 676 - 688
  • [10] Decision variable classification based multi-objective multifactorial memetic algorithm for multi-objective multi-task optimization problem
    Xu, Zhiwei
    Xu, Jiafeng
    Zhang, Kai
    Xu, Xin
    He, Juanjuan
    Wu, Ni
    APPLIED SOFT COMPUTING, 2024, 152