Multi-Task Learning as Multi-Objective Optimization

被引：0

作者：

Sener, Ozan ^{[1
]}

Koltun, Vladlen ^{[1
]}

机构：

[1] Intel Labs, Santa Clara, CA 95054 USA

来源：

ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018) | 2018年 / 31卷

关键词：

MINIMUM NORM POINT; ALGORITHM;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In multi-task learning, multiple tasks are solved jointly, sharing inductive bias between them. Multi-task learning is inherently a multi-objective problem because different tasks may conflict, necessitating a trade-off. A common compromise is to optimize a proxy objective that minimizes a weighted linear combination of per-task losses. However, this workaround is only valid when the tasks do not compete, which is rarely the case. In this paper, we explicitly cast multi-task learning as multi-objective optimization, with the overall objective of finding a Pareto optimal solution. To this end, we use algorithms developed in the gradient-based multi-objective optimization literature. These algorithms are not directly applicable to large-scale learning problems since they scale poorly with the dimensionality of the gradients and the number of tasks. We therefore propose an upper bound for the multi-objective loss and show that it can be optimized efficiently. We further prove that optimizing this upper bound yields a Pareto optimal solution under realistic assumptions. We apply our method to a variety of multi-task deep learning problems including digit classification, scene understanding (joint semantic segmentation, instance segmentation, and depth estimation), and multi-label classification. Our method produces higher-performing models than recent multi-task learning formulations or per-task training.

引用

页数：12

共 50 条

[21] A Multi-task Multi-view based Multi-objective Clustering Algorithm
Mitra, Sayantan
Saha, Sriparna
2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 4720 - 4727
[22] Learning-guided coevolution multi-objective particle swarm optimization for heterogeneous UAV cooperative multi-task reallocation problem
Wang F.
Fu Q.-P.
Han M.-C.
Xing L.-N.
Wu H.-S.
Kongzhi Lilun Yu Yingyong/Control Theory and Applications, 2024, 41 (06): : 1009 - 1017
[23] Content and concentration of rare earth element components based on multi-task learning multi-objective optimization multidimensional soft measurement
Zhang S.-P.
Zhang Q.-H.
Wang B.
Kongzhi Lilun Yu Yingyong/Control Theory and Applications, 2024, 41 (03): : 454 - 467
[24] A Multiple Gradient Descent Design for Multi-Task Learning on Edge Computing: Multi-Objective Machine Learning Approach
Zhou, Xiaojun
Gao, Yuan
Li, Chaojie
Huang, Zhaoke
IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING, 2022, 9 (01): : 121 - 133
[25] A Multi-Workflow Scheduling Approach With Explicit Evolutionary Multi-Objective Multi-Task Optimization Algorithm in Cloud Environment
Zhang, Qiqi
Li, Bohui
Geng, Shaojin
Cai, Xingjuan
CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2025, 37 (01):
[26] Multi-task Learning with Cartesian Product-Based Multi-objective Combination for Dangerous Object Detection
Chen, Yaran
Zhao, Dongbin
ADVANCES IN NEURAL NETWORKS, PT I, 2017, 10261 : 28 - 35
[27] End to end multi-task learning with attention for multi-objective fault diagnosis under small sample
Xie, Zongliang
Chen, Jinglong
Feng, Yong
Zhang, Kaiyu
Zhou, Zitong
JOURNAL OF MANUFACTURING SYSTEMS, 2022, 62 : 301 - 316
[28] Multi-objective Bayesian alloy design using multi-task Gaussian processes
Khatamsaz, Danial
Vela, Brent
Arroyave, Raymundo
MATERIALS LETTERS, 2023, 351
[29] Multi-task multi-objective evolutionary network for hyperspectral image classification and pansharpening
Wu, Xiande
Feng, Jie
Shang, Ronghua
Wu, Jinjian
Zhang, Xiangrong
Jiao, Licheng
Gamba, Paolo
INFORMATION FUSION, 2024, 108
[30] Multi-task gradient descent for multi-task learning
Bai, Lu
Ong, Yew-Soon
He, Tiantian
Gupta, Abhishek
MEMETIC COMPUTING, 2020, 12 (04) : 355 - 369

← 1 2 3 4 5 →