Multi-Task Learning as Multi-Objective Optimization

被引：0

作者：

Sener, Ozan ^{[1
]}

Koltun, Vladlen ^{[1
]}

机构：

[1] Intel Labs, Santa Clara, CA 95054 USA

来源：

ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018) | 2018年 / 31卷

关键词：

MINIMUM NORM POINT; ALGORITHM;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In multi-task learning, multiple tasks are solved jointly, sharing inductive bias between them. Multi-task learning is inherently a multi-objective problem because different tasks may conflict, necessitating a trade-off. A common compromise is to optimize a proxy objective that minimizes a weighted linear combination of per-task losses. However, this workaround is only valid when the tasks do not compete, which is rarely the case. In this paper, we explicitly cast multi-task learning as multi-objective optimization, with the overall objective of finding a Pareto optimal solution. To this end, we use algorithms developed in the gradient-based multi-objective optimization literature. These algorithms are not directly applicable to large-scale learning problems since they scale poorly with the dimensionality of the gradients and the number of tasks. We therefore propose an upper bound for the multi-objective loss and show that it can be optimized efficiently. We further prove that optimizing this upper bound yields a Pareto optimal solution under realistic assumptions. We apply our method to a variety of multi-task deep learning problems including digit classification, scene understanding (joint semantic segmentation, instance segmentation, and depth estimation), and multi-label classification. Our method produces higher-performing models than recent multi-task learning formulations or per-task training.

引用

页数：12

共 50 条

[31] Multi-task gradient descent for multi-task learning
Lu Bai
Yew-Soon Ong
Tiantian He
Abhishek Gupta
Memetic Computing, 2020, 12 : 355 - 369
[32] Multi-objective no-wait multi-task scheduling problem of batch process
Gu, X. (xsgu@ecust.edu.cn), 1600, Materials China (64):
[33] Multi-Objective Optimization in Learning to Rank
Dai, Na
Shokouhi, Milad
Davison, Brian D.
PROCEEDINGS OF THE 34TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR'11), 2011, : 1241 - 1242
[34] Multi-objective optimization by learning automata
Liao, H. L.
Wu, Q. H.
JOURNAL OF GLOBAL OPTIMIZATION, 2013, 55 (02) : 459 - 487
[35] Multi-objective optimization by learning automata
H. L. Liao
Q. H. Wu
Journal of Global Optimization, 2013, 55 : 459 - 487
[36] A novel multi-objective generative design approach for sustainable building using multi-task learning (ANN) integration
Li, Mingchen
Wang, Zhe
Chang, Hao
Wang, Zhoupeng
Guo, Juanli
APPLIED ENERGY, 2024, 376
[37] Deep learning and optimization enabled multi-objective for task scheduling in cloud computing
Komarasamy, Dinesh
Ramaganthan, Siva Malar
Kandaswamy, Dharani Molapalayam
Mony, Gokuldhev
NETWORK-COMPUTATION IN NEURAL SYSTEMS, 2025, 36 (01) : 79 - 108
[38] Multi-Objective Software Defect Prediction via Multi-Source Uncertain Information Fusion and Multi-Task Multi-View Learning
Yang, Minghao
Yang, Shunkun
Wong, W. Eric
IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 2024, 50 (08) : 2054 - 2076
[39] Choosing Chunk Trees in the Task of Multi-Objective Optimization
Anisimov, Ivan
Makarova, Elena
Polyakov, Vladimir
2017 COMPUTING CONFERENCE, 2017, : 96 - 100
[40] A Multi-objective Optimization Algorithm of Task Scheduling in WSN
Dai, L.
Xu, H. K.
Chen, T.
Qian, C.
Xie, L. J.
INTERNATIONAL JOURNAL OF COMPUTERS COMMUNICATIONS & CONTROL, 2014, 9 (02) : 160 - 171

← 1 2 3 4 5 →