Survey on multi-task learning for object classification and recognition

被引:0
|
作者
Li H. [1 ]
Wang F. [2 ]
Ding W. [1 ]
机构
[1] Research Institute of Unmanned System, Beihang University, Beijing
[2] School of Electronics and Information Engineering, Beihang University, Beijing
基金
中国国家自然科学基金;
关键词
Deep learning; Fine-grained classification; Multi-task learning; Object classification; Object re-identification;
D O I
10.7527/S1000-6893.2021.24889
中图分类号
学科分类号
摘要
Multi-Task Learning(MTL) aims to enhance the model performance by jointly leveraging supervisory signals and sharing useful information among multiple related tasks. This paper comprehensively summarizes and analyzes the mechanism and mainstream methods of multi-task learning for object classification and recognition applications. First, we review the definitions, principles and methods of MTL. Second, taking the representative and widely used fine-grained classification and object re-identification as examples, we emphatically introduce two types of multi-task learning for object classification and recognition: task-based multi-task learning and feature-based multi-task learning, and further categorize each type and analyze the design ideas, and advantages and disadvantages of different MTL algorithms. Third, we compare the performance of various MTL algorithms reviewed in this paper on common datasets. Finally, prospects on development trends of MTL algorithms for object classification and recognition are discussed. © 2022, Beihang University Aerospace Knowledge Press. All right reserved.
引用
收藏
相关论文
共 67 条
  • [1] CARUANA R., Multitask learning, Machine Learning, 28, 1, pp. 41-75, (1997)
  • [2] RUDER S., An overview of multi-task learning in deep neural networks, (2017)
  • [3] ZHANG Y, YANG Q., A survey on multi-task learning, (2017)
  • [4] ZHANG Y, LIU J W, ZUO X., Survey of multi-task learning, Chinese Journal of Computers, 43, 7, pp. 1340-1378, (2020)
  • [5] VANDENHENDE S, GEORGOULIS S, VAN GOOL L., MTI-net: Multi-scale task interaction networks for multi-task learning, Computer Vision-ECCV 2020, pp. 527-543, (2020)
  • [6] ZHENG L, SHEN L Y, TIAN L, Et al., Scalable person re-identification: A benchmark, 2015 IEEE International Conference on Computer Vision(ICCV), pp. 1116-1124, (2015)
  • [7] LI W, ZHAO R, XIAO T, Et al., DeepReID: Deep filter pairing neural network for person Re-identification, 2014 IEEE Conference on Computer Vision and Pattern Recognition, pp. 152-159, (2014)
  • [8] RISTANI E, SOLERA F, ZOU R, Et al., Performance measures and a data set for multi-target, multi-camera tracking, Computer Vision-ECCV 2016 Workshops, (2016)
  • [9] WAH C, BRANSON S, WELINDER P, Et al., The caltech-ucsd birds-200-2011 dataset: CNS-TR-2010-001, (2011)
  • [10] KRAUSE J, STARK M, JIA D, Et al., 3D object representations for fine-grained categorization, 2013 IEEE International Conference on Computer Vision Workshops, pp. 554-561, (2013)