Multi-task Deep Learning for Image Understanding

被引:0
|
作者
Yu, Bo [1 ,2 ,3 ]
Lane, Ian [3 ]
机构
[1] Chinese Acad Sci, Inst Remote Sensing & Digital Earth, State Key Lab Remote Sensing Sci, Beijing 100101, Peoples R China
[2] Chinese Acad Sci, Grad Univ, Beijing 100049, Peoples R China
[3] Carnegie Mellon Univ, Moffett Field, CA 94043 USA
关键词
image segmentation; deep learning; multi-task learning; FACE DETECTION;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Deep learning models can obtain state-of-the-art performance across many speech and image processing tasks, often significantly outperforming earlier methods. In this paper, we attempt to further improve the performance of these models by introducing multi-task training, in which a combined deep learning model is trained for two inter-related tasks. We show that by introducing a secondary task (such as shape identification in the object classification task) we are able to significantly improve the performance of the main task for which the model is trained. Using public datasets we evaluated our approach on two image understanding tasks, image segmentation and object classification. On the image segmentation task, we observed that the multi-task model almost doubled the accuracy of segmentation at the pixel-level (from 18.7% to 35.6%) compared to the single task model, and improved the performance of face-detection by 10.2% (from 70.1% to 80.3%). For the object classification task, we observed a 2.1% improvement in classification accuracy (from 91.6% to 93.7%) compared to a single-task model. The proposed multi-task models obtained significantly higher accuracies than previously published results on these datasets, obtaining 22.0% and 6.2% higher accuracies on the face-detetction and object classification tasks respectively. These results demonstrate the effectiveness of multi-task training of deep learning models for image understanding tasks.
引用
收藏
页码:37 / 42
页数:6
相关论文
共 50 条
  • [1] Hand Image Understanding via Deep Multi-Task Learning
    Zhang, Xiong
    Huang, Hongsheng
    Tan, Jianchao
    Xu, Hongmin
    Yang, Cheng
    Peng, Guozhu
    Wang, Lei
    Liu, Ji
    [J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 11261 - 11272
  • [2] Deep multi-task learning for malware image classification
    Bensaoud, Ahmed
    Kalita, Jugal
    [J]. JOURNAL OF INFORMATION SECURITY AND APPLICATIONS, 2022, 64
  • [3] Image Captioning with Deep Bidirectional LSTMs and Multi-Task Learning
    Wang, Cheng
    Yang, Haojin
    Meinel, Christoph
    [J]. ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2018, 14 (02)
  • [4] Deep multi-task learning for image/video distortions identification
    Zoubida Ameur
    Sid Ahmed Fezza
    Wassim Hamidouche
    [J]. Neural Computing and Applications, 2022, 34 : 21607 - 21623
  • [5] MULTI-TASK DEEP LEARNING FOR SATELLITE IMAGE PANSHARPENING AND SEGMENTATION
    Khalel, Andrew
    Tasar, Onur
    Charpiat, Guillaume
    Tarabalka, Yuliya
    [J]. 2019 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2019), 2019, : 4869 - 4872
  • [6] Deep multi-task learning for image/video distortions identification
    Ameur, Zoubida
    Fezza, Sid Ahmed
    Hamidouche, Wassim
    [J]. Neural Computing and Applications, 2022, 34 (24) : 21607 - 21623
  • [7] Deep multi-task learning for image/video distortions identification
    Ameur, Zoubida
    Fezza, Sid Ahmed
    Hamidouche, Wassim
    [J]. NEURAL COMPUTING & APPLICATIONS, 2022, 34 (24): : 21607 - 21623
  • [8] Pareto Multi-task Deep Learning
    Riccio, Salvatore D.
    Dyankov, Deyan
    Jansen, Giorgio
    Di Fatta, Giuseppe
    Nicosia, Giuseppe
    [J]. ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2020, PT II, 2020, 12397 : 132 - 141
  • [9] Multi-task deep learning for medical image computing and analysis: A review
    Zhao, Yan
    Wang, Xiuying
    Che, Tongtong
    Bao, Guoqing
    Li, Shuyu
    [J]. COMPUTERS IN BIOLOGY AND MEDICINE, 2023, 153
  • [10] Deep Multi-Task Learning for Large-Scale Image Classification
    Kuang, Zhenzhong
    Li, Zongmin
    Zhao, Tianyi
    Fan, Jianping
    [J]. 2017 IEEE THIRD INTERNATIONAL CONFERENCE ON MULTIMEDIA BIG DATA (BIGMM 2017), 2017, : 310 - 317