Deep multi-task learning for image/video distortions identification

被引:0
|
作者
Zoubida Ameur
Sid Ahmed Fezza
Wassim Hamidouche
机构
[1] University of Rennes,INSA Rennes, CNRS, IETR, UMR
[2] National Institute of Telecommunications and ICT,undefined
来源
关键词
Distortion; Identification; Multi-task learning; Deep learning; Natural image; Laparoscopic video; Multiple distortions; Scalability;
D O I
暂无
中图分类号
学科分类号
摘要
Identifying distortions in images and videos is important and useful in various visual applications, such as image quality enhancement and assessment techniques. Instead of applying them blindly, these techniques can be applied or adjusted depending on the type of distortion identified. In this paper, we propose a deep multi-task learning (MTL) model for identifying the types of distortion in both images and videos, considering both single and multiple distortions. The proposed MTL model is composed of one convolutional neural network (CNN) shared between all tasks and N parallel classifiers, where each classifier is dedicated to identify a type of distortion. The proposed architecture also allows to adjust the number of tasks according to the number of distortion types considered, making the solution scalable. The proposed method has been evaluated on natural scene images and laparoscopic videos databases, each presenting a rich set of distortions. The experimental results demonstrate that our model achieves the best performance among the state-of-the-art methods for both single and multiple distortions (Code is available at: https://github.com/zoubidaameur/Deep-Multi-Task-Learning-for-Image-Video-Distortions-Identification).
引用
收藏
页码:21607 / 21623
页数:16
相关论文
共 50 条
  • [1] Deep multi-task learning for image/video distortions identification
    Ameur, Zoubida
    Fezza, Sid Ahmed
    Hamidouche, Wassim
    [J]. NEURAL COMPUTING & APPLICATIONS, 2022, 34 (24): : 21607 - 21623
  • [2] Deep multi-task learning for image/video distortions identification
    Ameur, Zoubida
    Fezza, Sid Ahmed
    Hamidouche, Wassim
    [J]. Neural Computing and Applications, 2022, 34 (24) : 21607 - 21623
  • [3] Multi-task Deep Learning for Image Understanding
    Yu, Bo
    Lane, Ian
    [J]. 2014 6TH INTERNATIONAL CONFERENCE OF SOFT COMPUTING AND PATTERN RECOGNITION (SOCPAR), 2014, : 37 - 42
  • [4] Deep multi-task learning for malware image classification
    Bensaoud, Ahmed
    Kalita, Jugal
    [J]. JOURNAL OF INFORMATION SECURITY AND APPLICATIONS, 2022, 64
  • [5] Image Captioning with Deep Bidirectional LSTMs and Multi-Task Learning
    Wang, Cheng
    Yang, Haojin
    Meinel, Christoph
    [J]. ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2018, 14 (02)
  • [6] MULTI-TASK DEEP LEARNING FOR SATELLITE IMAGE PANSHARPENING AND SEGMENTATION
    Khalel, Andrew
    Tasar, Onur
    Charpiat, Guillaume
    Tarabalka, Yuliya
    [J]. 2019 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2019), 2019, : 4869 - 4872
  • [7] Hand Image Understanding via Deep Multi-Task Learning
    Zhang, Xiong
    Huang, Hongsheng
    Tan, Jianchao
    Xu, Hongmin
    Yang, Cheng
    Peng, Guozhu
    Wang, Lei
    Liu, Ji
    [J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 11261 - 11272
  • [8] Pareto Multi-task Deep Learning
    Riccio, Salvatore D.
    Dyankov, Deyan
    Jansen, Giorgio
    Di Fatta, Giuseppe
    Nicosia, Giuseppe
    [J]. ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2020, PT II, 2020, 12397 : 132 - 141
  • [9] Multi-task deep learning for medical image computing and analysis: A review
    Zhao, Yan
    Wang, Xiuying
    Che, Tongtong
    Bao, Guoqing
    Li, Shuyu
    [J]. COMPUTERS IN BIOLOGY AND MEDICINE, 2023, 153
  • [10] Deep Multi-Task Learning for Large-Scale Image Classification
    Kuang, Zhenzhong
    Li, Zongmin
    Zhao, Tianyi
    Fan, Jianping
    [J]. 2017 IEEE THIRD INTERNATIONAL CONFERENCE ON MULTIMEDIA BIG DATA (BIGMM 2017), 2017, : 310 - 317