Multi-task learning with cross-task consistency for improved depth estimation in colonoscopy

被引:0
|
作者
Chavarrias Solano, Pedro Esteban [1 ]
Bulpitt, Andrew [1 ]
Subramanian, Venkataraman [2 ,3 ]
Ali, Sharib [1 ]
机构
[1] School of Computer Science, Faculty of Engineering and Physical Sciences, University of Leeds, Leeds,LS2 9JT, United Kingdom
[2] Department of Gastroenterology, Leeds Teaching Hospitals NHS Trust, Leeds, United Kingdom
[3] Division of Gastroenterology and Surgical Sciences Leeds Institute of Medical Research at St James's University of Leeds, Leeds, United Kingdom
关键词
Multi-task learning;
D O I
10.1016/j.media.2024.103379
中图分类号
学科分类号
摘要
Colonoscopy screening is the gold standard procedure for assessing abnormalities in the colon and rectum, such as ulcers and cancerous polyps. Measuring the abnormal mucosal area and its 3D reconstruction can help quantify the surveyed area and objectively evaluate disease burden. However, due to the complex topology of these organs and variable physical conditions, for example, lighting, large homogeneous texture, and image modality estimating distance from the camera (aka depth) is highly challenging. Moreover, most colonoscopic video acquisition is monocular, making the depth estimation a non-trivial problem. While methods in computer vision for depth estimation have been proposed and advanced on natural scene datasets, the efficacy of these techniques has not been widely quantified on colonoscopy datasets. As the colonic mucosa has several low-texture regions that are not well pronounced, learning representations from an auxiliary task can improve salient feature extraction, allowing estimation of accurate camera depths. In this work, we propose to develop a novel multi-task learning (MTL) approach with a shared encoder and two decoders, namely a surface normal decoder and a depth estimator decoder. Our depth estimator incorporates attention mechanisms to enhance global context awareness. We leverage the surface normal prediction to improve geometric feature extraction. Also, we apply a cross-task consistency loss among the two geometrically related tasks, surface normal and camera depth. We demonstrate an improvement of 15.75% on relative error and 10.7% improvement on δ1.25 accuracy over the most accurate baseline state-of-the-art Big-to-Small (BTS) approach. All experiments are conducted on a recently released C3VD dataset, and thus, we provide a first benchmark of state-of-the-art methods on this dataset. © 2024 The Authors
引用
收藏
相关论文
共 50 条
  • [1] Cross-task Attention Mechanism for Dense Multi-task Learning
    Lopes, Ivan
    Tuan-Hung Vu
    de Charette, Raoul
    2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, : 2328 - 2337
  • [2] MTFormer: Multi-task Learning via Transformer and Cross-Task Reasoning
    Xu, Xiaogang
    Zhao, Hengshuang
    Vineet, Vibhav
    Lim, Ser-Nam
    Torralba, Antonio
    COMPUTER VISION - ECCV 2022, PT XXVII, 2022, 13687 : 304 - 321
  • [3] Cross-Task Knowledge Distillation in Multi-Task Recommendation
    Yang, Chenxiao
    Pan, Junwei
    Gao, Xiaofeng
    Jiang, Tingyu
    Liu, Dapeng
    Chen, Guihai
    THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 4318 - 4326
  • [4] Cross-Task Attention Network: Improving Multi-task Learning for Medical Imaging Applications
    Kim, Sangwook
    Purdie, Thomas G.
    McIntosh, Chris
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2023 WORKSHOPS, 2023, 14393 : 119 - 128
  • [5] Calibration of cine MRI segmentation probability for uncertainty estimation using a multi-task cross-task learning architecture
    Hasan, S. M. Kamrul
    Linte, Cristian A.
    MEDICAL IMAGING 2022: IMAGE-GUIDED PROCEDURES, ROBOTIC INTERVENTIONS, AND MODELING, 2022, 12034
  • [6] Cross-task feature enhancement strategy in multi-task learning for harvesting Sichuan pepper
    Wang, Yihan
    Deng, Xinglong
    Luo, Jianqiao
    Li, Bailin
    Xiao, Shide
    COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2023, 207
  • [7] Learning Cross-Task Attribute - Attribute Similarity for Multi-task Attribute-Value Extraction
    Jain, Mayank
    Bhattacharya, Sourangshu
    Jain, Harshit
    Shaik, Karimulla
    Chelliah, Muthusamy
    ECNLP 4: THE FOURTH WORKSHOP ON E-COMMERCE AND NLP, 2021, : 79 - 87
  • [8] Multi-Task Consistency for Active Learning
    Hekimoglu, Aral
    Friedrich, Philipp
    Zimmer, Walter
    Schmidt, Michael
    Marcos-Ramiro, Alvaro
    Knoll, Alois
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS, ICCVW, 2023, : 3407 - 3416
  • [9] Multi-task Learning with Selective Cross-Task Transfer for Predicting Bleeding and other Important Patient Outcomes
    Ngufor, Che
    Upadhyaya, Sudhindra
    Murphree, Dennis
    Kor, Daryl
    Pathak, Jyotishman
    PROCEEDINGS OF THE 2015 IEEE INTERNATIONAL CONFERENCE ON DATA SCIENCE AND ADVANCED ANALYTICS (IEEE DSAA 2015), 2015, : 511 - 518
  • [10] Semi-Supervised Depth Estimation by Multi-Task Learning
    Fu, Qingshun
    Dong, Xuan
    2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022, : 3765 - 3771