Multi-view orientational attention network combining point-based affinity for polyp segmentation

被引:3
|
作者
Liu, Yan [1 ,2 ]
Yang, Yan [1 ,2 ]
Jiang, Yongquan [1 ,2 ]
Xie, Zhuyang [1 ,2 ]
机构
[1] Southwest Jiaotong Univ, Sch Comp & Artificial Intelligence, Chengdu 610031, Sichuan, Peoples R China
[2] Minist Educ, Engn Res Ctr Sustainable Urban Intelligent Transpo, Chengdu, Peoples R China
基金
中国国家自然科学基金;
关键词
Polyp segmentation; Deep learning; Multi-view orientational attention; Point-based affinity; COLONOSCOPY; DIAGNOSIS;
D O I
10.1016/j.eswa.2024.123663
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Most existing deep learning-based polyp segmentation methods neglect two important aspects of polyps: the geometric orientation information of polyps and the point information of the entire colonoscopy area. In this paper, we introduce a multi-view orientational attention network (MVOA-Net), which incorporates orientation and point awareness to effectively address the issue of intra-class inconsistency resulting from variations in polyp shape, size, and position, as well as the inter-class indistinction caused by the high similarity between polyp lesions and surrounding tissues. To achieve robust orientation awareness, we propose a novel geometric orientation transformer encoder (GOTE) based on horizontal and vertical views. Moreover, To simultaneously capture the global context information of GOTE and emphasize the important local information of the convolution-based attention encoder (CBAE), a global and local cross attention fusion module (CAFM) is also proposed to simultaneously model the long-range dependencies of polyps and pay sufficient attention to the local boundaries of polyps. Additionally, a efficient atrous spatial pyramid pooling (E-ASPP) module is proposed to enhance the semantic representation of high-level features. Finally, a point-based affinity module (PBAM) and a multi-scale fusion module (MSFM) are proposed to distinguish the disguise of polyps, further alleviating inter-class indistinction. The ablation study results demonstrate the effectiveness of each component. Quantitative and qualitative experimental results show that MVOA-Net achieves the best segmentation accuracy across domain polyp datasets and has obvious advantages in segmenting multiple polyp objects.
引用
收藏
页数:16
相关论文
共 50 条
  • [1] Point-Based Multi-View Stereo Network
    Chen, Rui
    Han, Songfang
    Xu, Jing
    Su, Hao
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 1538 - 1547
  • [2] Visibility-Aware Point-Based Multi-View Stereo Network
    Chen, Rui
    Han, Songfang
    Xu, Jing
    Su, Hao
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2021, 43 (10) : 3695 - 3708
  • [3] Multi-view stereo network with point attention
    Zhao, Rong
    Gu, Zhuoer
    Han, Xie
    He, Ligang
    Sun, Fusheng
    Jiao, Shichao
    APPLIED INTELLIGENCE, 2023, 53 (22) : 26622 - 26636
  • [4] Multi-view stereo network with point attention
    Rong Zhao
    Zhuoer Gu
    Xie Han
    Ligang He
    Fusheng Sun
    Shichao Jiao
    Applied Intelligence, 2023, 53 : 26622 - 26636
  • [5] Brain Tumor Segmentation using Multi-View Attention based Ensemble Network
    Mushtaq, Noreen
    Khan, Arfat Ahmad
    Khan, Faizan Ahmed
    Ali, Muhammad Junaid
    Shahid, Malik Muhammad Ali
    Wechtaisong, Chitapong
    Uthansakul, Peerapong
    CMC-COMPUTERS MATERIALS & CONTINUA, 2022, 72 (03): : 5793 - 5806
  • [6] Multi-view Network with Transformer for Point Cloud Semantic Segmentation
    Hua, Zhongwei
    Du, Daming
    6TH INTERNATIONAL CONFERENCE ON INNOVATION IN ARTIFICIAL INTELLIGENCE, ICIAI2022, 2022, : 161 - 165
  • [7] Multi-level feature fusion network combining attention mechanisms for polyp segmentation
    Liu, Junzhuo
    Chen, Qiaosong
    Zhang, Ye
    Wang, Zhixiang
    Deng, Xin
    Wang, Jin
    INFORMATION FUSION, 2024, 104
  • [8] Attention based multi-scale parallel network for polyp segmentation
    Song, Pengfei
    Li, Jinjiang
    Fan, Hui
    COMPUTERS IN BIOLOGY AND MEDICINE, 2022, 146
  • [9] Multi-View Attention Network for Visual Dialog
    Park, Sungjin
    Whang, Taesun
    Yoon, Yeochan
    Lim, Heuiseok
    APPLIED SCIENCES-BASEL, 2021, 11 (07):
  • [10] PYRAMIDAL MULTI-VIEW PBR A Point-based Algorithm for Multi-view Multi-resolution Rendering of Large Data Sets from Range Images
    Farooq, Sajid
    Siebert, J. Paul
    GRAPP 2009: PROCEEDINGS OF THE FOURTH INTERNATIONAL CONFERENCE ON COMPUTER GRAPHICS THEORY AND APPLICATIONS, 2009, : 211 - 216