Multi-Frame Feature Aggregation for Real-Time Instrument Segmentation in Endoscopic Video

被引:15
|
作者
Lin, Shan [1 ]
Qin, Fangbo [2 ]
Peng, Haonan [1 ]
Bly, Randall A. [3 ]
Moe, Kris S. [3 ]
Hannaford, Blake [1 ]
机构
[1] Univ Washington, Dept Elect & Comp Engn, Seattle, WA 98195 USA
[2] Chinese Acad Sci, Res Ctr Precis Sensing & Control, Inst Automat, Beijing 100190, Peoples R China
[3] UW, Dept Otolaryngol Head & Neck Surg, Seattle, WA 98105 USA
基金
美国国家科学基金会;
关键词
Computer vision for medical robotics; deep learning for visual perception; object detection; segmentation and categorization;
D O I
10.1109/LRA.2021.3096156
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
Deep learning-based methods have achieved promising results on surgical instrument segmentation. However, the high computation cost may limit the application of deep models to time-sensitive tasks such as online surgical video analysis for robotic-assisted surgery. Moreover, current methods may still suffer from challenging conditions in surgical images such as various lighting conditions and the presence of blood. We propose a novel Multi-frame Feature Aggregation (MFFA) module to aggregate video frame features temporally and spatially in a recurrent mode. By distributing the computation load of deep feature extraction over sequential frames, we can use a lightweight encoder to reduce the computation costs at each time step. Moreover, public surgical videos usually are not labeled frame by frame, so we develop a method that can randomly synthesize a surgical frame sequence from a single labeled frame to assist network training. We demonstrate that our approach achieves superior performance to corresponding deeper segmentation models on two public surgery datasets.
引用
收藏
页码:6773 / 6780
页数:8
相关论文
共 50 条
  • [41] Multi-frame Motion Segmentation by Combining Two-Frame Results
    Arrigoni, Federica
    Ricci, Elisa
    Pajdla, Tomas
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2022, 130 (03) : 696 - 728
  • [42] Efficient Segmentation for Multi-frame Iris Acquisition on Smartphones
    Thavalengal, Shejin
    Bigioi, Petronel
    Corcoran, Peter
    2016 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS (ICCE), 2016,
  • [43] Real-time video frame differentiation in multihomed VANETs
    Rui Lopes
    Miguel Luís
    Susana Sargento
    Wireless Networks, 2021, 27 : 2559 - 2575
  • [44] Real-time video frame differentiation in multihomed VANETs
    Lopes, Rui
    Luis, Miguel
    Sargento, Susana
    WIRELESS NETWORKS, 2021, 27 (04) : 2559 - 2575
  • [45] Multi-frame Motion Segmentation by Combining Two-Frame Results
    Federica Arrigoni
    Elisa Ricci
    Tomas Pajdla
    International Journal of Computer Vision, 2022, 130 : 696 - 728
  • [46] Real-Time Semantic Segmentation via Auto Depth, Downsampling Joint Decision and Feature Aggregation
    Sun, Peng
    Wu, Jiaxiang
    Li, Songyuan
    Lin, Peiwen
    Huang, Junzhou
    Li, Xi
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2021, 129 (05) : 1506 - 1525
  • [47] Real-Time Semantic Segmentation via Auto Depth, Downsampling Joint Decision and Feature Aggregation
    Peng Sun
    Jiaxiang Wu
    Songyuan Li
    Peiwen Lin
    Junzhou Huang
    Xi Li
    International Journal of Computer Vision, 2021, 129 : 1506 - 1525
  • [48] Fast Multi-Frame Motion Estimation for Video Processing
    Wang, Demin
    Zhang, Liang
    Vincent, Andre
    BMSB: 2009 IEEE INTERNATIONAL SYMPOSIUM ON BROADBAND MULTIMEDIA SYSTEMS AND BROADCASTING, VOLS 1 AND 2, 2009, : 361 - 368
  • [49] Real-time Semantic Segmentation with Context Aggregation Network
    Yang, Michael Ying
    Kumaar, Saumya
    Lyu, Ye
    Nex, Francesco
    ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2021, 178 : 124 - 134
  • [50] A Real-Time Filtering Approach for Plot-Sequences Outputted by Multi-Frame Track-Before-Detection
    Liu, Kezhu
    Li, Wujun
    Kong, Lingjiang
    ICCAIS 2019: THE 8TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND INFORMATION SCIENCES, 2019,