Multi-Frame Feature Aggregation for Real-Time Instrument Segmentation in Endoscopic Video

被引:15
|
作者
Lin, Shan [1 ]
Qin, Fangbo [2 ]
Peng, Haonan [1 ]
Bly, Randall A. [3 ]
Moe, Kris S. [3 ]
Hannaford, Blake [1 ]
机构
[1] Univ Washington, Dept Elect & Comp Engn, Seattle, WA 98195 USA
[2] Chinese Acad Sci, Res Ctr Precis Sensing & Control, Inst Automat, Beijing 100190, Peoples R China
[3] UW, Dept Otolaryngol Head & Neck Surg, Seattle, WA 98105 USA
基金
美国国家科学基金会;
关键词
Computer vision for medical robotics; deep learning for visual perception; object detection; segmentation and categorization;
D O I
10.1109/LRA.2021.3096156
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
Deep learning-based methods have achieved promising results on surgical instrument segmentation. However, the high computation cost may limit the application of deep models to time-sensitive tasks such as online surgical video analysis for robotic-assisted surgery. Moreover, current methods may still suffer from challenging conditions in surgical images such as various lighting conditions and the presence of blood. We propose a novel Multi-frame Feature Aggregation (MFFA) module to aggregate video frame features temporally and spatially in a recurrent mode. By distributing the computation load of deep feature extraction over sequential frames, we can use a lightweight encoder to reduce the computation costs at each time step. Moreover, public surgical videos usually are not labeled frame by frame, so we develop a method that can randomly synthesize a surgical frame sequence from a single labeled frame to assist network training. We demonstrate that our approach achieves superior performance to corresponding deeper segmentation models on two public surgery datasets.
引用
收藏
页码:6773 / 6780
页数:8
相关论文
共 50 条
  • [21] Multi-frame simultaneous motion estimation and segmentation
    Feghali, R
    ICCE: 2005 INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS, DIGEST OF TECHNICAL PAPERS, 2005, : 237 - 238
  • [22] Multi-Frame Quality Enhancement for Compressed Video
    Yang, Ren
    Xu, Mai
    Wang, Zulin
    Li, Tianyi
    2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 6664 - 6673
  • [23] ENHANCEMENT FOR TEMPORAL RESOLUTION OF VIDEO BASED ON MULTI-FRAME FEATURE TRAJECTORY AND OCCLUSION COMPENSATION
    Cho, Yang-Ho
    Lee, Ho-Yeong
    Park, Du-Sik
    Kim, Chang-Yeong
    2009 16TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOLS 1-6, 2009, : 389 - 392
  • [24] Visual Odometry by Multi-frame Feature Integration
    Badino, Hernan
    Yamamoto, Akihiro
    Kanade, Takeo
    2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2013, : 222 - 229
  • [25] Towards Better Surgical Instrument Segmentation in Endoscopic Vision: Multi-Angle Feature Aggregation and Contour Supervision
    Qin, Fangbo
    Lin, Shan
    Li, Yangming
    Bly, Randall A.
    Moe, Kris S.
    Hannaford, Blake
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2020, 5 (04): : 6639 - 6646
  • [26] Video Deflickering Using Multi-Frame Optimization
    Li, Chao
    Chen, Zhihua
    Sheng, Bin
    Li, Ping
    He, Gaoqi
    2018 IEEE FOURTH INTERNATIONAL CONFERENCE ON MULTIMEDIA BIG DATA (BIGMM), 2018,
  • [27] Multi-Frame Pyramid Refinement Network for Video Frame Interpolation
    Zhang, Haoxian
    Wang, Ronggang
    Zhao, Yang
    IEEE ACCESS, 2019, 7 : 130610 - 130621
  • [28] DSMRSeg: Dual-Stage Feature Pyramid and Multi-Range Context Aggregation for Real-Time Semantic Segmentation
    Yang, Mingdong
    Shi, Ying
    NEURAL INFORMATION PROCESSING (ICONIP 2019), PT IV, 2019, 1142 : 265 - 273
  • [29] An adaptive multi-frame parallel iterative method for accelerating real-time magnetic particle imaging reconstruction
    Shen, Yusong
    Zhang, Liwen
    Shang, Yaxin
    Jia, Guang
    Yin, Lin
    Zhang, Hui
    Tian, Jie
    Yang, Guanyu
    Hui, Hui
    PHYSICS IN MEDICINE AND BIOLOGY, 2023, 68 (24):
  • [30] Improving Performance of Real-Time Object Detection in Edge Device Through Concurrent Multi-Frame Processing
    Kim, Seunghwan
    Kim, Changjong
    Kim, Sunggon
    IEEE ACCESS, 2025, 13 : 1522 - 1533