Shot boundary detection in endoscopic surgery videos using a variational Bayesian framework

被引:11
|
作者
Loukas, Constantinos [1 ,3 ]
Nikiteas, Nikolaos [1 ]
Schizas, Dimitrios [2 ]
Georgiou, Evangelos [1 ]
机构
[1] Univ Athens, Sch Med, Simulat Ctr, Lab Med Phys, Athens, Greece
[2] Univ Athens, Laiko Gen Hosp, Dept Surg 1, Athens, Greece
[3] Univ Athens, Sch Med, Simulat Ctr, Med Phys Lab, Mikras Asias 75 Str, GR-11527 Athens, Greece
关键词
Video content analysis; Shot detection; Border detection; Variational Bayes; Tracking; Surgery; SURGICAL WORKFLOW; RETRIEVAL; CLASSIFICATION; DATABASE;
D O I
10.1007/s11548-016-1431-2
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
Over the last decade, the demand for content management of video recordings of surgical procedures has greatly increased. Although a few research methods have been published toward this direction, the related literature is still in its infancy. In this paper, we address the problem of shot detection in endoscopic surgery videos, a fundamental step in content-based video analysis. The video is first decomposed into short clips that are processed sequentially. After feature extraction, we employ spatiotemporal Gaussian mixture models (GMM) for each clip and apply a variational Bayesian (VB) algorithm to approximate the posterior distribution of the model parameters. The proper number of components is handled automatically by the VBGMM algorithm. The estimated components are matched along the video sequence via their Kullback-Leibler divergence. Shot borders are defined when component tracking fails, signifying a different visual appearance of the surgical scene. Experimental evaluation was performed on laparoscopic videos containing a variable number of shots. Performance was measured via precision, recall, coverage and overflow metrics. The proposed method was compared with GMM and a shot detection method based on spatiotemporal motion differences (MotionDiff). The results demonstrate that VBGMM has higher performance than all other methods for most assessment metrics: precision and recall > 80 %, coverage: 84 %. Overflow for VBGMM was worse than MotionDiff (37 vs. 27 %). The proposed method generated promising results for shot border detection. Spatiotemporal modeling via VBGMMs provides a means to explore additional applications such as component tracking.
引用
收藏
页码:1937 / 1949
页数:13
相关论文
共 50 条
  • [41] A unified framework using spatial color descriptor and motion-based post refinement for shot boundary detection
    Chu, WT
    Cheng, WH
    He, SF
    Wang, CW
    Wu, JL
    [J]. ADVANCES IN MULTIMEDIA INFORMATION PROCESSING - PCM 2004, PT 3, PROCEEDINGS, 2004, 3333 : 558 - 565
  • [42] V2T: video to text framework using a novel automatic shot boundary detection algorithm
    Alok Singh
    Thoudam Doren Singh
    Sivaji Bandyopadhyay
    [J]. Multimedia Tools and Applications, 2022, 81 : 17989 - 18009
  • [43] V2T: video to text framework using a novel automatic shot boundary detection algorithm
    Singh, Alok
    Singh, Thoudam Doren
    Bandyopadhyay, Sivaji
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (13) : 17989 - 18009
  • [44] Gradual shot boundary detection using localized edge blocks
    Hun-Woo Yoo
    Han-Jin Ryoo
    Dong-Sik Jang
    [J]. Multimedia Tools and Applications, 2006, 28 : 283 - 300
  • [45] Automatic shot boundary detection using Gaussian Mixture Model
    Reddy, A. Adhipathi
    Varadharajan, Sridhar
    [J]. VISAPP 2008: PROCEEDINGS OF THE THIRD INTERNATIONAL CONFERENCE ON COMPUTER VISION THEORY AND APPLICATIONS, VOL 1, 2008, : 547 - 550
  • [46] Shot Boundary Detection Using Color Clustering and Attention Mechanism
    Yuan, Ye
    Zhang, Jiawan
    [J]. ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2023, 19 (06)
  • [47] Smoke detection in endoscopic surgery videos: a first step towards retrieval of semantic events
    Loukas, Constantinos
    Georgiou, Evangelos
    [J]. INTERNATIONAL JOURNAL OF MEDICAL ROBOTICS AND COMPUTER ASSISTED SURGERY, 2015, 11 (01): : 80 - 94
  • [48] Gradual shot boundary detection using localized edge blocks
    Yoo, Hun-Woo
    Ryoo, Han-Jin
    Jang, Dong-Sik
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2006, 28 (03) : 283 - 300
  • [49] Automatic video shot boundary detection using machine learning
    Ren, W
    Singh, S
    [J]. INTELLIGENT DATA ENGINEERING AND AUTOMATED LEARNING IDEAL 2004, PROCEEDINGS, 2004, 3177 : 285 - 292
  • [50] Enhanced shot boundary detection using video text information
    Liang, LH
    Liu, Y
    Lu, H
    Xue, XY
    Tan, YP
    [J]. IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 2005, 51 (02) : 580 - 588