Interactive Surgical Training in Neuroendoscopy: Real-Time Anatomical Feature Localization Using Natural Language Expressions

被引:2
|
作者
Matasyoh, Nevin M. [1 ]
Schmidt, Ruediger [2 ,3 ]
Zeineldin, Ramy A. [1 ]
Spetzger, Uwe [4 ,5 ]
Mathis-Ullrich, Franziska [1 ]
机构
[1] Friedrich Alexander Univ Erlangen Nurnberg, Dept Artificial Intelligence Biomed Engn, D-91052 Erlangen, Germany
[2] Klinikum Karlsruhe, Dept Neurosurg, Karlsruhe, Germany
[3] Klin Hirslanden, Ctr Endoscop & Minimally Invas Neurosurg, Zurich, Switzerland
[4] Karlsruhe Inst Technol, Inst Anthropomat & Robot, Karlsruhe, Germany
[5] Dept Neurosurg, Aachen, Germany
关键词
Surgery; Training; Neurosurgery; Biomedical imaging; Transformers; Visualization; Task analysis; Anatomical feature localization; endoscopic third ventriculostomy; feature fusion; multimodal deep learning; neuroendoscopy; surgical training; transformer;
D O I
10.1109/TBME.2024.3405814
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
- Objective: This study addresses challenges in surgical education, particularly in neuroendoscopy, where the demand for optimized workflow conflicts with the need for trainees' active participation in surgeries. To overcome these challenges, we propose a framework that accurately identifies anatomical structures within images guided by language descriptions, facilitating authentic and interactive learning experiences in neuroendoscopy. Methods: Utilizing the encoder-decoder architecture of a conventional transformer, our framework processes multimodal inputs (images and language descriptions) to identify and localize features in neuroendoscopic images. We curate a dataset from recorded endoscopic third ventriculostomy (ETV) procedures for training and evaluation. Utilizing evaluation metrics, including "R@n," "IoU=theta," "mIoU," and top-1 accuracy, we systematically benchmark our framework against state-of-the-art methodologies. Results: The framework demonstrates excellent generalization, surpassing the compared methods with 93.67% % accuracy and 76.08% % mIoU on unseen data. It also exhibits better computational speed compared with other methods. Qualitative results affirms the framework's effectiveness in precise localization of referred anatomical features within neuroendoscopic images. Conclusion: The framework's adeptness at localizing anatomical features using language descriptions positions it as a valuable tool for integration into future interactive clinical learning systems, enhancing surgical training in neuroendoscopy. Significance: The exemplary performance reinforces the framework's potential in enhancing surgical education, leading to improved skills and outcomes for trainees in neuroendoscopy.
引用
收藏
页码:2991 / 2999
页数:9
相关论文
共 50 条
  • [31] Real-Time Localization of a Person Using Smart Phone
    Poorani, M.
    Kumaresh, S.
    Karthick, K.
    Leena, V.
    Vaidehi, V.
    PROCEEDINGS OF THE 2018 INTERNATIONAL CONFERENCE ON RECENT TRENDS IN ADVANCED COMPUTING (ICRTAC-CPS 2018), 2018, : 66 - 72
  • [32] Towards Real-Time Ball Localization Using CNNs
    Speck, Daniel
    Bestmann, Marc
    Barros, Pablo
    ROBOT WORLD CUP XXII, ROBOCUP 2018, 2019, 11374 : 337 - 348
  • [33] A training scheme for autism rehabilitation based on language comprehension using real-time fMRI neurofeedback
    Zhu, Huaping
    Cao, Yangbo
    Zhang, Zuo
    2018 INTERNATIONAL SEMINAR ON COMPUTER SCIENCE AND ENGINEERING TECHNOLOGY (SCSET 2018), 2019, 1176
  • [34] CONCATENATIVE ARTICULATORY VIDEO SYNTHESIS USING REAL-TIME MRI DATA FOR SPOKEN LANGUAGE TRAINING
    Desai, Urvish
    Yarra, Chiranjeevi
    Ghosh, Prasanta Kumar
    2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 4999 - 5003
  • [35] Objective assessment of training surgical skills using simulated tissue interface with real-time feedback
    Rafiq, Azhar
    Tamariz, Francisco
    Boanco, Cosmin
    Lavrentyev, Vladimir
    Merrell, Ronald C.
    JOURNAL OF SURGICAL EDUCATION, 2008, 65 (04) : 270 - 274
  • [36] A Real-Time Interactive Augmented Reality Depth Estimation Technique for Surgical Robotics
    Kalia, M.
    Navab, N.
    Salcudean, T.
    2019 INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2019, : 8291 - 8297
  • [37] USING NASREM FOR REAL-TIME SENSORY INTERACTIVE ROBOT CONTROL
    LUMIA, R
    ROBOTICA, 1994, 12 : 127 - 135
  • [38] Real-time interactive ocean wave simulation using multithread
    Prachumrak, K.
    Kanchanapornchai, T.
    World Academy of Science, Engineering and Technology, 2011, 56 : 235 - 238
  • [39] Real-time interactive ocean wave simulation using multithread
    Prachumrak, K.
    Kanchanapornchai, T.
    World Academy of Science, Engineering and Technology, 2011, 80 : 235 - 238
  • [40] Real-time Document Localization in Natural Images by Recursive Application of a CNN
    Javed, Khurram
    Shafait, Faisal
    2017 14TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), VOL 1, 2017, : 105 - 110