Intelligent robotic sonographer: Mutual information-based disentangled reward learning from few demonstrations

被引:6
|
作者
Jiang, Zhongliang [1 ]
Bi, Yuan [1 ]
Zhou, Mingchuan [2 ]
Hu, Ying [3 ]
Burke, Michael [4 ]
Navab, Nassir [1 ,5 ]
机构
[1] Tech Univ Munich, Chair Comp Aided Med Procedures & Augmented Real, Munich, Germany
[2] Zhejiang Univ, Coll Biosyst Engn & Food Sci, 866 Yuhangtang Rd, Hangzhou 310027, Peoples R China
[3] Chinese Acad Sci, Shenzhen Inst Adv Technol, 1068 Xueyuan Ave, Shenzhen 518055, Peoples R China
[4] Monash Univ, Dept Elect & Comp Syst Engn, Clayton, Vic, Australia
[5] Johns Hopkins Univ, Lab Comp Aided Med Procedures, Baltimore, MD USA
来源
关键词
Robotic ultrasound; medical robotics; learning from demonstration; latent feature disentanglement; FETAL ULTRASOUND; LOCALIZATION; GUIDANCE; SYSTEM;
D O I
10.1177/02783649231223547
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
Ultrasound (US) imaging is widely used for biometric measurement and diagnosis of internal organs due to the advantages of being real-time and radiation-free. However, due to inter-operator variations, resulting images highly depend on the experience of sonographers. This work proposes an intelligent robotic sonographer to autonomously "explore" target anatomies and navigate a US probe to standard planes by learning from the expert. The underlying high-level physiological knowledge from experts is inferred by a neural reward function, using a ranked pairwise image comparison approach in a self-supervised fashion. This process can be referred to as understanding the "language of sonography." Considering the generalization capability to overcome inter-patient variations, mutual information is estimated by a network to explicitly disentangle the task-related and domain features in latent space. The robotic localization is carried out in coarse-to-fine mode based on the predicted reward associated with B-mode images. To validate the effectiveness of the proposed reward inference network, representative experiments were performed on vascular phantoms ("line" target), two types of ex vivo animal organ phantoms (chicken heart and lamb kidney representing "point" target), and in vivo human carotids. To further validate the performance of the autonomous acquisition framework, physical robotic acquisitions were performed on three phantoms (vascular, chicken heart, and lamb kidney). The results demonstrated that the proposed advanced framework can robustly work on a variety of seen and unseen phantoms as well as in vivo human carotid data. Code: https://github.com/yuan-12138/MI-GPSR. Video: https://youtu.be/u4ThAA9onE0.
引用
收藏
页码:981 / 1002
页数:22
相关论文
共 49 条
  • [1] Reward Learning From Very Few Demonstrations
    Eteke, Cem
    Kebude, Dogancan
    Akgun, Baris
    IEEE TRANSACTIONS ON ROBOTICS, 2021, 37 (03) : 893 - 904
  • [2] Information-Based Disentangled Representation Learning for Unsupervised MR Harmonization
    Zuo, Lianrui
    Dewey, Blake E.
    Carass, Aaron
    Liu, Yihao
    He, Yufan
    Calabresi, Peter A.
    Prince, Jerry L.
    INFORMATION PROCESSING IN MEDICAL IMAGING, IPMI 2021, 2021, 12729 : 346 - 359
  • [3] Align-RUDDER: Learning From Few Demonstrations by Reward Redistribution
    Patil, Vihang
    Hofmarcher, Markus
    Dinu, Marius-Constantin
    Dorfer, Matthias
    Blies, Patrick
    Brandstetter, Johannes
    Arjona-Medina, Jose
    Hochreiter, Sepp
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
  • [4] Mutual information-based feature disentangled network for anomaly detection under variable working conditions
    Hu, Chenye
    Wu, Jingyao
    Sun, Chuang
    Chen, Xuefeng
    Yan, Ruqiang
    MECHANICAL SYSTEMS AND SIGNAL PROCESSING, 2023, 204
  • [5] Traffic Prediction With Transfer Learning: A Mutual Information-Based Approach
    Huang, Yunjie
    Song, Xiaozhuang
    Zhu, Yuanshao
    Zhang, Shiyao
    Yu, James J. Q.
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2023, 24 (08) : 8236 - 8252
  • [6] Conditional Mutual Information-Based Generalization Bound for Meta Learning
    Rezazadeh, Arezou
    Jose, Sharu Theresa
    Durisi, Giuseppe
    Simeone, Osvaldo
    2021 IEEE INTERNATIONAL SYMPOSIUM ON INFORMATION THEORY (ISIT), 2021, : 1176 - 1181
  • [7] Mutual Information-Based Feature Selection and Ensemble Learning for Classification
    Qi, Chengming
    Zhou, Zhangbing
    Wang, Qun
    Hu, Lishuan
    2016 INTERNATIONAL CONFERENCE ON IDENTIFICATION, INFORMATION AND KNOWLEDGE IN THE INTERNET OF THINGS (IIKI), 2016, : 116 - 121
  • [8] Mutual information-based multi-output tree learning algorithm
    Kang, Hyun-Seok
    Jun, Chi-Hyuck
    INTELLIGENT DATA ANALYSIS, 2021, 25 (06) : 1525 - 1545
  • [9] A Mutual Information-Based Assessment of Reverse Engineering on Rewards of Reinforcement Learning
    Chen T.
    Liu J.
    Baker T.
    Wu Y.
    Xiang Y.
    Li Y.
    Niu W.
    Tong E.
    Zomaya A.Y.
    IEEE Transactions on Artificial Intelligence, 2023, 4 (05): : 1089 - 1100
  • [10] Reinforcement learning from suboptimal demonstrations based on Reward Relabeling
    Peng, Yong
    Zeng, Junjie
    Hu, Yue
    Fang, Qi
    Yin, Quanjun
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 255