A Study on Speech Recognition Control for a Surgical Robot

被引:93
|
作者
Zinchenko, Kateryna [1 ]
Wu, Chien-Yu [2 ]
Song, Kai-Tai [3 ]
机构
[1] Natl Chiao Tung Univ, Hsinchu 30010, Taiwan
[2] Fair Friend Grp, Ind Div 4 0, Taipei 300, Taiwan
[3] Natl Chiao Tung Univ, Inst Elect Control Engn, Hsinchu 30010, Taiwan
关键词
Automated system; human-robot inter-face; motion control; robotic surgery; speech recognition control; SURGERY;
D O I
10.1109/TII.2016.2625818
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Speech recognition is common in electronic appliances and personal services, but its use for industrial and medical purposes is rare because of the presence of motion ambiguity. For minimally invasive surgical robotic assistants, this ambiguity arises because the robotic motion is not calibrated to the camera images. This paper presents a design for a speech recognition interface for an HIWIN robotic endoscope holder. A new intentional speech control is proposed to control movement over long distances. To decrease ambiguity, a method is proposed for voice-to-motion calibration that compares the degree of change in the endoscope image for a voice command. A speech recognition algorithm is implemented on Ubuntu OS, using CMU Sphinx. The control signal is sent to the robot controller using serial-port communication through a RS232 cable. The experimental results show that the proposed intentional speech control strategy has a navigation precision of up to 3.1 degrees of angular displacement for the endoscope. The overall system processing time, including robotic motion, is 3.22 s for similar to 1.8-s speech duration. The reference image navigation range is from 2.5 mm for similar to 0.5- s speech duration up to 6 mm for similar to 1.8-s speech duration, using a setup with camera tip that is located at a distance of 5 cm from the remote center of motion point.
引用
收藏
页码:607 / 615
页数:9
相关论文
共 50 条
  • [31] Wheelchair Control Using Speech Recognition
    Ghule, P. B.
    Bhalerao, M. G.
    Chile, R. H.
    Asutkar, V. G.
    2016 NINTH INTERNATIONAL CONFERENCE ON CONTEMPORARY COMPUTING (IC3), 2016, : 140 - 145
  • [32] Embedding speech recognition to control lights
    Sosi, Alessandro
    Brugnara, Fabio
    Cristoforetti, Luca
    Matassoni, Marco
    Ravanelli, Mirco
    Omologo, Maurizio
    14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 759 - 760
  • [33] Robot-Assisted and Manual Cochlear Implantation: An Intra-Individual Study of Speech Recognition
    Maheo, Clementine
    Marie, Antoine
    Torres, Renato
    Archutick, Jerrid
    Leclere, Jean-Christophe
    Marianowski, Remi
    JOURNAL OF CLINICAL MEDICINE, 2023, 12 (20)
  • [34] Development of a speech interface for control of a biped robot
    Dwivedi, S
    Dutta, A
    Mukerjee, A
    Kulkarni, P
    RO-MAN 2004: 13TH IEEE INTERNATIONAL WORKSHOP ON ROBOT AND HUMAN INTERACTIVE COMMUNICATION, PROCEEDINGS, 2004, : 601 - 605
  • [35] SGR-AutoLap: Surgical gesture recognition-based autonomous laparoscope control for human-robot shared control in semi-autonomous minimally invasive surgical robot
    Sun, Yanwen
    Shi, Xiaojing
    Zhai, Shixun
    Zhang, Kaige
    Pan, Bo
    Fu, Yili
    ROBOTIC INTELLIGENCE AND AUTOMATION, 2025, 45 (01): : 106 - 120
  • [36] Supervisory Control of a DaVinci Surgical Robot
    Chow, Der-Lin
    Xu, Peng
    Tuna, Eser
    Huang, Siqi
    Cavusoglu, M. Cenk
    Newman, Wyatt
    2017 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2017, : 5043 - 5049
  • [37] A Study on Noisy Speech Recognition
    Saeed, Khalid
    Szczepanski, Adam
    ICBAKE: 2009 INTERNATIONAL CONFERENCE ON BIOMETRICS AND KANSEI ENGINEERING, 2009, : 142 - 147
  • [38] Bibliometric Study of Speech Recognition
    Yu Wen-Jen
    2011 INTERNATIONAL CONFERENCE ON SOCIAL SCIENCES AND SOCIETY (ICSSS 2011), VOL 4, 2011, : 206 - 214
  • [39] A Study of Speech Recognition for the Elderly
    Teikyo University of Science & Technology, 2525 Yatsusawa, Uenohara-machi, Yamanashi
    409-0193, Japan
    不详
    356-8502, Japan
    Eur. Conf. Speech Commun. Technol., EUROSPEECH, 1600, (101-104):
  • [40] A Robust Speech Recognition System against the Ego Noise of a Robot
    Ince, Goekhan
    Nakadai, Kazuhiro
    Rodemann, Tobias
    Tsujino, Hiroshi
    Imura, Jun-ichi
    11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 3 AND 4, 2010, : 2070 - +