A Study on Speech Recognition Control for a Surgical Robot

Cited by: 93
Authors
Zinchenko, Kateryna [1]
Wu, Chien-Yu [2]
Song, Kai-Tai [3]
Affiliations
[1] Natl Chiao Tung Univ, Hsinchu 30010, Taiwan
[2] Fair Friend Grp, Ind Div 4 0, Taipei 300, Taiwan
[3] Natl Chiao Tung Univ, Inst Elect Control Engn, Hsinchu 30010, Taiwan
Keywords
Automated system; human-robot interface; motion control; robotic surgery; speech recognition control; SURGERY
DOI
10.1109/TII.2016.2625818
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Speech recognition is common in electronic appliances and personal services, but its use for industrial and medical purposes is rare because of motion ambiguity. For minimally invasive surgical robotic assistants, this ambiguity arises because the robotic motion is not calibrated to the camera images. This paper presents a design for a speech recognition interface for a HIWIN robotic endoscope holder. A new intentional speech control is proposed to control movement over long distances. To decrease ambiguity, a method is proposed for voice-to-motion calibration that compares the degree of change in the endoscope image for a voice command. A speech recognition algorithm is implemented on Ubuntu OS, using CMU Sphinx. The control signal is sent to the robot controller using serial-port communication through an RS232 cable. The experimental results show that the proposed intentional speech control strategy has a navigation precision of up to 3.1 degrees of angular displacement for the endoscope. The overall system processing time, including robotic motion, is 3.22 s for a speech duration of approximately 1.8 s. The reference image navigation range is from 2.5 mm for a speech duration of approximately 0.5 s up to 6 mm for a speech duration of approximately 1.8 s, using a setup with the camera tip located 5 cm from the remote center of motion point.
Pages: 607-615
Number of pages: 9