A Study on Speech Recognition Control for a Surgical Robot

被引:93
|
作者
Zinchenko, Kateryna [1 ]
Wu, Chien-Yu [2 ]
Song, Kai-Tai [3 ]
机构
[1] Natl Chiao Tung Univ, Hsinchu 30010, Taiwan
[2] Fair Friend Grp, Ind Div 4 0, Taipei 300, Taiwan
[3] Natl Chiao Tung Univ, Inst Elect Control Engn, Hsinchu 30010, Taiwan
关键词
Automated system; human-robot inter-face; motion control; robotic surgery; speech recognition control; SURGERY;
D O I
10.1109/TII.2016.2625818
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Speech recognition is common in electronic appliances and personal services, but its use for industrial and medical purposes is rare because of the presence of motion ambiguity. For minimally invasive surgical robotic assistants, this ambiguity arises because the robotic motion is not calibrated to the camera images. This paper presents a design for a speech recognition interface for an HIWIN robotic endoscope holder. A new intentional speech control is proposed to control movement over long distances. To decrease ambiguity, a method is proposed for voice-to-motion calibration that compares the degree of change in the endoscope image for a voice command. A speech recognition algorithm is implemented on Ubuntu OS, using CMU Sphinx. The control signal is sent to the robot controller using serial-port communication through a RS232 cable. The experimental results show that the proposed intentional speech control strategy has a navigation precision of up to 3.1 degrees of angular displacement for the endoscope. The overall system processing time, including robotic motion, is 3.22 s for similar to 1.8-s speech duration. The reference image navigation range is from 2.5 mm for similar to 0.5- s speech duration up to 6 mm for similar to 1.8-s speech duration, using a setup with camera tip that is located at a distance of 5 cm from the remote center of motion point.
引用
收藏
页码:607 / 615
页数:9
相关论文
共 50 条
  • [21] Artificial Robot Navigation based on Gesture and Speech Recognition
    Lei, Ze
    Gan, ZhaoHui
    Jiang, Min
    Dong, Ke
    2014 INTERNATIONAL CONFERENCE ON SECURITY, PATTERN ANALYSIS, AND CYBERNETICS (SPAC), 2014, : 323 - 327
  • [22] Assisted Robot Navigation based on Speech Recognition and Synthesis
    Alves, Silas F. R.
    Silva, Ivan N.
    Ranieri, Caetano M.
    Ferasoli Filho, Humberto
    5TH ISSNIP-IEEE BIOSIGNALS AND BIOROBOTICS CONFERENCE (2014): BIOSIGNALS AND ROBOTICS FOR BETTER AND SAFER LIVING, 2014, : 231 - 235
  • [23] Automatic speech recognition to teleoperate a robot via web
    Marín, R
    Vila, P
    Sanz, PJ
    Marzal, A
    2002 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, VOLS 1-3, PROCEEDINGS, 2002, : 1278 - 1283
  • [24] Automatic Robot Processing Using Speech Recognition System
    Elavarasi, S.
    Suseendran, G.
    DATA MANAGEMENT, ANALYTICS AND INNOVATION, ICDMAI 2019, VOL 1, 2020, 1042 : 185 - 195
  • [25] Robot arm controller using fuzzy speech recognition
    Hung, TH
    Lu, HC
    FIRST INTERNATIONAL CONFERENCE ON KNOWLEDGE-BASED INTELLIGENT ELECTRONIC SYSTEMS, PROCEEDINGS 1997 - KES '97, VOLS 1 AND 2, 1997, : 87 - 93
  • [26] Automatic Speech Recognition under Robot Ego Noises
    Wang, Jianrong
    Zhang, Ju
    Wei, Jianguo
    Lu, Wenhuan
    Dang, Jianwu
    2014 9TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2014, : 377 - 377
  • [27] Design and implementation of a robot audition system for automatic speech recognition of simultaneous speech
    Yamamoto, Shun'ichi
    Nakadai, Kazuhiro
    Nakano, Mikio
    Tsujino, Hiroshi
    Valin, Jean-Marc
    Komatani, Kazunori
    Ogata, Tetsuya
    Okuno, Hiroshi G.
    2007 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING, VOLS 1 AND 2, 2007, : 111 - +
  • [28] A Study on Motion Control of a Robotic Endoscope Holder Using Speech Recognition
    Zinchenko, Kateryna
    Wu, Chien-Yu
    Song, Kai-Tai
    PROCEEDINGS 2016 IEEE INTERNATIONAL CONFERENCE ON INDUSTRIAL TECHNOLOGY (ICIT), 2016, : 1472 - 1475
  • [29] Speech-in-speech recognition: A training study
    Van Engen, Kristin J.
    LANGUAGE AND COGNITIVE PROCESSES, 2012, 27 (7-8): : 1089 - 1107
  • [30] Quadcopter Control Using Speech Recognition
    Malik, H.
    Darma, S.
    Soekirno, S.
    INTERNATIONAL CONFERENCE ON THEORETICAL AND APPLIED PHYSICS, 2018, 1011