A Study on Speech Recognition Control for a Surgical Robot

被引：93

作者：

Zinchenko, Kateryna ^{[1
]}

Wu, Chien-Yu ^{[2
]}

Song, Kai-Tai ^{[3
]}

机构：

[1] Natl Chiao Tung Univ, Hsinchu 30010, Taiwan

[2] Fair Friend Grp, Ind Div 4 0, Taipei 300, Taiwan

[3] Natl Chiao Tung Univ, Inst Elect Control Engn, Hsinchu 30010, Taiwan

来源：

IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS | 2017年 / 13卷 / 02期

关键词：

Automated system; human-robot inter-face; motion control; robotic surgery; speech recognition control; SURGERY;

D O I：

10.1109/TII.2016.2625818

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Speech recognition is common in electronic appliances and personal services, but its use for industrial and medical purposes is rare because of the presence of motion ambiguity. For minimally invasive surgical robotic assistants, this ambiguity arises because the robotic motion is not calibrated to the camera images. This paper presents a design for a speech recognition interface for an HIWIN robotic endoscope holder. A new intentional speech control is proposed to control movement over long distances. To decrease ambiguity, a method is proposed for voice-to-motion calibration that compares the degree of change in the endoscope image for a voice command. A speech recognition algorithm is implemented on Ubuntu OS, using CMU Sphinx. The control signal is sent to the robot controller using serial-port communication through a RS232 cable. The experimental results show that the proposed intentional speech control strategy has a navigation precision of up to 3.1 degrees of angular displacement for the endoscope. The overall system processing time, including robotic motion, is 3.22 s for similar to 1.8-s speech duration. The reference image navigation range is from 2.5 mm for similar to 0.5- s speech duration up to 6 mm for similar to 1.8-s speech duration, using a setup with camera tip that is located at a distance of 5 cm from the remote center of motion point.

引用

页码：607 / 615

页数：9

共 50 条

[41] Psychoacoustic masking effect for noise robust speech recognition robot
Miyanaga, Yoshikazu
ISSCS 2019 - International Symposium on Signals, Circuits and Systems, 2019,
[42] Speech Recognition via STT API for Autonomous Mobile Robot
Masek, Petr
Ruzicka, Michal
PROCEEDINGS OF THE 2014 16TH INTERNATIONAL CONFERENCE ON MECHATRONICS (MECHATRONIKA 2014), 2014, : 594 - 599
[43] Recognition of Affective Communicative Intent in Robot-Directed Speech
Cynthia Breazeal
Lijin Aryananda
Autonomous Robots, 2002, 12 : 83 - 104
[44] Chinese Speech Recognition and Task Analysis of Aldebaran Nao Robot
Han, Yuanyuan
Zhang, Mengyu
Li, Qiao
Liu, Shuhua
PROCEEDINGS OF THE 30TH CHINESE CONTROL AND DECISION CONFERENCE (2018 CCDC), 2018, : 4138 - 4142
[45] Recognition of affective communicative intent in robot-directed speech
Breazeal, C
Aryananda, L
AUTONOMOUS ROBOTS, 2002, 12 (01) : 83 - 104
[46] Psychoacoustic Masking Effect for Noise Robust Speech Recognition Robot
Miyanaga, Yoshikazu
2019 INTERNATIONAL SYMPOSIUM ON SIGNALS, CIRCUITS AND SYSTEMS (ISSCS 2019), 2019,
[47] Object recognition through human-robot interaction by speech
Kurnia, R
Hossain, A
Nakamura, A
Kuno, Y
RO-MAN 2004: 13TH IEEE INTERNATIONAL WORKSHOP ON ROBOT AND HUMAN INTERACTIVE COMMUNICATION, PROCEEDINGS, 2004, : 619 - 624
[48] Surgical Robot Control Based on Torque Control Method
Zhao, Bin
Shi, Xusheng
PROCEEDINGS OF THE 5TH INTERNATIONAL CONFERENCE ON INFORMATION ENGINEERING FOR MECHANICS AND MATERIALS, 2015, 21 : 1667 - 1670
[49] Control Virtual Human with Speech Recognition and Gesture Recognition Technology
Zhao, Wei
Xie, XiaoFang
Yang, XiangHong
ADVANCES IN ELECTRICAL ENGINEERING AND AUTOMATION, 2012, 139 : 441 - 446
[50] Study of Speech Recognition System Operation for Voice-driven UAV Control
Park, Jeong-Sik
JOURNAL OF THE KOREAN SOCIETY FOR AERONAUTICAL AND SPACE SCIENCES, 2019, 47 (03) : 212 - 219

← 1 2 3 4 5 →