A Study on Speech Recognition Control for a Surgical Robot

Cited by: 93

Authors:

Zinchenko, Kateryna [1]

Wu, Chien-Yu [2]

Song, Kai-Tai [3]

Affiliations:

[1] Natl Chiao Tung Univ, Hsinchu 30010, Taiwan

[2] Fair Friend Grp, Ind Div 4 0, Taipei 300, Taiwan

[3] Natl Chiao Tung Univ, Inst Elect Control Engn, Hsinchu 30010, Taiwan

Source:

IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS | 2017 / Vol. 13 / No. 02

Keywords:

Automated system; human-robot interface; motion control; robotic surgery; speech recognition control; SURGERY;

DOI:

10.1109/TII.2016.2625818

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology];

Discipline classification code:

0812;

Abstract:

Speech recognition is common in electronic appliances and personal services, but its use for industrial and medical purposes is rare because of the presence of motion ambiguity. For minimally invasive surgical robotic assistants, this ambiguity arises because the robotic motion is not calibrated to the camera images. This paper presents a design for a speech recognition interface for an HIWIN robotic endoscope holder. A new intentional speech control is proposed to control movement over long distances. To decrease ambiguity, a method is proposed for voice-to-motion calibration that compares the degree of change in the endoscope image for a voice command. A speech recognition algorithm is implemented on Ubuntu OS, using CMU Sphinx. The control signal is sent to the robot controller using serial-port communication through an RS232 cable. The experimental results show that the proposed intentional speech control strategy has a navigation precision of up to 3.1 degrees of angular displacement for the endoscope. The overall system processing time, including robotic motion, is 3.22 s for ~1.8-s speech duration. The reference image navigation range is from 2.5 mm for ~0.5-s speech duration up to 6 mm for ~1.8-s speech duration, using a setup with the camera tip located at a distance of 5 cm from the remote center of motion point.
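
The abstract reports only two endpoints for the relationship between speech duration and navigation range (2.5 mm at ~0.5 s, 6 mm at ~1.8 s). As a rough illustration of how a longer utterance could map to a longer movement, the sketch below interpolates between those two points. The linear relationship, the clamping outside the reported range, and the function name `displacement_mm` are assumptions made for illustration; the abstract does not state the actual mapping used in the paper.

```python
def displacement_mm(speech_s: float) -> float:
    """Estimate navigation range (mm) from speech duration (s).

    The two endpoints come from the abstract. Linear interpolation
    between them and clamping outside them are assumptions.
    """
    t0, d0 = 0.5, 2.5  # ~0.5 s of speech -> 2.5 mm (reported)
    t1, d1 = 1.8, 6.0  # ~1.8 s of speech -> 6 mm (reported)
    # Clamp to the reported duration range (assumption).
    t = min(max(speech_s, t0), t1)
    return d0 + (d1 - d0) * (t - t0) / (t1 - t0)


if __name__ == "__main__":
    for seconds in (0.5, 1.15, 1.8):
        print(f"{seconds:.2f} s -> {displacement_mm(seconds):.2f} mm")
```

Values between the endpoints are illustrative only; any real calibration would have to come from the image-based voice-to-motion calibration the abstract describes.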

Pages: 607-615

Page count: 9

