Afan Oromo Speech-Based Computer Command and Control: An Evaluation with Selected Commands

被引:0
|
作者
Teshite, Kebede [1 ]
Mamo, Getachew [2 ]
Calpotura, Kris [1 ]
机构
[1] Jimma Univ, Inst Technol, Fac Elect & Comp Engn, Jimma, Ethiopia
[2] Jimma Univ, Inst Technol, Fac Comp & Informat, Jimma, Ethiopia
关键词
SPHINX;
D O I
10.1155/2023/9959015
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Speech-based computer command and control utilize natural speech to enable computers to understand human language and execute tasks through commands. However, there has been no study or development of a speech-based command and control system for Microsoft Word in Afan Oromo. The primary aim of this research is to investigate and develop a speech-based command and control system for Afan Oromo using a selected set of command-and-control words from MS Word. To accomplish this objective, a speech recognizer was developed using the HTK toolkit, employing a small vocabulary, isolated words, speaker independence, and HMM-based techniques. The translation of the selected MS command words from English to Afan Oromo was completed in order to develop this automatic speech-based computer command system. Audio recordings were obtained from 38 speakers (16 females and 22 males) aged between 18 and 40 years, based on their availability. Word-level speech recognition was performed using MFCC and data processing, which are widely used and are effective approaches in speech recognition. Out of a total of 64 MS command words, 54 words (84.37%) were used for training and 10 words (15.63%) were used for testing. Live and nonlive evaluation techniques were employed to assess the performance of the recognizer. The live recognizer, which considers variations in the environment, outperformed the nonlive recognizer due to the influence of neighboring phones. The performance results for the monophone tied state, triphone, and triphone-based recognizers were 78.12%, 86.87%, and 88.99%, respectively. Thus, the triphone-based recognizer exhibited the best performance among the nonlive recognizers. The challenges of limited resources in this research study were limited to investigate speech-based commands for computers using only selected MS commands, which play a crucial role in text processing. In order to evaluate a speech-based interface in a real environment, there were no components available for object-as-a-service. The experimental findings of this study demonstrated that if an adequate amount of language resources was available, a computer-based Afan Oromo speech-based interface for command-and-control purposes could be developed.
引用
收藏
页数:17
相关论文
共 50 条
  • [1] The collaborative production of computer commands in command and control
    Luff, P
    Heath, C
    [J]. INTERNATIONAL JOURNAL OF HUMAN-COMPUTER STUDIES, 2000, 52 (04) : 669 - 699
  • [2] Command and control of industrial manipulator through speech-based interfaces in Indic Languages
    N. Saravanan
    R. Sivaramakrishnan
    [J]. The Journal of Supercomputing, 2019, 75 : 5106 - 5117
  • [3] Command and control of industrial manipulator through speech-based interfaces in Indic Languages
    Saravanan, N.
    Sivaramakrishnan, R.
    [J]. JOURNAL OF SUPERCOMPUTING, 2019, 75 (08): : 5106 - 5117
  • [4] Speech-based cursor control: understanding the effects of target size, cursor speed, and command selection
    A. Sears
    M. Lin
    A.S. Karimullah
    [J]. Universal Access in the Information Society, 2002, 2 (1) : 30 - 43
  • [5] Challenges in speech-based human-computer interfaces
    Minker, Wolfgang
    Pittermann, Johannes
    Pittermann, Angela
    Strauss, Petra-Maria
    Buehler, Dirk
    [J]. INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2007, 10 (2-3) : 109 - 119
  • [6] COMPUTER SPEECH-BASED TRAINING OF LITERACY SKILLS IN NEUROLOGICALLY IMPAIRED CHILDREN - A CONTROLLED EVALUATION
    LOVETT, MW
    BARRON, RW
    FORBES, JE
    CUKSTS, B
    STEINBACH, KA
    [J]. BRAIN AND LANGUAGE, 1994, 47 (01) : 117 - 154
  • [7] Speech-based Evaluation of Emotions-Depression Correlation
    Verde, Laura
    Campanile, Lelio
    Marulli, Fiammetta
    Marrone, Stefano
    [J]. 2022 IEEE INTL CONF ON DEPENDABLE, AUTONOMIC AND SECURE COMPUTING, INTL CONF ON PERVASIVE INTELLIGENCE AND COMPUTING, INTL CONF ON CLOUD AND BIG DATA COMPUTING, INTL CONF ON CYBER SCIENCE AND TECHNOLOGY CONGRESS (DASC/PICOM/CBDCOM/CYBERSCITECH), 2022, : 324 - 329
  • [8] Evaluation of Information Comprehension in Concurrent Speech-based Designs
    Abu Ul Fazal, Muhammad
    Ferguson, Sam
    Johnston, Andrew
    [J]. ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2021, 16 (04)
  • [9] Context-Centric Speech-Based Human-Computer Interaction
    Hung, Victor C.
    Gonzalez, Avelino J.
    [J]. INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2013, 28 (10) : 1010 - 1037
  • [10] Compensate the Speech Recognition Delays for Accurate Speech-Based Cursor Position Control
    Tong, Qiang
    Wang, Ziyun
    [J]. HUMAN-COMPUTER INTERACTION, PT II, 2009, 5611 : 752 - 760