Automatic speech recognition systems

Author
Catariov, A
Affiliation
Source
Keywords
speech recognition; signal analysis; hidden Markov model; dynamic time warping; systems; neural networks
DOI
10.1117/12.612047
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification code
081104 ; 0812 ; 0835 ; 1405
Abstract
This paper presents an analysis of automatic speech recognition (ASR) to establish the state of the art in the field and to serve as a starting point for implementing a real ASR system. The second chapter describes the structure of a typical speech recognition system and the methods used at each step of the recognition process; in particular, it describes two kinds of speech recognition algorithms, Dynamic Time Warping (DTW) and Hidden Markov Models (HMM). The paper then presents some ASR results in order to draw conclusions about what needs to be improved and which approach is better suited to implementing an ASR system.
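
For readers unfamiliar with Dynamic Time Warping, the idea named in the abstract can be illustrated with a minimal sketch. This is not taken from the paper: the function name `dtw_distance`, the use of plain 1-D numeric sequences, and the absolute-difference local cost are all assumptions made for illustration. A real recognizer would typically compare sequences of feature vectors rather than scalars.

```python
import math


def dtw_distance(a, b):
    """Return the cumulative DTW alignment cost between two 1-D sequences.

    Hypothetical helper for illustration only; the local cost is the
    absolute difference between samples (an assumption, not the paper's).
    """
    n, m = len(a), len(b)
    # cost[i][j] holds the minimal cumulative cost of aligning a[:i] with b[:j].
    cost = [[math.inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            local = abs(a[i - 1] - b[j - 1])
            # Extend the cheapest of the three allowed predecessor alignments:
            # advance in a only, advance in b only, or advance in both.
            cost[i][j] = local + min(
                cost[i - 1][j],
                cost[i][j - 1],
                cost[i - 1][j - 1],
            )
    return cost[n][m]
```

With this sketch, a sequence and a time-stretched copy of it (for example `[1, 2, 3]` and `[1, 2, 2, 3]`) align at zero cost, which is the property that makes DTW useful for comparing utterances spoken at different speeds.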
Pages: 83-93
Number of pages: 11