Improving analysis techniques for automatic speech recognition

被引:0
|
作者
O'Shaughnessy, D [1 ]
机构
[1] INRS Telecommun, Montreal, PQ H5A 1K6, Canada
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Automatic recognition of continuous speech for small vocabularies (e.g., telephone or credit card digit sequences) is possible with excellent accuracy, even in applications using telephone lines and serving a large population of users. However, even such simple recognition tasks suffer decreased performance in adverse conditions, e.g., significant background noise or fading on portable telephone channels. If we further impose significant limitations on the computing resources for the recognition task, then robust efficient speech recognition is still a significant challenge, even for simple vocabularies.. Since many practical recognition tasks take place over telephone lines, and in conditions that are less than optimal, speech recognition must be robust. The traditional hidden Markov model approach to speech recognition, using cepstral analysis, is computationally intensive and often does not work well under adverse acoustic conditions. We examine a simpler spectral analysis method, and. suggest a segmental approach. High recognition accuracy can be maintained, despite a large decrease in both memory and computation, compared to traditional approaches.
引用
收藏
页码:65 / 68
页数:4
相关论文
共 50 条
  • [1] Improving Speech Synthesis by Automatic Speech Recognition and Speech Discriminator
    Huang, Li-Yu
    Chen, Chia-Ping
    [J]. JOURNAL OF INFORMATION SCIENCE AND ENGINEERING, 2024, 40 (01) : 189 - 200
  • [2] A STUDY ON BIAS-BASED SPEECH SIGNAL CONDITIONING TECHNIQUES FOR IMPROVING THE ROBUSTNESS OF AUTOMATIC SPEECH RECOGNITION
    Chowdhury, Md Foezur Rahman
    Selouani, Sid-Ahmed
    O'Shaughnessy, Douglas
    [J]. 2009 IEEE 22ND CANADIAN CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING, VOLS 1 AND 2, 2009, : 366 - +
  • [3] Improving Automatic Recognition of Aphasic Speech with AphasiaBank
    Le, Duc
    Provost, Emily Mower
    [J]. 17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 2681 - 2685
  • [4] Improving Readability for Automatic Speech Recognition Transcription
    Liao, Junwei
    Eskimez, Sefik
    Lu, Liyang
    Shi, Yu
    Gong, Ming
    Shou, Linjun
    Qu, Hong
    Zeng, Michael
    [J]. ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2023, 22 (05)
  • [5] Improving the Quality of Automatic Speech Recognition in Trucks
    Korenevsky, Maxim
    Medennikov, Ivan
    Shchemelinin, Vadim
    [J]. Speech and Computer, 2016, 9811 : 362 - 369
  • [6] Towards Automatic Assessment of Aphasia Speech Using Automatic Speech Recognition Techniques
    Qin, Ying
    Lee, Tan
    Kong, Anthony Pak Hin
    Law, Sam Po
    [J]. 2016 10TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2016,
  • [7] THE DEREVERBERATION OF SPEECH WITH A VIEW TO IMPROVING THE AUTOMATIC RECOGNITION OF SPEECH IN ROOMS
    HIRSCH, HG
    [J]. ACUSTICA, 1989, 67 (03): : 216 - 221
  • [8] An analysis on LPC, RASTA and MFCC techniques in Automatic Speech Recognition System
    Gupta, Kartiki
    Gupta, Divya
    [J]. 2016 6th International Conference - Cloud System and Big Data Engineering (Confluence), 2016, : 493 - 497
  • [9] Comparative study of automatic speech recognition techniques
    Cutajar, Michelle
    Gatt, Edward
    Grech, Ivan
    Casha, Owen
    Micallef, Joseph
    [J]. IET SIGNAL PROCESSING, 2013, 7 (01) : 25 - 46
  • [10] 2 USEFUL TECHNIQUES FOR AUTOMATIC SPEECH RECOGNITION
    COKER, CH
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1978, 64 : S179 - S179