Efficient automatic speech recognition

被引:0
|
作者
O'Shaughnessy, D [1 ]
机构
[1] INRS, EMT, Montreal, PQ H5A 1K6, Canada
关键词
automatic speech recognition; analysis; features; spectrum;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Automatic recognition of certain speech (e.g., telephone numbers) is very feasible today with very good accuracy, even when using telephone lines and serving a large population. However, even such simple recognition tasks suffer decreased performance in adverse conditions, e.g., significant background noise or fading on portable telephone channels. If we further impose significant limitations on the computing resources for the recognition task, then robust and efficient speech recognition is still a significant challenge, even for small vocabularies (e.g., digits). The amount of computer resources needed for good recognition accuracy was investigated. Rather than use a traditional hidden Markov model approach with cepstral analysis, which can be computationally intensive and often does not work well under adverse acoustic conditions, a simpler spectral analysis was used, combined with a segmental approach. High recognition accuracy can be maintained despite a large decrease in both memory and computation.
引用
收藏
页码:323 / 327
页数:5
相关论文
共 50 条
  • [1] Squeezeformer: An Efficient Transformer for Automatic Speech Recognition
    Kim, Sehoon
    Gholami, Amir
    Shaw, Albert
    Lee, Nicholas
    Mangalam, Karttikeya
    Malik, Jitendra
    Mahoney, Michael W.
    Keutzer, Kurt
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35, NEURIPS 2022, 2022,
  • [2] AN EFFICIENT MULTISTAGE ROVER METHOD FOR AUTOMATIC SPEECH RECOGNITION
    Xu, Haihua
    Zhu, Jie
    Wu, Guanyong
    [J]. ICME: 2009 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOLS 1-3, 2009, : 894 - 897
  • [3] An efficient front-end for automatic speech recognition
    Ahadi, SM
    Sheikhzadeh, H
    Brennan, RL
    Freeman, GH
    [J]. ICECS 2003: PROCEEDINGS OF THE 2003 10TH IEEE INTERNATIONAL CONFERENCE ON ELECTRONICS, CIRCUITS AND SYSTEMS, VOLS 1-3, 2003, : 128 - 131
  • [4] The efficient incorporation of MLP features into automatic speech recognition systems
    Park, J.
    Diehl, F.
    Gales, M. J. F.
    Tomalin, M.
    Woodland, P. C.
    [J]. COMPUTER SPEECH AND LANGUAGE, 2011, 25 (03): : 519 - 534
  • [5] Automatic speech recognition
    O'Shaughnessy, Douglas
    [J]. 2015 CHILEAN Conference on Electrical, Electronics Engineering, Information and Communication Technologies (CHILECON), 2015, : 417 - 424
  • [6] AUTOMATIC SPEECH RECOGNITION
    IVALL, T
    [J]. ELECTRONICS & WIRELESS WORLD, 1984, 90 (1581): : 73 - 76
  • [7] AUTOMATIC RECOGNITION OF SPEECH
    MARILL, T
    [J]. IRE TRANSACTIONS ON HUMAN FACTORS IN ELECTRONICS, 1961, HFE2 (01): : 34 - +
  • [8] AUTOMATIC SPEECH RECOGNITION
    RAO, PVS
    PALIWAL, KK
    [J]. SADHANA-ACADEMY PROCEEDINGS IN ENGINEERING SCIENCES, 1986, 9 : 85 - 120
  • [9] EFFICIENT ADAPTER TRANSFER OF SELF-SUPERVISED SPEECH MODELS FOR AUTOMATIC SPEECH RECOGNITION
    Thomas, Bethan
    Kessler, Samuel
    Karout, Salah
    [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 7102 - 7106
  • [10] CrossASR: Efficient Differential Testing of Automatic Speech Recognition via Text-To-Speech
    Asyrofi, Muhammad Hilmi
    Thung, Ferdian
    Lo, David
    Jiang, Lingxiao
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON SOFTWARE MAINTENANCE AND EVOLUTION (ICSME 2020), 2020, : 640 - 650