Reaching over the gap: A review of efforts to link human and automatic speech recognition research

被引:63
|
作者
Scharenborg, Odette [1 ]
机构
[1] Univ Sheffield, Speech & Hearing Res Grp, Dept Comp Sci, Sheffield S1 4DP, S Yorkshire, England
关键词
automatic speech recognition; human speech recognition;
D O I
10.1016/j.specom.2007.01.009
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
The fields of human speech recognition (HSR) and automatic speech recognition (ASR) both investigate parts of the speech recognition process and have word recognition as their central issue. Although the research fields appear closely related, their aims and research methods are quite different. Despite these differences there is, however, lately a growing interest in possible cross-fertilisation. Researchers from both ASR and HSR are realising the potential benefit of looking at the research field on the other side of the 'gap'. In this paper, we provide an overview of past and present efforts to link human and automatic speech recognition research and present an overview of the literature describing the performance difference between machines and human listeners. The focus of the paper is on the mutual benefits to be derived from establishing closer collaborations and knowledge interchange between ASR and HSR. The paper ends with an argument for more and closer collaborations between researchers of ASR and HSR to further improve research in both fields. (c) 2007 Elsevier B.V. All rights reserved.
引用
收藏
页码:336 / 347
页数:12
相关论文
共 50 条
  • [1] Bridging the gap between human and automatic speech recognition
    ten Bosch, Louis
    Kirchhoff, Katrin
    [J]. SPEECH COMMUNICATION, 2007, 49 (05) : 331 - 335
  • [2] Automatic speech recognition: A review
    Haton, JP
    [J]. ENTERPRISE INFORMATION SYSTEMS V, 2004, : 6 - 11
  • [3] Automatic speech recognition and speech variability: A review
    Benzeghiba, M.
    De Mori, R.
    Deroo, O.
    Dupont, S.
    Erbes, T.
    Jouvet, D.
    Fissore, L.
    Laface, P.
    Mertins, A.
    Ris, C.
    Rose, R.
    Tyagi, V.
    Wellekens, C.
    [J]. SPEECH COMMUNICATION, 2007, 49 (10-11) : 763 - 786
  • [4] The use of automatic speech recognition in cars: A human factors review
    Graham, R
    [J]. CONTEMPORARY ERGONOMICS 1998, 1998, : 441 - 445
  • [5] Unsupervised Automatic Speech Recognition: A review
    Aldarmaki, Hanan
    Ullah, Asad
    Ram, Sreepratha
    Zaki, Nazar
    [J]. SPEECH COMMUNICATION, 2022, 139 : 76 - 91
  • [6] Trends and developments in automatic speech recognition research
    O'Shaughnessy, Douglas
    [J]. COMPUTER SPEECH AND LANGUAGE, 2023, 83
  • [7] Bangladeshi Bangla speech corpus for automatic speech recognition research
    Kibria, Shafkat
    Samin, Ahnaf Mozib
    Kobir, M. Humayon
    Rahman, M. Shahidur
    Selim, M. Reza
    Iqbal, M. Zafar
    [J]. SPEECH COMMUNICATION, 2022, 136 : 84 - 97
  • [8] Chhattisgarhi speech corpus for research and development in automatic speech recognition
    Londhe, Narendra D.
    Kshirsagar, Ghanahshyam B.
    [J]. INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2018, 21 (02) : 193 - 210
  • [9] Automatic Speech Recognition: Systematic Literature Review
    Alharbi, Sadeen
    Alrazgan, Muna
    Alrashed, Alanoud
    Alnomasi, Turkiayh
    Almojel, Raghad
    Alharbi, Rimah
    Alharbi, Saja
    Alturki, Sahar
    Alshehri, Fatimah
    Almojil, Maha
    [J]. IEEE ACCESS, 2021, 9 : 131858 - 131876
  • [10] Automatic Bimodal Audiovisual Speech Recognition: A Review
    Kandagal, Amaresh P.
    Udayashankara, V.
    [J]. 2014 INTERNATIONAL CONFERENCE ON CONTEMPORARY COMPUTING AND INFORMATICS (IC3I), 2014, : 940 - 945