Improving phoneme recognition of telephone quality speech

被引:0
|
作者
Huang, Q [1 ]
Cox, S [1 ]
机构
[1] Univ E Anglia, Sch Comp Sci, Norwich NR4 7TJ, Norfolk, England
关键词
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
There are some speech understanding applications in which training transcriptions are unavailable, and hence the vocabulary is unknown, but the task is to recognise key words and phrases within an utterance rather than to attempt a complete, accurate transcription. An example of such a task is call-routing, when transcriptions of training utterances (which are very expensive to produce) are unavailable. In such cases, phoneme rather than word recognition is appropriate. However, phoneme recognition of spontaneous speech spoken by a large multi-accent population over telephone connections is very inaccurate. To improve accuracy, we describe a technique in which we segment the waveform into subword-like units and use clustering and iteratively refined language model to correct the errors in the recognised phonemes. The results show a (46.76-28.06) reduction in phoneme error-rate.
引用
收藏
页码:445 / 448
页数:4
相关论文
共 50 条
  • [1] Improving English Conversational Telephone Speech Recognition
    Medennikov, Ivan
    Prudnikov, Alexey
    Zatvornitskiy, Alexander
    [J]. 17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 2 - 6
  • [2] Hilbert Envelope Based Spectro-Temporal Features for Phoneme Recognition in Telephone Speech
    Thomas, Samuel
    Ganapathy, Sriram
    Hermansky, Hynek
    [J]. INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 1521 - +
  • [3] PHONEME GROUPING FOR SPEECH RECOGNITION
    REDDY, DR
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1967, 41 (05): : 1295 - &
  • [4] Contribution from the accuracy of phoneme recognition to the quality of automatic recognition of Russian speech
    Karpukhin I.A.
    [J]. Moscow University Computational Mathematics and Cybernetics, 2016, 40 (2) : 89 - 95
  • [5] PERFORMANCE OF HARPY SPEECH RECOGNITION SYSTEM FOR TELEPHONE QUALITY SPEECH INPUT
    YEGNANARAYANA, B
    REDDY, DR
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1978, 63 : S78 - S78
  • [6] Improving phoneme recognition of throat microphone speech recordings using transfer learning
    Turan, M. A. Tugtekin
    Erzin, Engin
    [J]. SPEECH COMMUNICATION, 2021, 129 : 25 - 32
  • [7] REVERBERANT SPEECH RECOGNITION: A PHONEME ANALYSIS
    Parada, Pablo Peso
    Sharma, Dushyant
    Naylor, Patrick A.
    van Waterschoot, Toon
    [J]. 2014 IEEE GLOBAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING (GLOBALSIP), 2014, : 567 - 571
  • [8] The Gamma MLP for speech phoneme recognition
    Lawrence, S
    Tsoi, AC
    Back, AD
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 8: PROCEEDINGS OF THE 1995 CONFERENCE, 1996, 8 : 785 - 791
  • [9] Improving the Quality of Automatic Speech Recognition in Trucks
    Korenevsky, Maxim
    Medennikov, Ivan
    Shchemelinin, Vadim
    [J]. Speech and Computer, 2016, 9811 : 362 - 369
  • [10] Conversational telephone speech recognition
    Gauvain, JL
    Lamel, L
    Schwenk, H
    Adda, G
    Chen, L
    Lefèvre, F
    [J]. 2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING I, 2003, : 212 - 215