Improving phoneme recognition of telephone quality speech

Cited by: 0
Authors
Huang, Q [1]
Cox, S [1]
Affiliation
[1] Univ E Anglia, Sch Comp Sci, Norwich NR4 7TJ, Norfolk, England
Keywords
DOI
Not available
Chinese Library Classification (CLC) number
O42 [Acoustics];
Discipline classification code
070206; 082403;
Abstract
In some speech understanding applications, training transcriptions are unavailable and hence the vocabulary is unknown, but the task is to recognise key words and phrases within an utterance rather than to attempt a complete, accurate transcription. An example of such a task is call-routing, where transcriptions of training utterances (which are very expensive to produce) are unavailable. In such cases, phoneme rather than word recognition is appropriate. However, phoneme recognition of spontaneous speech spoken by a large multi-accent population over telephone connections is very inaccurate. To improve accuracy, we describe a technique in which we segment the waveform into subword-like units and use clustering and an iteratively refined language model to correct the errors in the recognised phonemes. The results show a (46.76-28.06) reduction in phoneme error-rate.
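The abstract reports its result as a phoneme error-rate but does not define the metric. As a minimal sketch, assuming the conventional definition (substitutions, deletions and insertions from an edit-distance alignment, divided by the number of reference phonemes), it could be computed as below. The function name `phoneme_error_rate` and the example phoneme labels are hypothetical and are not taken from the paper.

```python
def phoneme_error_rate(reference, hypothesis):
    """Return edit distance between the two phoneme sequences,
    divided by the length of the reference sequence.

    Assumption: this is the conventional PER definition, not
    necessarily the exact scoring used by the authors.
    """
    if not reference:
        raise ValueError("reference must contain at least one phoneme")

    # prev[j] holds the edit distance between the reference prefix
    # processed so far and the first j hypothesis phonemes.
    prev = list(range(len(hypothesis) + 1))
    for i, ref_ph in enumerate(reference, start=1):
        curr = [i]
        for j, hyp_ph in enumerate(hypothesis, start=1):
            substitution_cost = 0 if ref_ph == hyp_ph else 1
            curr.append(min(
                prev[j] + 1,                      # deletion
                curr[j - 1] + 1,                  # insertion
                prev[j - 1] + substitution_cost,  # match or substitution
            ))
        prev = curr
    return prev[-1] / len(reference)


# Hypothetical example: one substitution and one insertion
# against a three-phoneme reference.
ref = ["k", "ae", "t"]
hyp = ["k", "ah", "t", "s"]
per = phoneme_error_rate(ref, hyp)
```

Because insertions are counted against the reference length, a rate computed this way can exceed 1.0 when the recogniser outputs many spurious phonemes.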
Pages: 445-448
Page count: 4