Innovative approaches for large vocabulary name recognition

被引:0
|
作者
Gao, Y [1 ]
Ramabhadran, B [1 ]
Chen, J [1 ]
Erdogan, H [1 ]
Picheny, M [1 ]
机构
[1] IBM Corp, Thomas J Watson Res Ctr, Yorktown Hts, NY 10598 USA
关键词
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Automatic name dialing is a practical and interesting application of speech recognition on telephony systems. The IBM name recognition system is a large vocabulary, speaker independent system currently in use for reaching IBM employees in the United States. In this paper, we present some innovative algorithms that improve name recognition accuracy. Unlike transcription tasks, such as the Switchboard task, recognition of names poses a variety of different problems. Several of these problems arise from the fact that foreign names are hard to pronounce for speakers who are not familiar with the names and that there are no standardized methods for pronouncing proper names. Noise robustness is another very important factor as these calls are typically made in noisy environments, such as from a car, cafeteria, airport, etc. and over different kinds of cellular and land-line telephone channels. We have performed a systematic analysis of the speech recognition errors and tackled the issues separately with techniques ranging from weighted speaker clustering, massive adaptation, rapid and unsupervised adaptation methods to pronunciation modeling methods. We find that the decoding accuracy can be improved significantly (28% relative) in this manner.
引用
收藏
页码:53 / 56
页数:4
相关论文
共 50 条
  • [1] Persian large vocabulary name recognition system (FarsName)
    Hajitabar, Alireza
    Sameti, Hossein
    Hadian, Hossein
    Safari, Arash
    [J]. 2017 25TH IRANIAN CONFERENCE ON ELECTRICAL ENGINEERING (ICEE), 2017, : 1580 - 1583
  • [2] Very large vocabulary proper name recognition for directory assistance
    Béchet, F
    de Mori, R
    Subsol, G
    [J]. ASRU 2001: IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING, CONFERENCE PROCEEDINGS, 2001, : 222 - 225
  • [3] CONNECTIONIST APPROACHES TO LARGE VOCABULARY CONTINUOUS SPEECH RECOGNITION
    SAWAI, H
    MINAMI, Y
    MIYATAKE, M
    WAIBEL, A
    SHIKANO, K
    [J]. IEICE TRANSACTIONS ON COMMUNICATIONS ELECTRONICS INFORMATION AND SYSTEMS, 1991, 74 (07): : 1834 - 1844
  • [4] Discriminative training for large vocabulary telephone-based name recognition
    McDermott, E
    Biem, A
    Tenpaku, S
    Katagiri, S
    [J]. 2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS, VOLS I-VI, 2000, : 3739 - 3742
  • [5] Chinese large-vocabulary name recognition system using character description and syllable spelling recognition
    Wang, NJC
    Tsai, CH
    Huang, P
    Shen, JL
    [J]. 2004 International Symposium on Chinese Spoken Language Processing, Proceedings, 2004, : 17 - 20
  • [6] Fast decoding in large vocabulary name dialing
    Suontausta, J
    Häkkinen, J
    Viikki, O
    [J]. 2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS, VOLS I-VI, 2000, : 1535 - 1538
  • [7] Large-vocabulary recognition
    Dugast, C
    [J]. PHILIPS JOURNAL OF RESEARCH, 1995, 49 (04) : 353 - 366
  • [8] Drug Name Recognition: Approaches and Resources
    Liu, Shengyu
    Tang, Buzhou
    Chen, Qingcai
    Wang, Xiaolong
    [J]. INFORMATION, 2015, 6 (04) : 790 - 810
  • [9] Large vocabulary speech recognition in French
    Adda-Decker, M
    Adda, G
    Gauvain, JL
    Lamel, L
    [J]. ICASSP '99: 1999 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS VOLS I-VI, 1999, : 45 - 48
  • [10] Advances in Large Vocabulary Speech Recognition
    Gauvain, JL
    De Mori, R
    Lamel, L
    [J]. COMPUTER SPEECH AND LANGUAGE, 2002, 16 (01): : 1 - 3