Proper Name Pronunciations for Speech Technology Applications

被引:5
|
作者
Murray F. Spiegel
机构
[1] Telcordia Technologies,Speech Technology Applications Research
关键词
name pronunciations; proper names; recognition of names; automated directory assistance;
D O I
10.1023/A:1025721319650
中图分类号
学科分类号
摘要
This paper describes a 15-year research effort to improve the automatic pronunciation of proper names and details the issues involved in applying those pronunciations to speech synthesis and speech recognition. Our approach consists primarily of a large hand-tuned rule component, supplemented by a comparatively small pronunciation dictionary, both guided by extensive survey and polling data. Compared to other state-of-the-art programs, we use language-class identification to smaller degree. We utilize alternate pronunciations, obtained from the polling data, for both synthesis and recognition purposes. While our approach yields comparatively high accuracies, a comprehensive database of names and their pronunciations verified and authenticated through customer interactions (such as auto-attendants and automated directory assistance) will likely be the best future resource defining the ultimate in accuracy.
引用
收藏
页码:419 / 427
页数:8
相关论文
共 50 条
  • [1] Proper name pronunciations for speech technology applications
    Spiegel, MF
    [J]. PROCEEDINGS OF THE 2002 IEEE WORKSHOP ON SPEECH SYNTHESIS, 2002, : 175 - 178
  • [2] Dynamic generation of proper name pronunciations for directory assistance
    Béchet, F
    de Mori, R
    Subsol, G
    [J]. 2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 745 - 748
  • [3] The proper name in speech
    Ehrmann, Maud
    [J]. TRAITEMENT AUTOMATIQUE DES LANGUES, 2009, 50 (03): : 221 - 224
  • [4] The proper name in speech
    Mira Rueda, Concepcion
    [J]. JOURNAL OF FRENCH LANGUAGE STUDIES, 2012, 22 (02) : 309 - 310
  • [5] Learning name pronunciations in automatic speech recognition systems
    Beaufays, F
    Sankar, A
    Williams, S
    Weintraub, M
    [J]. 15TH IEEE INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2003, : 233 - 240
  • [6] NAME PRONUNCIATIONS
    CROOKS, HM
    [J]. CHEMICAL & ENGINEERING NEWS, 1967, 45 (21) : 8 - &
  • [7] Name Pronunciations
    Heyl, Lawrence
    [J]. LIBRARY JOURNAL, 1939, 64 (03) : 84 - 84
  • [8] An advanced system to generate pronunciations of proper nouns
    Deshmukh, N
    Ngan, J
    Hamaker, J
    Picone, J
    [J]. 1997 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I - V: VOL I: PLENARY, EXPERT SUMMARIES, SPECIAL, AUDIO, UNDERWATER ACOUSTICS, VLSI; VOL II: SPEECH PROCESSING; VOL III: SPEECH PROCESSING, DIGITAL SIGNAL PROCESSING; VOL IV: MULTIDIMENSIONAL SIGNAL PROCESSING, NEURAL NETWORKS - VOL V: STATISTICAL SIGNAL AND ARRAY PROCESSING, APPLICATIONS, 1997, : 1467 - 1470
  • [9] Learning Personalized Pronunciations for Contact Name Recognition
    Bruguier, Antoine
    Peng, Fuchun
    Beaufays, Francoise
    [J]. 17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 3096 - 3100
  • [10] Neural Networks for Proper Name Retrieval in the Framework of Automatic Speech Recognition
    Fohr, Dominique
    Illina, Irina
    [J]. 2015 6TH INTERNATIONAL CONFERENCE ON INFORMATION SYSTEMS AND ECONOMIC INTELLIGENCE (SIIE), 2015, : 25 - 30