Improving homograph disambiguation with supervised machine learning

被引:0
|
作者
Gorman, Kyle [1 ]
Mazovetskiy, Gleb [1 ]
Nikolaev, Vitaly [1 ]
机构
[1] Google Inc, Mountain View, CA 94043 USA
关键词
Homograph disambiguation; machine learning; text normalization; text-to-speech synthesis;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
We describe a pre-existing rule-based homograph disambiguation system used for text-to-speech synthesis at Google, and compare it to a novel system which performs disambiguation using classifiers trained on a small amount of labeled data. An evaluation of these systems, using a new, freely available English data set, finds that hybrid systems (making use of both rules and machine learning) are significantly more accurate than either hand-written rules or machine learning alone. The evaluation also finds minimal performance degradation when the hybrid system is configured to run on limited-resource mobile devices rather than on production servers. The two best systems described here are used for homograph disambiguation on all US English text-to-speech traffic at Google.
引用
收藏
页码:1349 / 1352
页数:4
相关论文
共 50 条
  • [31] Improving HPC System Performance by Predicting Job Resources via Supervised Machine Learning
    Tanash, Mohammed
    Dunn, Brandon
    Andresen, Daniel
    Hsu, William
    Yang, Huichen
    Okanlawon, Adedolapo
    [J]. PEARC '19: PROCEEDINGS OF THE PRACTICE AND EXPERIENCE IN ADVANCED RESEARCH COMPUTING ON RISE OF THE MACHINES (LEARNING), 2019,
  • [32] Supervised and semi-supervised machine learning ranking
    Vittaut, Jean-Noel
    Gallinari, Patrick
    [J]. COMPARATIVE EVALUATION OF XML INFORMATION RETRIEVAL SYSTEMS, 2007, 4518 : 213 - 222
  • [33] Improving Supervised Learning with Multiple Clusterings
    Wemmert, Cedric
    Forestier, Germain
    Derivaux, Sebastien
    [J]. APPLICATIONS OF SUPERVISED AND UNSUPERVISED ENSEMBLE METHODS, 2009, 245 : 135 - 149
  • [34] IMPROVING SUPERVISED LEARNING BY SAMPLE DECOMPOSITI
    Rokach, Lior
    Maimon, Oded
    Arad, Omri
    [J]. INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE AND APPLICATIONS, 2005, 5 (01) : 37 - 53
  • [35] A Survey on the Explainability of Supervised Machine Learning
    Burkart, Nadia
    Huber, Marco F.
    [J]. JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2021, 70 : 245 - 317
  • [36] Supervised Machine Learning with Plausible Deniability
    Rass, Stefan
    Koenig, Sandra
    Wachter, Jasmin
    Egger, Manuel
    Hobisch, Manuel
    [J]. COMPUTERS & SECURITY, 2022, 112
  • [37] Supervised Hashing with Extreme Learning Machine
    Yuan, Tongtong
    Deng, Weihong
    Hu, Jiani
    [J]. 2017 IEEE VISUAL COMMUNICATIONS AND IMAGE PROCESSING (VCIP), 2017,
  • [38] Featuring the Attributes in Supervised Machine Learning
    Tallon-Ballesteros, Antonio J.
    Correia, Luis
    Xue, Bing
    [J]. HYBRID ARTIFICIAL INTELLIGENT SYSTEMS (HAIS 2018), 2018, 10870 : 350 - 362
  • [39] Exploring the power of supervised learning methods for company name disambiguation in microblog posts
    Polat, Esma Nafiye
    Cakmak, Ali
    Turan, Rabia Nuray
    [J]. TURKISH JOURNAL OF ELECTRICAL ENGINEERING AND COMPUTER SCIENCES, 2020, 28 (05) : 2400 - 2415
  • [40] Supervised Machine Learning: A Brief Primer
    Jiang, Tammy
    Gradus, Jaimie L.
    Rosellini, Anthony J.
    [J]. BEHAVIOR THERAPY, 2020, 51 (05) : 675 - 687