PART-OF-SPEECH MODELS COMPRESSION METHODS FOR ON-DEVICE GRAPHEME-TO-PHONEME CONVERSION

Citations: 0
Authors
Kubis, Marek [1]
Meloux, Maxime [2]
Skorzewski, Pawel [1]
Lewandowski, Marcin [2]
Jho, Gunu [3]
Park, Hyoungmin [3]
Affiliations
[1] Adam Mickiewicz Univ, Poznan, Poland
[2] Samsung R&D Inst Poland, Warsaw, Poland
[3] Samsung Elect, Mobile Commun Business, Suwon, South Korea
Keywords
part-of-speech tagging; model compression; grapheme-to-phoneme conversion
DOI
10.1109/ICASSP43922.2022.9746710
Chinese Library Classification
O42 [Acoustics]
Subject classification codes
070206; 082403
Abstract
The paper investigates methods of compressing part-of-speech models that are developed for an on-device grapheme-to-phoneme conversion module. The performance of part-of-speech models is analyzed under different compression regimes. The evaluation is done with respect to French, German and Italian datasets that consist of TTS input prompts. The study shows that a proper selection of a compression method reduces the model size significantly without deteriorating the grapheme-to-phoneme conversion performance.
Pages: 7117-7121
Page count: 5