Iterative Grapheme-to-Phoneme Alignment for the Training of WFST-based Phonetic Conversion

被引:0
|
作者
Bohac, Marek [1 ]
Malek, Jiri [1 ]
Blavka, Karel [1 ]
机构
[1] Tech Univ Liberec, Fac Mechatron, Inst Informat Technol & Elect, Liberec 46117, Czech Republic
关键词
Alignment; conversion; Grapheme-to-phoneme; Phonetisaurus; WFST;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In this paper we propose an algorithm for grapheme-to-phoneme (G2P) alignment. Such alignment is needed mainly for the data-driven training of G2P conversion tools. Our approach utilizes a given phonetic alphabet and a set of given orthographic-phonetic word pairs as a source of prior knowledge. The development data are taken from a manually created pronunciation lexicon for a large vocabulary speech recognition system for Czech. The alignment method is based on extended Minimum Edit Distance algorithm. Moreover, we propose an approach to avoid the creation of reference alignments - we evaluate the improvements through a specially designed G2P converter, i.e. we compare the phonetic transcription directly to a set of test orthographic-phonetic word pairs. Results of our approach are comparable or even slightly better than the state-of-the-art.
引用
收藏
页码:474 / 478
页数:5
相关论文
共 50 条
  • [1] Online Discriminative Training for Grapheme-to-Phoneme Conversion
    Jiampojamarn, Sittichai
    Kondrak, Grzegorz
    [J]. INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 1307 - 1310
  • [2] Transformer based Grapheme-to-Phoneme Conversion
    Yolchuyeva, Sevinj
    Nemeth, Geza
    Gyires-Toth, Balint
    [J]. INTERSPEECH 2019, 2019, : 2095 - 2099
  • [3] JOINT ALIGNMENT LEARNING-ATTENTION BASED MODEL FOR GRAPHEME-TO-PHONEME CONVERSION
    Wang, Yonghe
    Bao, Feilong
    Zhang, Hui
    Gao, Guanglai
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 7788 - 7792
  • [4] Phonetisaurus: Exploring grapheme-to-phoneme conversion with joint n-gram models in the WFST framework
    Novak, Josef Robert
    Minematsu, Nobuaki
    Hirose, Keikichi
    [J]. NATURAL LANGUAGE ENGINEERING, 2016, 22 (06) : 907 - 938
  • [5] Fast Bilingual Grapheme-To-Phoneme Conversion
    Kim, Hwa-Yeon
    Kim, Jong-Hwan
    Kim, Jae-Min
    [J]. 2022 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, NAACL-HLT 2022, 2022, : 289 - 296
  • [6] A Rule-Based Grapheme-to-Phoneme Conversion System
    Klosowski, Piotr
    [J]. APPLIED SCIENCES-BASEL, 2022, 12 (05):
  • [7] Example-Based Grapheme-to-Phoneme Conversion for Thai
    Charoenpornsawat, Paisarn
    Schultz, Tanja
    [J]. INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 1268 - 1271
  • [8] Frustratingly Easy Multilingual Grapheme-to-Phoneme Conversion
    Prabhu, Nikhil
    Kann, Katharina
    [J]. 17TH SIGMORPHON WORKSHOP ON COMPUTATIONAL RESEARCH IN PHONETICS PHONOLOGY, AND MORPHOLOGY (SIGMORPHON 2020), 2020, : 123 - 127
  • [9] Grapheme-to-Phoneme Conversion with Convolutional Neural Networks
    Yolchuyeva, Sevinj
    Nemeth, Geza
    Gyires-Toth, Balint
    [J]. APPLIED SCIENCES-BASEL, 2019, 9 (06):
  • [10] Grapheme-to-phoneme conversion in Chinese TTS system
    Dong, HH
    Tao, JH
    Xu, B
    [J]. 2004 International Symposium on Chinese Spoken Language Processing, Proceedings, 2004, : 165 - 168