Structure Learning in Hidden Conditional Random Fields for Grapheme-to-Phoneme Conversion

被引:0
|
作者
Lehnen, Patrick [1 ]
Allauzen, Alexandre [2 ,3 ]
Lavergne, Thomas [2 ,3 ]
Yvon, Francois [2 ,3 ]
Hahn, Stefan [1 ]
Ney, Hermann [1 ,2 ,3 ]
机构
[1] Rhein Westfal TH Aachen, Dept Comp Sci, Human Language Technol & Pattern Recognit, Aachen, Germany
[2] Univ Paris Sud, Orsay, France
[3] CNRS, LIMSI, Spoken Language Proc Grp, Orsay, France
关键词
grapheme-to-phoneme conversion; G2P; HCRF; discriminative models; hidden conditional random fields;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Accurate grapheme-to-phoneme(g2p)conversion is needed for several speech processing applications, such as automatic speech synthesis and recognition. For some languages, notably English, improvements of g2p systems are very slow, due to the intricacy of the associations between letter and sounds. In recent years, several improvements have been obtained either by using variable-length associations in generative models (joint-n-grams), or by recasting the Problem as a conventional sequence labeling task, enabling to integrate rich dependencies in discriminative models. In this paper, we consider several ways to reconciliate these two approaches. Introducing hidden variable-length alignments through latent variables, our Hidden Conditional Random Field (HCRF) models are able to produce comparative performance compared to strong generative and discriminative models on the CELEX database.
引用
收藏
页码:2325 / 2329
页数:5
相关论文
共 50 条
  • [1] Improving LVCSR with Hidden Conditional Random Fields for Grapheme-to-Phoneme Conversion
    Hahn, Stefan
    Lehnen, Patrick
    Wiesler, Simon
    Schlueter, Ralf
    Ney, Hermann
    [J]. 14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 495 - 499
  • [2] Grapheme-to-Phoneme Conversion using Conditional Random Fields
    Illina, Irina
    Fohr, Dominique
    Jouvet, Denis
    [J]. 12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 2324 - 2327
  • [3] Conditional Random Fields for the Tunisian Dialect Grapheme-to-Phoneme Conversion
    Masmoudi, Abir
    Ellouze, Mariem
    Bougares, Fethi
    Esetye, Yannick
    Belguith, Lamia
    [J]. 17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 1457 - 1461
  • [4] Hidden Conditional Random Fields with M-to-N Alignments for Grapheme-to-Phoneme Conversion
    Lehnen, Patrick
    Hahn, Stefan
    Guta, Vlad-Andrei
    Ney, Hermann
    [J]. 13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 2553 - 2556
  • [5] EM-STYLE OPTIMIZATION OF HIDDEN CONDITIONAL RANDOM FIELDS FOR GRAPHEME-TO-PHONEME CONVERSION
    Heigold, Georg
    Hahn, Stefan
    Lehnen, Patrick
    Ney, Hermann
    [J]. 2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 4920 - 4923
  • [6] Bangla Grapheme to Phoneme Conversion Using Conditional Random Fields
    Chowdhury, Shammur Absar
    Alam, Firoj
    Khan, Naira
    Noori, Sheak R. H.
    [J]. 2017 20TH INTERNATIONAL CONFERENCE OF COMPUTER AND INFORMATION TECHNOLOGY (ICCIT), 2017,
  • [7] INCORPORATING ALIGNMENTS INTO CONDITIONAL RANDOM FIELDS FOR GRAPHEME TO PHONEME CONVERSION
    Lehnen, Patrick
    Hahn, Stefan
    Guta, Andreas
    Ney, Hermann
    [J]. 2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 4916 - 4919
  • [8] Learning from Errors in Grapheme-to-Phoneme Conversion
    Polyakova, Tatyana
    Bonafonte, Antonio
    [J]. INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 2442 - 2445
  • [9] Fast Bilingual Grapheme-To-Phoneme Conversion
    Kim, Hwa-Yeon
    Kim, Jong-Hwan
    Kim, Jae-Min
    [J]. 2022 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, NAACL-HLT 2022, 2022, : 289 - 296
  • [10] A Survey of Grapheme-to-Phoneme Conversion Methods
    Cheng, Shiyang
    Zhu, Pengcheng
    Liu, Jueting
    Wang, Zehua
    [J]. Applied Sciences (Switzerland), 2024, 14 (24):