Modelling pronunciation variation using multi-path HMMs for syllables

被引:0
|
作者
Hamalainen, Annika [1 ]
ten Bosch, Louis [1 ]
Boves, Lou [1 ]
机构
[1] Radboud Univ Nijmegen, CLST, Nijmegen, Netherlands
关键词
speech recognition; hidden Markov models;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Recent research suggests that it is more appropriate to model pronunciation variation with syllable-length acoustic models than with triphones. Due to the large number of factors contributing to pronunciation variation at the syllable level, the creation of multi-path model topologies appears necessary. In this paper, we construct multi-path models using phonetic knowledge to initialise the parallel paths, and a data-driven solution for their re-estimation. When applied to 94 frequent syllables in a Dutch read speech recognition task, the approach leads to improved recognition performance when compared with a much more complex triphone recogniser. A detailed analysis of the pronunciation variation captured by the parallel paths pinpoints the deficiencies of the approach, and provides insights into how these may be overcome.
引用
收藏
页码:781 / +
页数:2
相关论文
共 50 条
  • [1] Pronunciation Variant -Based Multi-Path HMMs for Syllables
    Hamalainen, Annika
    ten Bosch, Louis
    Boves, Lou
    INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 1579 - 1582
  • [2] Modelling pronunciation variation with single-path and multi-path syllable models: Issues to consider
    Hamalainen, Annika
    ten Bosch, Louis
    Boves, Lou
    SPEECH COMMUNICATION, 2009, 51 (02) : 130 - 150
  • [3] Modelling multi-path problems
    Gibbens, R. J.
    2008 42ND ANNUAL CONFERENCE ON INFORMATION SCIENCES AND SYSTEMS, VOLS 1-3, 2008, : 42 - 45
  • [4] Modelling of multi-path batch production lines by FSMs
    Knap, SL
    7TH WORLD MULTICONFERENCE ON SYSTEMICS, CYBERNETICS AND INFORMATICS, VOL V, PROCEEDINGS: COMPUTER SCIENCE AND ENGINEERING: I, 2003, : 134 - 139
  • [5] Multi-path utility maximization and multi-path TCP design
    Vo, Phuong Luu
    Tuan Anh Le
    Lee, Sungwon
    Hong, Choong Seon
    Kim, Byeongsik
    Song, Hoyoung
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2014, 74 (01) : 1848 - 1857
  • [6] Speed-path Analysis for Multi-path Failed Latches with Random Variation
    Ishida, Tsutomu
    Nitta, Izumi
    Homma, Katsumi
    Kanazawa, Yuzi
    Komatsu, Hiroaki
    2012 13TH INTERNATIONAL SYMPOSIUM ON QUALITY ELECTRONIC DESIGN (ISQED), 2012, : 545 - 552
  • [7] Using clock changes in multi-path applications
    Lacaze, B
    Mailhes, C
    Castanié, F
    2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS, VOLS I-VI, 2000, : 3838 - 3841
  • [8] Multi-path matroids
    Bonin, Joseph E.
    Gimenez, Omer
    COMBINATORICS PROBABILITY & COMPUTING, 2007, 16 (02): : 193 - 217
  • [9] MULTI-PATH DISTORTION
    LEGGATT, DP
    WIRELESS WORLD, 1981, 87 (1545): : 45 - 45
  • [10] An Online Learning Multi-path Selection Framework for Multi-path Transmission Protocols
    Cai, Kechao
    Lui, John C. S.
    2019 53RD ANNUAL CONFERENCE ON INFORMATION SCIENCES AND SYSTEMS (CISS), 2019,