Modelling pronunciation variation using multi-path HMMs for syllables

Cited by: 0
Authors
Hamalainen, Annika [1]
ten Bosch, Louis [1]
Boves, Lou [1]
Affiliations
[1] Radboud Univ Nijmegen, CLST, Nijmegen, Netherlands
Keywords
speech recognition; hidden Markov models;
DOI
Not available
Chinese Library Classification number
O42 [Acoustics]
Discipline classification codes
070206; 082403
Abstract
Recent research suggests that it is more appropriate to model pronunciation variation with syllable-length acoustic models than with triphones. Due to the large number of factors contributing to pronunciation variation at the syllable level, the creation of multi-path model topologies appears necessary. In this paper, we construct multi-path models using phonetic knowledge to initialise the parallel paths, and a data-driven solution for their re-estimation. When applied to 94 frequent syllables in a Dutch read speech recognition task, the approach leads to improved recognition performance when compared with a much more complex triphone recogniser. A detailed analysis of the pronunciation variation captured by the parallel paths pinpoints the deficiencies of the approach, and provides insights into how these may be overcome.
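The multi-path topology mentioned in the abstract can be pictured as several left-to-right state chains that share a single entry and a single exit state. The following is a minimal sketch of such a transition structure, not the authors' implementation: the function name `build_multipath_transitions`, the path lengths, the self-loop probability of 0.6 and the uniform prior over paths are all illustrative assumptions. In the paper, the paths are initialised from phonetic knowledge and then re-estimated from data, which this sketch does not attempt.

```python
def build_multipath_transitions(path_lengths, self_loop=0.6):
    """Return (matrix, paths) for an HMM with parallel left-to-right paths.

    State 0 is a non-emitting entry state and the last state is a
    non-emitting exit state; each path is a chain of emitting states.
    All probabilities are placeholder values (assumptions), standing in
    for what training would estimate.
    """
    if not path_lengths or any(n < 1 for n in path_lengths):
        raise ValueError("each path needs at least one emitting state")

    n_states = 2 + sum(path_lengths)
    exit_state = n_states - 1
    matrix = [[0.0] * n_states for _ in range(n_states)]
    paths = []
    next_free = 1

    for n in path_lengths:
        states = list(range(next_free, next_free + n))
        next_free += n
        paths.append(states)

        # Uniform prior over paths (assumption); re-estimation would adjust it.
        matrix[0][states[0]] = 1.0 / len(path_lengths)

        for i, state in enumerate(states):
            # Stay in the state, or move on to the next state of the same path.
            successor = states[i + 1] if i + 1 < n else exit_state
            matrix[state][state] = self_loop
            matrix[state][successor] = 1.0 - self_loop

    # Make the exit state absorbing so that every row sums to one.
    matrix[exit_state][exit_state] = 1.0
    return matrix, paths


# Hypothetical syllable model with a 3-state path and a 2-state reduced path.
matrix, paths = build_multipath_transitions([3, 2])
```

Because no transition connects states on different paths, each path can specialise in one pronunciation variant, for example a full form and a reduced form of the same syllable.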
Pages: 781 / +
Number of pages: 2