Context-dependent phoneme duration modeling with tree-based state tying

被引:1
|
作者
Park, SJ [1 ]
Koo, MW
Jhon, CS
机构
[1] Serv Dev Lab KT, Seoul, South Korea
[2] Seoul Natl Univ, Sch Comp Sci & Engn, Seoul, South Korea
来源
关键词
duration model; gamma distribution; tree-based state tying;
D O I
10.1093/ietisy/e88-d.3.662
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This letter presents two methods of modeling phoneme durations. One is the context-independent phoneme duration modeling in which duration parameters are stored in each phoneme. The other is the context-dependent duration modeling in which duration parameters are stored in each state shared by context-dependent phonemes. The phoneme duration model is compared with a without-duration model and a state duration model. Experiments are performed on a database collected over the telephone network. Experimental results show that duration information rejects out-of-task (OOT) words, well and that the context-dependent duration model yields the best performance among the tested models.
引用
收藏
页码:662 / 666
页数:5
相关论文
共 50 条
  • [1] Context-dependent HMM modeling using tree-based clustering for the recognition of handwritten words
    Bianne, Anne-Laure
    Kermorvant, Christopher
    Likforman-Sulem, Laurence
    DOCUMENT RECOGNITION AND RETRIEVAL XVII, 2010, 7534
  • [2] Context-dependent duration modeling
    Willett, D
    2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 421 - 424
  • [3] Speaking Rate Normalization with Lattice-based Context-dependent Phoneme Duration Modeling for Personalized Speech Recognizers on Mobile Devices
    Yeh, Ching-Feng
    Lee, Hung-Yi
    Lee, Lin-Shan
    14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 1740 - 1744
  • [4] A study of phoneme and grapheme based context-dependent ASR systems
    Dines, John
    Doss, Mathew Magimai
    MACHINE LEARNING FOR MULTIMODAL INTERACTION, 2008, 4892 : 215 - 226
  • [5] Tree-Based HMM State Tying for Arabic Continuous Speech Recognition
    Azim, Mona A.
    Hamid, A. Aziz A.
    Badr, Nagwa L.
    Tolba, M. F.
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON ADVANCED INTELLIGENT SYSTEMS AND INFORMATICS 2016, 2017, 533 : 96 - 103
  • [6] WORD SPOTTING USING CONTEXT-DEPENDENT PHONEME-BASED HMMS
    MATSUOKA, T
    IEICE TRANSACTIONS ON COMMUNICATIONS ELECTRONICS INFORMATION AND SYSTEMS, 1991, 74 (07): : 1768 - 1772
  • [7] CONTEXT-DEPENDENT TREE AUTOMATA
    PYSTER, A
    INFORMATION AND CONTROL, 1978, 38 (01): : 81 - 102
  • [8] Performance of connected digit recognizers with context-dependent word duration modeling
    Kwon, OW
    Un, CK
    APCCAS '96 - IEEE ASIA PACIFIC CONFERENCE ON CIRCUITS AND SYSTEMS '96, 1996, : 243 - 246
  • [9] CONTEXT-DEPENDENT WORD DURATION MODELING FOR KOREAN CONNECTED DIGIT RECOGNITION
    KWON, OW
    UN, CK
    ELECTRONICS LETTERS, 1995, 31 (19) : 1630 - 1631
  • [10] Towards using context-dependent symbols in CTC without state-tying decision trees
    Chorowski, Jan
    Lancucki, Adrian
    Kostka, Bartosz
    Zapotoczny, Michal
    INTERSPEECH 2019, 2019, : 4385 - 4389