A new phonetic model for continuous speech recognition systems

被引:0
|
作者
Fagundes, RDR [1 ]
Corrêa, JS [1 ]
Dumouchel, P [1 ]
机构
[1] Pontif Catholic Univ Rio Grande do Sul, Dept Elect Engn, FENG, Porto Alegre, RS, Brazil
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The main goal of this work is to describe a new model for a large vocabulary continuous speech recognition system using a phonetic-phonological approach. This work proposes a statistical phonetic structure, applied at the, phonetic-phonological level, to improve the speech recognition performance in systems with phonetic-phonological modeling. It is showed that the general likelihood scores are increased, indicating better recognition performances. This is due to the fact that the statistical phonetic structure will lead to enhance some frequent phonetic combinations from the language itself. Such structure should be considered as an additional knowledge base, containing information about the real language phonetic structure. Also this new phonetic-phonological approach should be strongly recommended to use in spontaneous speech recognition systems.
引用
收藏
页码:572 / 575
页数:4
相关论文
共 50 条
  • [1] CONTINUOUS SPEECH RECOGNITION FROM PHONETIC TRANSCRIPTION
    LEVINSON, SE
    LJOLJE, A
    [J]. SPEECH AND NATURAL LANGUAGE, 1989, : 292 - 292
  • [2] DEVELOPMENT OF AN ACOUSTIC-PHONETIC HIDDEN MARKOV MODEL FOR CONTINUOUS SPEECH RECOGNITION
    LJOLJE, A
    LEVINSON, SE
    [J]. IEEE TRANSACTIONS ON SIGNAL PROCESSING, 1991, 39 (01) : 29 - 39
  • [3] MIDCLASS PHONETIC ANALYSIS FOR A CONTINUOUS SPEECH RECOGNITION SYSTEM
    DALBY, J
    LAVER, J
    HILLER, SM
    [J]. PROCEEDINGS : INSTITUTE OF ACOUSTICS, VOL 8, PART 7: SPEECH & HEARING, 1986, 8 : 347 - 354
  • [4] Sub-phonetic polynomial segment model for large vocabulary continuous speech recognition
    Yeung, SKA
    Li, CF
    Siu, MH
    [J]. 2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 193 - 196
  • [5] Sub-phonetic polynomial segment model for large vocabulary continuous speech recognition
    Yeung, Siu-Kei Au
    Li, Chak-Fai
    Siu, Man-Hung
    [J]. ICASSP IEEE Int Conf Acoust Speech Signal Process Proc, (I193-I196):
  • [6] LEVERAGING PHONETIC CONTEXT DEPENDENT INVARIANT STRUCTURE FOR CONTINUOUS SPEECH RECOGNITION
    Zhang, Congying
    Suzuki, Masayuki
    Kurata, Gakuto
    Nishimura, Masafumi
    Minematsu, Nobuaki
    [J]. 2014 IEEE CHINA SUMMIT & INTERNATIONAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING (CHINASIP), 2014, : 52 - 56
  • [7] ACOUSTIC PHONETIC REPRESENTATIONS FOR CONTINUOUS SPEECH RECOGNITION - NETWORKS VERSUS LATTICES
    BRENNAN, RA
    PHILLIPS, MS
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1987, 81 : S93 - S93
  • [8] Impact of Phonetic Annotation Precision on Automatic Speech Recognition Systems
    Safarik, Radek
    Mateju, Lukas
    [J]. 2016 39TH INTERNATIONAL CONFERENCE ON TELECOMMUNICATIONS AND SIGNAL PROCESSING (TSP), 2016, : 311 - 314
  • [9] MARKOV MODEL ACOUSTIC PHONETIC COMPONENT FOR AUTOMATIC SPEECH RECOGNITION
    TAPPERT, CC
    [J]. INTERNATIONAL JOURNAL OF MAN-MACHINE STUDIES, 1977, 9 (03): : 363 - 373
  • [10] Anchor point detection for continuous speech recognition in Spanish: The spotting of phonetic events
    Leandro, MA
    Pardo, JM
    [J]. ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 2336 - 2339