Data-driven analysis of speech

被引:0
|
作者
Hermansky, H [1 ]
机构
[1] Oregon Grad Inst, Portland, OR 97291 USA
[2] Int Comp Sci Inst, Berkeley, CA 94704 USA
来源
TEXT, SPEECH AND DIALOGUE | 1999年 / 1692卷
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We show on results taken from recent studies from our laboratory that conventional speech analysis techniques for ASR (such as Mel cepstrum or PLP) in combination with dynamic features (such as estimates of derivatives of cepstral feature trajectories) are sub-optimal and could be improved. The improvements can be derived by employing large labeled databases which allow for studying how is the linguistic information distributed in time and in frequency as well as for a design of discrimitative spectral basis and temporal RASTA filters.
引用
收藏
页码:10 / 18
页数:9
相关论文
共 50 条
  • [1] Data-driven techniques in speech synthesis
    Dutoit, T
    [J]. COMPUTATIONAL LINGUISTICS, 2002, 28 (04) : 570 - 572
  • [2] A Speech Data-Driven Stakeholder Analysis Methodology Based on the Stakeholder Graph Models
    Shirasaki, Yuta
    Kobayashi, Yuya
    Aoyama, Mikio
    [J]. 2019 IEEE 43RD ANNUAL COMPUTER SOFTWARE AND APPLICATIONS CONFERENCE (COMPSAC), VOL 2, 2019, : 213 - 220
  • [3] A Data-Driven Affective Analysis Framework Toward Naturally Expressive Speech Synthesis
    Bellegarda, Jerome R.
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2011, 19 (05): : 1113 - 1122
  • [4] Data-Driven Pause Prediction for Speech Synthesis in Storytelling Style Speech
    Sarkar, Parakrant
    Rao, K. Sreenivasa
    [J]. 2015 TWENTY FIRST NATIONAL CONFERENCE ON COMMUNICATIONS (NCC), 2015,
  • [5] CLOSE-A Data-Driven Approach to Speech Separation
    Ming, Ji
    Srinivasan, Ramji
    Crookes, Danny
    Jafari, Ayeh
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2013, 21 (07): : 1355 - 1368
  • [6] A Statistical Quality Model for Data-Driven Speech Animation
    Ma, Xiaohan
    Deng, Zhigang
    [J]. IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2012, 18 (11) : 1915 - 1927
  • [7] AN EVALUATION OF MONGOLIAN DATA-DRIVEN TEXT-TO-SPEECH
    Altangerel, Chagnaa
    Purev, Jaimai
    Yesyenbyek, Kerey
    Hansakunbuntheung, Chatchawarn
    [J]. 2013 INTERNATIONAL CONFERENCE ORIENTAL COCOSDA HELD JOINTLY WITH 2013 CONFERENCE ON ASIAN SPOKEN LANGUAGE RESEARCH AND EVALUATION (O-COCOSDA/CASLRE), 2013,
  • [8] Data-driven resolvent analysis
    Herrmann, Benjamin
    Baddoo, Peter J.
    Semaan, Richard
    Brunton, Steven L.
    McKeon, Beverley J.
    [J]. JOURNAL OF FLUID MECHANICS, 2021, 918
  • [9] An Overview of Data-Driven Part-of-Speech Tagging
    Tufis, Dan
    [J]. ROMANIAN JOURNAL OF INFORMATION SCIENCE AND TECHNOLOGY, 2016, 19 (1-2): : 78 - 97
  • [10] Data-driven part-of-speech tagging of Kiswahili
    De Pauw, Guy
    de Schryver, Gilles-Maurice
    Wagacha, Peter W.
    [J]. TEXT, SPEECH AND DIALOGUE, PROCEEDINGS, 2006, 4188 : 197 - 204