Data-driven analysis of speech

Cited by: 0

Authors:

Hermansky, H ^{[1]}

Affiliations:

[1] Oregon Grad Inst, Portland, OR 97291 USA

[2] Int Comp Sci Inst, Berkeley, CA 94704 USA

Source:

TEXT, SPEECH AND DIALOGUE | 1999 / Vol. 1692

Keywords:

DOI:

Not available

Chinese Library Classification number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Using results from recent studies in our laboratory, we show that conventional speech analysis techniques for ASR (such as Mel cepstrum or PLP), in combination with dynamic features (such as estimates of derivatives of cepstral feature trajectories), are sub-optimal and could be improved. The improvements can be derived by employing large labeled databases, which allow for studying how linguistic information is distributed in time and in frequency, as well as for the design of discriminative spectral bases and temporal RASTA filters.

Pages: 10-18

Page count: 9
