A comparative study of speech rate estimation techniques

被引:0
|
作者
Dekens, Tomas [1 ]
Demol, Mike [1 ]
Verhelst, Werner [1 ]
Verhoeve, Piet [2 ]
机构
[1] Vrije Univ Brussel, Dept ETRO DSSP, Pl Laan 2, B-1050 Brussels, Belgium
[2] TELEVIC Nv, Corp R&D Dept, B-8870 Izegem, Belgium
关键词
cross lingual comparison; speech rate estimation;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper we evaluate the performance of 8 different speech rate estimators [1, 2, 3, 4, 5] previously described in the literature by applying them on a multilingual test database [6]. All the estimators show an underestimation at high speech rates and some also suffer from an overestimation at low speech rates. Overall the tested methods obtain high correlation coefficients with the reference speech rate. The Temporal Correlation and Selected Sub-band Correlation method (tcssbc), which uses sub-band and time domain correlation for detecting the number of vowels or diphthongs present in the speech signal, shows little errors and appears to be the most appropriate overall technique for speech rate estimation.
引用
收藏
页码:225 / +
页数:2
相关论文
共 50 条
  • [1] Enhancing Speech Rate Estimation Techniques To Improve Dysarthria Diagnosis
    Carmichael, James Nathaniel
    [J]. 2017 8TH IEEE ANNUAL INFORMATION TECHNOLOGY, ELECTRONICS AND MOBILE COMMUNICATION CONFERENCE (IEMCON), 2017, : 309 - 313
  • [2] Comparative study of automatic speech recognition techniques
    Cutajar, Michelle
    Gatt, Edward
    Grech, Ivan
    Casha, Owen
    Micallef, Joseph
    [J]. IET SIGNAL PROCESSING, 2013, 7 (01) : 25 - 46
  • [3] A Comparative Study of Audio/Speech Steganalysis Techniques
    Paulin, Catherine
    Selouani, Sid-Ahmed
    Hervet, Eric
    [J]. 2017 IEEE 30TH CANADIAN CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING (CCECE), 2017,
  • [4] Comparative study of automatic speech recognition techniques
    Faculty of Information and Communication Technology, Department of Microelectronics and Nanoelectronics, University of Malta, Tal-Qroqq, Msida
    MSD 2080, Malta
    [J]. IET Signal Proc., 2013, 1 (25-46):
  • [5] A Comparative Study of Frequency Estimation Techniques
    Park, Chul-Won
    Kim, Yoon Sang
    Han, Moon-Seog
    [J]. T& D ASIA: 2009 TRANSMISSION & DISTRIBUTION CONFERENCE & EXPOSITION: ASIA AND PACIFIC, 2009, : 701 - +
  • [6] A comparative study of fractal dimension estimation for speech
    Fekkai, S
    Al-Akaidi, M
    Blackledge, J
    [J]. SIMULATION AND MODELLING: ENABLERS FOR A BETTER QUALITY OF LIFE, 2000, : 676 - 680
  • [7] A Comparative Study of Spatial Speech Separation Techniques to Improve Speech Recognition
    Zhou, Xinhui
    Kwan, Chiman
    Ayhan, Bulent
    Kim, Chanwoo
    Kumar, K.
    Stern, Richard
    [J]. ADVANCES IN NEURAL NETWORKS - ISNN 2018, 2018, 10878 : 494 - 502
  • [8] A comparative study of glottal source estimation techniques
    Drugman, Thomas
    Bozkurt, Baris
    Dutoit, Thierry
    [J]. COMPUTER SPEECH AND LANGUAGE, 2012, 26 (01): : 20 - 34
  • [9] Robust speech rate estimation for spontaneous speech
    Wang, Dagen
    Narayanan, Shrikanth S.
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2007, 15 (08): : 2190 - 2201
  • [10] SUBJECTIVE ESTIMATION OF SPEECH RATE
    VAANE, E
    [J]. PHONETICA, 1982, 39 (2-3) : 136 - 149