TUSK: A framework for overviewing the performance of F0 estimators

Cited by: 2
Authors
Morise, Masanori [1]
Kawahara, Hideki [2]
Affiliations
[1] Univ Yamanashi, Interdisciplinary Grad Sch, Kofu, Yamanashi, Japan
[2] Wakayama Univ, Fac Engn, Wakayama, Japan
Keywords
Speech analysis; fundamental frequency; temporal variation; noise robustness; FUNDAMENTAL-FREQUENCY ESTIMATION; PITCH EXTRACTION; TANDEM-STRAIGHT; SPEECH
DOI
10.21437/Interspeech.2016-140
Chinese Library Classification
O42 [Acoustics]
Discipline classification code
070206; 082403
Abstract
This article presents a framework for overviewing the performance of fundamental frequency (F0) estimators and evaluates its effectiveness. Over the past few decades, many F0 estimators and evaluation indices have been proposed and evaluated using various speech databases. In speech analysis/synthesis research, modern estimators are used to meet the demand for high-quality speech synthesis, yet they compete with one another on minor issues. Specifically, while all of them meet the demands of high-quality speech synthesis, the evaluation result depends on the speech database used. Since there are various types of speech, it is inadvisable to judge the effectiveness of each estimator on the basis of minor differences; it would be better to select an F0 estimator appropriate to the speech characteristics. The framework we propose, TUSK, does not rank the estimators but rather attempts to overview them. TUSK introduces six parameters for observing trends in the characteristics of each F0 estimator, and the test signal is artificially generated so that the six parameters can be controlled independently. In this article, we introduce the concept of TUSK and assess its effectiveness using several modern F0 estimators.
Pages: 1790-1794
Number of pages: 5