TUSK: A framework for overviewing the performance of F0 estimators

被引:2
|
作者
Morise, Masanori [1 ]
Kawahara, Hideki [2 ]
机构
[1] Univ Yamanashi, Interdisciplinary Grad Sch, Kofu, Yamanashi, Japan
[2] Wakayama Univ, Fac Engn, Wakayama, Japan
关键词
Speech analysis; fundamental frequency; temporal variation; noise robustness; FUNDAMENTAL-FREQUENCY ESTIMATION; PITCH EXTRACTION; TANDEM-STRAIGHT; SPEECH;
D O I
10.21437/Interspeech.2016-140
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This article presents a framework for overviewing the performance of fundamental frequency (F0) estimators and evaluates its effectiveness. Over the past few decades, many F0 estimators and evaluation indices have been proposed and have been evaluated using various speech databases. In speech analysis/synthesis research, modem estimators are used as the algorithm to fulfill the demand for high-quality speech synthesis, but at the same time, they are competing with one another on minor issues. Specifically, while all of them meet the demands for high-quality speech synthesis, the result depends on the speech database used in the evaluation. Since there are various types of speech, it is inadvisable to discuss the effectiveness of each estimator on the basis of minor differences. It would be better to select the appropriate F0 estimator in accordance with the speech characteristics. The framework we propose, TUSK, does not rank the estimators but rather attempts to overview them. In TUSK, six parameters are introduced to observe the trends in the characteristics in each F0 estimator. The signal is artificially generated so that six parameters can be controllable independently. In this article, we introduce the concept of TUSK and determine its effectiveness using several modem F0 estimators.
引用
收藏
页码:1790 / 1794
页数:5
相关论文
共 50 条
  • [1] Study of f0(980) and f0(1500) from Bs→f0(980)π,f0(1500)π decays
    Zhi-Qing Zhang
    The European Physical Journal C, 2010, 69 : 433 - 443
  • [2] Study of f0(980) and f0(1500) from Bs → f0(980)π,f0(1500)π decays
    Zhang, Zhi-Qing
    EUROPEAN PHYSICAL JOURNAL C, 2010, 69 (3-4): : 433 - 443
  • [3] Properties of the scalar mesons f0(1370), f0(1500) and f0(1710)
    Li, DM
    Yu, H
    Shen, QX
    EUROPEAN PHYSICAL JOURNAL C, 2001, 19 (03): : 529 - 533
  • [4] The f0(1790) and f0(1800) puzzle
    Khemchandani, K. P.
    Martinez Torres, A.
    Nielsen, M.
    Navarra, F. S.
    Jido, D.
    Hosaka, A.
    Oset, E.
    CHIRAL SYMMETRY IN HADRONS AND NUCLEI, 2015, : 74 - 77
  • [5] Study of f0(980) and f0(1500) from Bs → f0(980)K, f0(1500)K decays
    Zhang, Zhi-Qing
    JOURNAL OF PHYSICS G-NUCLEAR AND PARTICLE PHYSICS, 2010, 37 (08)
  • [6] Glueball-quarkonia content of the f0(1370), f0(1500) and f0(1710)
    Li, DM
    Yu, H
    Shen, QX
    COMMUNICATIONS IN THEORETICAL PHYSICS, 2000, 34 (03) : 507 - 512
  • [7] The mixing of the f0(1370), f0(1500) and f0(1710) and the search for the scalar glueball
    Close, FE
    Kirk, A
    PHYSICS LETTERS B, 2000, 483 (04) : 345 - 352
  • [8] The mixing of the f0(1370), f0(1500) and f0(1710) and the search for the scalar glueball
    Kirk, A
    HIGH ENERGY PHYSICS, VOLS I AND II, 2001, : 347 - 350
  • [9] Perturbative QCD analysis of neutral B-meson decays into σ σ, σ f0 and f0 f0
    Niu, Hua-Dian
    Li, Guo-Dong
    Ren, Jia-Le
    Liu, Xin
    EUROPEAN PHYSICAL JOURNAL C, 2022, 82 (02):
  • [10] F0 Transformation within the Voice Conversion Framework
    Hanzlicek, Zdenek
    Matousek, Jindrich
    INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 681 - 684