Speaker normalization using Dynamic Frequency Warping

被引:0
|
作者
Huang, Zhenhua [1 ]
Hou, Limin [1 ]
机构
[1] Shanghai Univ, Sch Commun & Informat Engn, Shanghai, Peoples R China
关键词
D O I
10.1109/ICALIP.2008.4590058
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In an effort to reduce the degradation in a gender-independence isolated word recognition performance caused by variation character among different speaker a dynamic frequency warping approach to speaker normalization is investigated There are a lot of discrepancy. in ftequency domain which caused by vocal tract length difference among different speakers. Dynamic Frequency Warping (DFW) is an exact analog of Dynamic Time Warping (DTW) which is used to reduce the discrepancy ftequency scale of speech and normalize the ftequency accurately. In this paper the DFW method is to be introduced to normalize the frequency scale of speech and then applied it to a gender-independence isolated word recognition system. The results of experiments show a large improvement in average word error rate.
引用
收藏
页码:1091 / 1095
页数:5
相关论文
共 50 条
  • [1] Speaker normalization using efficient frequency warping procedures
    Lee, L
    Rose, RC
    [J]. 1996 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, CONFERENCE PROCEEDINGS, VOLS 1-6, 1996, : 353 - 356
  • [2] A frequency warping approach to speaker normalization
    Lee, L
    Rose, R
    [J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1998, 6 (01): : 49 - 60
  • [3] Speaker normalization based on frequency warping
    Zhan, PM
    Westphal, M
    [J]. 1997 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I - V: VOL I: PLENARY, EXPERT SUMMARIES, SPECIAL, AUDIO, UNDERWATER ACOUSTICS, VLSI; VOL II: SPEECH PROCESSING; VOL III: SPEECH PROCESSING, DIGITAL SIGNAL PROCESSING; VOL IV: MULTIDIMENSIONAL SIGNAL PROCESSING, NEURAL NETWORKS - VOL V: STATISTICAL SIGNAL AND ARRAY PROCESSING, APPLICATIONS, 1997, : 1039 - 1042
  • [4] Frequency-warping and speaker-normalization
    Umesh, S
    Cohen, L
    Nelson, D
    [J]. 1997 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I - V: VOL I: PLENARY, EXPERT SUMMARIES, SPECIAL, AUDIO, UNDERWATER ACOUSTICS, VLSI; VOL II: SPEECH PROCESSING; VOL III: SPEECH PROCESSING, DIGITAL SIGNAL PROCESSING; VOL IV: MULTIDIMENSIONAL SIGNAL PROCESSING, NEURAL NETWORKS - VOL V: STATISTICAL SIGNAL AND ARRAY PROCESSING, APPLICATIONS, 1997, : 983 - 986
  • [5] Improving robustness in frequency warping-based speaker normalization
    Rose, Richard C.
    Miguel, A.
    Keyvani, A.
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2008, 15 : 225 - 228
  • [6] Study of non-linear frequency warping functions for speaker normalization
    Kumar, S. V. Bharath
    Umesh, S.
    Sinha, R.
    [J]. 2006 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-13, 2006, : 1245 - 1248
  • [7] Speaker Normalization Method Based On the Piece-Wise Linear Frequency Warping
    Feng, Hongcai
    Yuan, Cao
    Li, Yaqin
    [J]. IEEE: 2009 INTERNATIONAL CONFERENCE ON E-LEARNING, E-BUSINESS, ENTERPRISE INFORMATION SYSTEMS AND E-GOVERNMENT, 2009, : 80 - 83
  • [8] SPEAKER VERIFICATION USING THE DYNAMIC TIME WARPING
    Segarceanu, Svetlana
    Zaharia, Tiberius
    [J]. UNIVERSITY POLITEHNICA OF BUCHAREST SCIENTIFIC BULLETIN SERIES C-ELECTRICAL ENGINEERING AND COMPUTER SCIENCE, 2013, 75 (01): : 179 - 194
  • [9] Speaker verification using the dynamic time warping
    Segarceanu, Svetlana
    Zaharia, Tiberius
    [J]. UPB Scientific Bulletin, Series C: Electrical Engineering, 2013, 75 (01): : 179 - 194
  • [10] DYNAMIC FREQUENCY WARPING FOR SPEAKER ADAPTATION IN AUTOMATIC SPEECH RECOGNITION
    PALIWAL, KK
    AINSWORTH, WA
    [J]. JOURNAL OF PHONETICS, 1985, 13 (02) : 123 - 134