EFFICIENT FEATURE EXTRACTION OF SPEAKER IDENTIFICATION USING PHONEME MEAN F-RATIO FOR CHINESE

被引:0
|
作者
Zhao, Chen [1 ]
Wang, Hongcui [1 ]
Hyon, Songgun [1 ]
Wei, Jianguo [1 ]
Dang, Jianwu [1 ]
机构
[1] Tianjin Univ, Sch Comp Sci & Technol, Tianjin 300072, Peoples R China
关键词
speaker identification; feature extraction; phoneme mean F-ratio;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The features used for speaker recognition should have more speaker individual information while attenuating the linguistic information. In order to discard the linguistic information effectively, in this paper, we employed the phoneme mean F-ratio method to investigate the different contributions of different frequency region from the point of view of Chinese phoneme, and apply it for speaker identification. It is found that the speaker individual information depending on the phonemes is distributed in different frequency regions of speech sound. Based on the contribution rate, we extracted the new features and combined with GMM model. The experiment for speaker identification task is conducted with a King-ASR Chinese database. Compared with the MFCC feature, the identification error rate with the proposed feature was reduced by 32.94%. The results confirmed that the efficiency of the phoneme mean F-ratio method for improving speaker recognition performance for Chinese.
引用
收藏
页码:345 / 348
页数:4
相关论文
共 50 条
  • [1] A method of speaker identification based on phoneme mean F-ratio contribution
    Hyon, Songgun
    Wang, Hongcui
    Zhao, Chen
    Wei, Jianguo
    Dang, Jianwu
    13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 2669 - 2672
  • [2] An investigation of dependencies between frequency components and speaker characteristics based on phoneme mean F-ratio contribution
    Hyon, Songgun
    Wang, Hongcui
    Wei, Jianguo
    Dang, Jianwu
    2012 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2012,
  • [3] MFCC Extraction Based on F-Ratio and correlated distance criterion in speaker Recognition
    Zhu Jian-wei
    Sun Shui-fa
    Dan Zhi-ping
    Lei Bang-jun
    MINES 2009: FIRST INTERNATIONAL CONFERENCE ON MULTIMEDIA INFORMATION NETWORKING AND SECURITY, VOL 1, PROCEEDINGS, 2009, : 329 - 333
  • [4] Optimization of TESPAR features using robust F-ratio for speaker recognition
    Prasad, K. Satya
    Sheela, K. Anitha
    Sridevi, M.
    2007 INTERNATIONAL CONFERENCE OF SIGNAL PROCESSING, COMMUNICATIONS AND NETWORKING, VOLS 1 AND 2, 2006, : 20 - +
  • [5] Hand gesture recognition using DWT and F-ratio based feature descriptor
    Sahoo, Jaya Prakash
    Ari, Samit
    Ghosh, Dipak Kumar
    IET IMAGE PROCESSING, 2018, 12 (10) : 1780 - 1787
  • [6] An F-ratio based optimization technique for automatic speaker recognition system
    Saha, G
    Chakroborty, S
    Senapati, S
    Proceedings of the IEEE INDICON 2004, 2004, : 70 - 73
  • [7] A New Feature Extraction Method for Bone-conducted Life Sounds based on F-ratio
    An, Yeteng
    Wang, Hongcui
    Hyon, Songgun
    Chen, Sai
    Dang, Jianwu
    INFORMATION TECHNOLOGY APPLICATIONS IN INDUSTRY II, PTS 1-4, 2013, 411-414 : 1598 - 1604
  • [8] An F-ratio based optimization on noisy data for speaker recognition application
    Saha, G
    Senapati, S
    Chakroborty, S
    INDICON 2005 PROCEEDINGS, 2005, : 352 - 355
  • [9] Acoustic Feature Optimization Based on F-Ratio for Robust Speech Recognition
    Sun, Yanqing
    Zhou, Yu
    Zhao, Qingwei
    Yan, Yonghong
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2010, E93D (09): : 2417 - 2430
  • [10] Non-linear speech feature extraction for phoneme classification and speaker recognition
    Chetouani, M
    Faundez-Zanuy, M
    Gas, B
    Zarader, JL
    NONLINEAR SPEECH MODELING AND APPLICATIONS, 2005, 3445 : 344 - 350