EFFICIENT FEATURE EXTRACTION OF SPEAKER IDENTIFICATION USING PHONEME MEAN F-RATIO FOR CHINESE

被引：0

作者：

Zhao, Chen ^{[1
]}

Wang, Hongcui ^{[1
]}

Hyon, Songgun ^{[1
]}

Wei, Jianguo ^{[1
]}

Dang, Jianwu ^{[1
]}

机构：

[1] Tianjin Univ, Sch Comp Sci & Technol, Tianjin 300072, Peoples R China

来源：

2012 8TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING | 2012年

关键词：

speaker identification; feature extraction; phoneme mean F-ratio;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The features used for speaker recognition should have more speaker individual information while attenuating the linguistic information. In order to discard the linguistic information effectively, in this paper, we employed the phoneme mean F-ratio method to investigate the different contributions of different frequency region from the point of view of Chinese phoneme, and apply it for speaker identification. It is found that the speaker individual information depending on the phonemes is distributed in different frequency regions of speech sound. Based on the contribution rate, we extracted the new features and combined with GMM model. The experiment for speaker identification task is conducted with a King-ASR Chinese database. Compared with the MFCC feature, the identification error rate with the proposed feature was reduced by 32.94%. The results confirmed that the efficiency of the phoneme mean F-ratio method for improving speaker recognition performance for Chinese.

引用

页码：345 / 348

页数：4

共 50 条

[1] A method of speaker identification based on phoneme mean F-ratio contribution
Hyon, Songgun
Wang, Hongcui
Zhao, Chen
Wei, Jianguo
Dang, Jianwu
13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 2669 - 2672
[2] An investigation of dependencies between frequency components and speaker characteristics based on phoneme mean F-ratio contribution
Hyon, Songgun
Wang, Hongcui
Wei, Jianguo
Dang, Jianwu
2012 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2012,
[3] MFCC Extraction Based on F-Ratio and correlated distance criterion in speaker Recognition
Zhu Jian-wei
Sun Shui-fa
Dan Zhi-ping
Lei Bang-jun
MINES 2009: FIRST INTERNATIONAL CONFERENCE ON MULTIMEDIA INFORMATION NETWORKING AND SECURITY, VOL 1, PROCEEDINGS, 2009, : 329 - 333
[4] Optimization of TESPAR features using robust F-ratio for speaker recognition
Prasad, K. Satya
Sheela, K. Anitha
Sridevi, M.
2007 INTERNATIONAL CONFERENCE OF SIGNAL PROCESSING, COMMUNICATIONS AND NETWORKING, VOLS 1 AND 2, 2006, : 20 - +
[5] Hand gesture recognition using DWT and F-ratio based feature descriptor
Sahoo, Jaya Prakash
Ari, Samit
Ghosh, Dipak Kumar
IET IMAGE PROCESSING, 2018, 12 (10) : 1780 - 1787
[6] An F-ratio based optimization technique for automatic speaker recognition system
Saha, G
Chakroborty, S
Senapati, S
Proceedings of the IEEE INDICON 2004, 2004, : 70 - 73
[7] A New Feature Extraction Method for Bone-conducted Life Sounds based on F-ratio
An, Yeteng
Wang, Hongcui
Hyon, Songgun
Chen, Sai
Dang, Jianwu
INFORMATION TECHNOLOGY APPLICATIONS IN INDUSTRY II, PTS 1-4, 2013, 411-414 : 1598 - 1604
[8] An F-ratio based optimization on noisy data for speaker recognition application
Saha, G
Senapati, S
Chakroborty, S
INDICON 2005 PROCEEDINGS, 2005, : 352 - 355
[9] Acoustic Feature Optimization Based on F-Ratio for Robust Speech Recognition
Sun, Yanqing
Zhou, Yu
Zhao, Qingwei
Yan, Yonghong
IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2010, E93D (09): : 2417 - 2430
[10] Non-linear speech feature extraction for phoneme classification and speaker recognition
Chetouani, M
Faundez-Zanuy, M
Gas, B
Zarader, JL
NONLINEAR SPEECH MODELING AND APPLICATIONS, 2005, 3445 : 344 - 350

← 1 2 3 4 5 →