An approach to statistical lip modelling for speaker identification via chromatic feature extraction

被引：0

作者：

Wark, T ^{[1
]}

Sridharan, S ^{[1
]}

Chandran, V ^{[1
]}

机构：

[1] Queensland Univ Technol, Sch Elect Elect & Syst Engn, Speech Res Lab, Brisbane, Qld 4001, Australia

来源：

FOURTEENTH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOLS 1 AND 2 | 1998年

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

This paper presents a novel technique for the tracking of moving lips for the purpose of speaker identification. In our system, a model of the lip contour is formed directly from chromatic information in the lip region. Iterative refinement of contour point estimates is not required. Colour features are extracted from the lips via concatenated profiles taken around the lip contour. Reduction of order in lip features is obtained via principal component analysis (PCA) followed by linear discriminant analysis (LDA). Statistical speaker models are built from the lip features based on the Gaussian Mixture Model (GMM). identification experiments performed on the M2VTS(1) database [5], show encouraging results.

引用

页码：123 / 125

页数：3

共 50 条

[31] An Approach towards Ear Feature Extraction for Human Identification
Saranya, M.
Cyril, G. L. Infant
Santhosh, R. R.
[J]. 2016 INTERNATIONAL CONFERENCE ON ELECTRICAL, ELECTRONICS, AND OPTIMIZATION TECHNIQUES (ICEEOT), 2016, : 4824 - 4828
[32] An approach to dynamic modelling and topographic feature extraction of wake EEG
Lowe, D
[J]. INTERNATIONAL CONFERENCE ON ADVANCES IN PATTERN RECOGNITION, 1999, : 145 - 153
[33] Speaker identification using discriminative feature selection - a growing neural gas approach
Sabac, B
Gavat, I
[J]. NEUREL 2000: PROCEEDINGS OF THE 5TH SEMINAR ON NEURAL NETWORK APPLICATIONS IN ELECTRICAL ENGINEERING, 2000, : 105 - 108
[34] POLSAR IMAGE SEGMENTATION - ADVANCED STATISTICAL MODELLING VERSUS SIMPLE FEATURE EXTRACTION
Doulgeris, A. P.
Eltoft, T.
[J]. 2014 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS), 2014, : 1021 - 1024
[35] STUDY OF STATISTICAL ROBUST CLOSED SET SPEAKER IDENTIFICATION WITH FEATURE AND SCORE-BASED FUSION
Al-Kaltakchi, Musab T. S.
Woo, Wai L.
Dlay, Satnam S.
Chambers, Jonathon A.
[J]. 2016 IEEE STATISTICAL SIGNAL PROCESSING WORKSHOP (SSP), 2016,
[36] Automatic Speaker Recognition :An Approach using DWT based Feature Extraction and Vector Quantization
Singhai, Jyoti
Singhai, Rakesh
[J]. IETE TECHNICAL REVIEW, 2007, 24 (05) : 395 - 402
[37] Robust Feature Extraction Using Temporal Context Averaging for Speaker Identification in Diverse Acoustic Environments
Terraf, Yassin
Iraqi, Youssef
[J]. IEEE ACCESS, 2024, 12 : 14094 - 14115
[38] The exploitation of Multiple Feature Extraction Techniques for Speaker Identification in Emotional States under Disguised Voices
Al Hindawi, Noor Ahmad
Shahin, Ismail
Nassif, Ali Bou
[J]. 2021 14TH INTERNATIONAL CONFERENCE ON DEVELOPMENTS IN ESYSTEMS ENGINEERING (DESE), 2021, : 269 - 273
[39] EFFICIENT FEATURE EXTRACTION OF SPEAKER IDENTIFICATION USING PHONEME MEAN F-RATIO FOR CHINESE
Zhao, Chen
Wang, Hongcui
Hyon, Songgun
Wei, Jianguo
Dang, Jianwu
[J]. 2012 8TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING, 2012, : 345 - 348
[40] An Auditory-Based Feature Extraction Algorithm for Robust Speaker Identification Under Mismatched Conditions
Li, Qi
Huang, Yan
[J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2011, 19 (06): : 1791 - 1801

← 1 2 3 4 5 →