ESTIMATION OF THE INVARIANT AND VARIANT CHARACTERISTICS IN SPEECH ARTICULATION AND ITS APPLICATION TO SPEAKER IDENTIFICATION

被引：0

作者：

Prasad, Abhay ^{[1
]}

Periyasamy, Vijitha ^{[2
]}

Ghosh, Prasanta Kumar ^{[2
]}

机构：

[1] Manipal Inst Technol, Manipal 576104, Karnataka, India

[2] Indian Inst Sci IISc, Dept Elect Engn, Bangalore 560012, Karnataka, India

来源：

2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP) | 2015年

关键词：

speech articulation; invariant gestures; speaker identification; FEATURES; PURSUIT;

D O I：

暂无

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

Speech articulation varies across speakers for producing a speech sound due to the differences in their vocal tract morphologies, though the speech motor actions are executed in terms of relatively invariant gestures [1]. While the invariant articulatory gestures are driven by the linguistic content of the spoken utterance, the component of speech articulation that varies across speakers reflects speaker-specific and other paralinguistic information. In this work, we present a formulation to decompose the speech articulation from multiple speakers into the variant and invariant aspects when they speak the same sentence. The variant component is found to be a better representation for discriminating speakers compared to the speech articulation which includes the invariant part. Experiments with real-time magnetic resonance imaging (rtMRI) videos of speech production from multiple speakers reveal that the variant component of speech articulation yields a better frame-level speaker identification accuracy compared to the speech articulation as well as acoustic features by 29.9% and 9.4% (absolute) respectively.

引用

页码：4265 / 4269

页数：5

共 50 条

[11] A new frequency scale of Chinese whispered speech in the application of speaker identification
Lin Wei
Yang Lili
Xu Boling
PROGRESS IN NATURAL SCIENCE-MATERIALS INTERNATIONAL, 2006, 16 (10) : 1072 - 1078
[12] A new frequency scale of Chinese whispered speech in the application of speaker identification
LIN Wei
ProgressinNaturalScience, 2006, (10) : 1072 - 1078
[13] Studies on inter-speaker variability in speech and its application in automatic speech recognition
S UMESH
Sadhana, 2011, 36 : 853 - 883
[14] Studies on inter-speaker variability in speech and its application in automatic speech recognition
Umesh, S.
SADHANA-ACADEMY PROCEEDINGS IN ENGINEERING SCIENCES, 2011, 36 (05): : 853 - 883
[15] EFFECTIVENESS OF LINEAR PREDICTION CHARACTERISTICS OF SPEECH WAVE FOR AUTOMATIC SPEAKER IDENTIFICATION AND VERIFICATION
ATAL, BS
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1974, 55 (06): : 1304 - 1312
[16] Place of Articulation from Direct Imaging for Validation of Its Estimation from Speech Analysis for Use in Speech Training
Nataraj, K. S.
Pandey, Prem C.
2015 FIFTH NATIONAL CONFERENCE ON COMPUTER VISION, PATTERN RECOGNITION, IMAGE PROCESSING AND GRAPHICS (NCVPRIPG), 2015,
[17] Identification of soundbite and its speaker name using transcripts of broadcast news speech
Liu F.
Liu Y.
ACM Transactions on Asian Language Information Processing, 2010, 9 (01):
[18] SPEAKER IDENTIFICATION BY SPEECH SPECTROGRAMS - A SCIENTISTS VIEW OF ITS RELIABILITY FOR LEGAL PURPOSES
BOLT, RH
COOPER, FS
DAVID, EE
DENES, PB
PICKETT, JM
STEVENS, KN
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1970, 47 (02): : 597 - &
[19] Support Vector Machines Approaches and its Application to Speaker Identification
Boujelbene, S. Zribi
Mezghani, D. Ben Ayed
Ellouze, N.
2009 3RD IEEE INTERNATIONAL CONFERENCE ON DIGITAL ECOSYSTEMS AND TECHNOLOGIES, 2009, : 236 - +
[20] Adaptation of ANN for FPGA implementation and its application for speaker identification
Elmisery, FA
Khalil, AH
Salama, AE
Algeldawy, F
ICEEC'04: 2004 INTERNATIONAL CONFERENCE ON ELECTRICAL, ELECTRONIC AND COMPUTER ENGINEERING, PROCEEDINGS, 2004, : 317 - 320

← 1 2 3 4 5 →