ESTIMATION OF THE INVARIANT AND VARIANT CHARACTERISTICS IN SPEECH ARTICULATION AND ITS APPLICATION TO SPEAKER IDENTIFICATION

Cited by: 0
Authors
Prasad, Abhay [1]
Periyasamy, Vijitha [2]
Ghosh, Prasanta Kumar [2]
Affiliations
[1] Manipal Inst Technol, Manipal 576104, Karnataka, India
[2] Indian Inst Sci IISc, Dept Elect Engn, Bangalore 560012, Karnataka, India
Keywords
speech articulation; invariant gestures; speaker identification; FEATURES; PURSUIT;
DOI
Not available
Chinese Library Classification
O42 [Acoustics];
Discipline classification codes
070206 ; 082403 ;
Abstract
Speech articulation varies across speakers for producing a speech sound due to the differences in their vocal tract morphologies, though the speech motor actions are executed in terms of relatively invariant gestures [1]. While the invariant articulatory gestures are driven by the linguistic content of the spoken utterance, the component of speech articulation that varies across speakers reflects speaker-specific and other paralinguistic information. In this work, we present a formulation to decompose the speech articulation from multiple speakers into the variant and invariant aspects when they speak the same sentence. The variant component is found to be a better representation for discriminating speakers compared to the speech articulation which includes the invariant part. Experiments with real-time magnetic resonance imaging (rtMRI) videos of speech production from multiple speakers reveal that the variant component of speech articulation yields a better frame-level speaker identification accuracy compared to the speech articulation as well as acoustic features by 29.9% and 9.4% (absolute) respectively.
Pages: 4265-4269
Page count: 5