Investigating Long-Term and Short-Term Time-Varying Speaker Verification

被引:0
|
作者
Qin, Xiaoyi [1 ,2 ]
Li, Na [3 ]
Duan, Shufei [4 ]
Li, Ming [1 ,2 ]
机构
[1] Wuhan Univ, Sch Comp Sci, Wuhan 430072, Peoples R China
[2] Duke Kunshan Univ, Data Sci Res Ctr, Suzhou Municipal Key Lab Multimodal Intelligent Sy, Suzhou 215316, Peoples R China
[3] Tencent AI Lab, Shenzhen 518054, Peoples R China
[4] Taiyuan Univ Technol, Coll Elect Informat & Opt Engn, Taiyuan 030024, Peoples R China
基金
中国国家自然科学基金;
关键词
Task analysis; Aging; Time-varying systems; Videos; Recording; Face recognition; Databases; Cross-age; reinforcement learning; speaker verification; template updating; time-varying; AGE;
D O I
10.1109/TASLP.2024.3428910
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
The performance of speaker verification systems can be adversely affected by time domain variations. However, limited research has been conducted on time-varying speaker verification due to the absence of appropriate datasets. This paper aims to investigate the impact of long-term and short-term time-varying in speaker verification and proposes solutions to mitigate these effects. For long-term speaker verification (i.e., cross-age speaker verification), we introduce an age-decoupling adversarial learning method to learn age-invariant speaker representation by mining age information from the VoxCeleb dataset. For short-term speaker verification, we collect the SMIIP-TimeVarying (SMIIP-TV) Dataset, which includes recordings at multiple time slots every day from 373 speakers for 90 consecutive days and other relevant meta information. Using this dataset, we analyze the time-varying of speaker embeddings and propose a novel but realistic time-varying speaker verification task, termed incremental sequence-pair speaker verification. This task involves continuous interaction between enrollment audios and a sequence of testing audios with the aim of improving performance over time. We introduce the template updating method to counter the negative effects over time, and then formulate the template updating processing as a Markov Decision Process and propose a template updating method based on deep reinforcement learning (DRL). The policy network of DRL is treated as an agent to determine if and how much should the template be updated. In summary, this paper releases our collected database, investigates both the long-term and short-term time-varying scenarios and provides insights and solutions into time-varying speaker verification.
引用
收藏
页码:3408 / 3423
页数:16
相关论文
共 50 条
  • [1] SHORT-TERM AND LONG-TERM UNEMPLOYMENT - A PARAMETRIC MODEL WITH TIME-VARYING EFFECTS
    PORTUGAL, P
    ADDISON, JT
    [J]. OXFORD BULLETIN OF ECONOMICS AND STATISTICS, 1995, 57 (02) : 205 - 227
  • [2] A Bayesian time-varying autoregressive model for improved short-term and long-term prediction
    Berninger, Christoph
    Stoecker, Almond
    Rugamer, David
    [J]. JOURNAL OF FORECASTING, 2022, 41 (01) : 181 - 200
  • [3] Distributed stochastic power control for time-varying long-term and short-term fading wireless networks
    Olama, Mohannned M.
    Djouadi, Seddik M.
    Charalambous, Charalambous D.
    Sahyoun, Samir
    [J]. 2007 AMERICAN CONTROL CONFERENCE, VOLS 1-13, 2007, : 4970 - +
  • [4] Augmenting Short-term Cepstral Features with Long-term Discriminative Features for Speaker Verification of Telephone Data
    Do, Cong-Thanh
    Barras, Claude
    Le, Viet-Bac
    Sarkar, Achintya K.
    [J]. 14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 2483 - 2487
  • [5] A SHORT-TERM IN LONG-TERM
    KELLY, FL
    [J]. AMERICAN JOURNAL OF NURSING, 1988, 88 (11) : 1479 - 1480
  • [6] When a Short-Term Outlook Is the Best Long-Term Strategy: Time-Varying Risk of Readmission After Acute Myocardial Infarction
    Levy, Andrew E.
    Allen, Larry A.
    [J]. JOURNAL OF THE AMERICAN HEART ASSOCIATION, 2018, 7 (21):
  • [7] SHORT-TERM PROACTIVE INTERFERENCE - SHORT-TERM OR LONG-TERM
    LOFTUS, GR
    PATTERSON, KK
    [J]. BULLETIN OF THE PSYCHONOMIC SOCIETY, 1974, 4 (NA4) : 240 - 240
  • [8] Cooperating in Long-term Relationships with Time-Varying Structure
    Crandall, Jacob W.
    Huy Pham
    [J]. AAMAS '19: PROCEEDINGS OF THE 18TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS, 2019, : 1898 - 1900
  • [9] Time-varying long-term memory in Bitcoin market
    Jiang Yonghong
    Nie He
    Ruan Weihua
    [J]. FINANCE RESEARCH LETTERS, 2018, 25 : 280 - 284
  • [10] Communications in physiology: Short-term and long-term time scales
    Hoyer, Dirk
    Stein, Phyllis K.
    Schmidt, Hendrik
    [J]. INFLAMMATION RESEARCH, 2007, 56 : S231 - S231