Graph-based Label Propagation for Semi-Supervised Speaker Identification

被引:6
|
作者
Chen, Long [1 ]
Ravichandran, Venkatesh [1 ]
Stolcke, Andreas [1 ]
机构
[1] Amazon Alexa, Seattle, WA 98121 USA
来源
关键词
semi-supervised learning; speaker recognition; label propagation; graph-based learning;
D O I
10.21437/Interspeech.2021-1209
中图分类号
R36 [病理学]; R76 [耳鼻咽喉科学];
学科分类号
100104 ; 100213 ;
摘要
Speaker identification in the household scenario (e.g., for smart speakers) is typically based on only a few enrollment utterances but a much larger set of unlabeled data, suggesting semi-supervised learning to improve speaker profiles. We propose a graph-based semi-supervised learning approach for speaker identification in the household scenario, to leverage the unlabeled speech samples. In contrast to most of the works in speaker recognition that focus on speaker-discriminative embeddings, this work focuses on speaker label inference (scoring). Given a pre-trained embedding extractor, graph-based learning allows us to integrate information about both labeled and unlabeled utterances. Considering each utterance as a graph node, we represent pairwise utterance similarity scores as edge weights. Graphs are constructed per household, and speaker identities are propagated to unlabeled nodes to optimize a global consistency criterion. We show in experiments on the VoxCeleb dataset that this approach makes effective use of unlabeled data and improves speaker identification accuracy compared to two state-of-the-art scoring methods as well as their semi-supervised variants based on pseudo-labels.
引用
收藏
页码:4588 / 4592
页数:5
相关论文
共 50 条
  • [41] Graph-based multimodal semi-supervised image classification
    Xie, Wenxuan
    Lu, Zhiwu
    Peng, Yuxin
    Xiao, Jianguo
    NEUROCOMPUTING, 2014, 138 : 167 - 179
  • [42] Graph-Based Semi-Supervised Learning with Redundant Views
    Gong, Yun-Chao
    Chen, Chuan-Liang
    Tian, Yin-Jie
    19TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOLS 1-6, 2008, : 1393 - +
  • [43] Graph-based Semi-supervised Learning for Text Classification
    Widmann, Natalie
    Verberne, Suzan
    ICTIR'17: PROCEEDINGS OF THE 2017 ACM SIGIR INTERNATIONAL CONFERENCE THEORY OF INFORMATION RETRIEVAL, 2017, : 59 - 66
  • [44] Learning Flexible Graph-Based Semi-Supervised Embedding
    Dornaika, Fadi
    El Traboulsi, Youssof
    IEEE TRANSACTIONS ON CYBERNETICS, 2016, 46 (01) : 206 - 218
  • [45] Graph-based Semi-Supervised Regression and Its Extensions
    Guo, Xinlu
    Uehara, Kuniaki
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2015, 6 (06) : 260 - 269
  • [46] Graph-Based Semi-Supervised Learning on Evolutionary Data
    Song, Yanglei
    Yang, Yifei
    Dou, Weibei
    Zhang, Changshui
    INTELLIGENCE SCIENCE AND BIG DATA ENGINEERING: BIG DATA AND MACHINE LEARNING TECHNIQUES, ISCIDE 2015, PT II, 2015, 9243 : 467 - 476
  • [47] Graph-Based Semi-Supervised Learning as a Generative Model
    He, Jingrui
    Carbonell, Jaime
    Liu, Yan
    20TH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2007, : 2492 - 2497
  • [48] Robust and Flexible Graph-based Semi-supervised Embedding
    Dornaika, F.
    El Traboulsi, Y.
    Zhu, R.
    2018 24TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2018, : 465 - 470
  • [49] Coded Distributed Graph-Based Semi-Supervised Learning
    Du, Ying
    Tan, Siqi
    Han, Kaifeng
    Jiang, Jiamo
    Wang, Zhiqin
    Chen, Li
    2022 14TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS AND SIGNAL PROCESSING, WCSP, 2022, : 367 - 372