Speaker Diarization with Session-Level Speaker Embedding Refinement Using Graph Neural Networks

Cited by: 0
Authors
Wang, Jixuan [1]
Xiao, Xiong [2]
Wu, Jian [2]
Ramamurthy, Ranjani [2]
Rudzicz, Frank [1]
Brudno, Michael [1]
Affiliations
[1] University of Toronto, Canada
[2] Microsoft, United States
Keywords
Compendex
DOI
9054176
Abstract
Graph neural networks - Speech recognition - Deep neural networks - Clustering algorithms - Matrix algebra
Pages: 7109-7113
Related papers
50 in total
  • [21] SPEAKER CHANGE DETECTION AND SPEAKER DIARIZATION USING SPATIAL INFORMATION
    Hu, Mathieu
    Sharma, Dushyant
    Doclo, Simon
    Brookes, Mike
    Naylor, Patrick A.
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015: 5743 - 5747
  • [22] Speaker diarization system using HXLPS and deep neural network
    Ramaiah, V. Subba
    Rao, R. Rajeswara
    [J]. ALEXANDRIA ENGINEERING JOURNAL, 2018, 57 (01) : 255 - 266
  • [23] Speaker Diarization and Detection System using A Priori Speaker Information
    Kenai, Ouassila
    Asbai, Nassim
    Ouamour, Siham
    Guerti, Mhania
    Djeghiour, Salim
    [J]. 2018 2ND INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE AND SPEECH PROCESSING (ICNLSP), 2018: 73 - 78
  • [24] MULTI-SCALE SPEAKER EMBEDDING-BASED GRAPH ATTENTION NETWORKS FOR SPEAKER DIARISATION
    Kwon, Youngki
    Heo, Hee-Soo
    Jung, Jee-Weon
    Kim, You Jin
    Lee, Bong-Jin
    Chung, Joon Son
    [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022: 8367 - 8371
  • [25] ARTIFICIAL NEURAL NETWORK FEATURES FOR SPEAKER DIARIZATION
    Yella, Harsha
    Stolcke, Andreas
    Slaney, Malcolm
    [J]. 2014 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY SLT 2014, 2014: 402 - 406
  • [26] Speaker Diarization through Waveform and Neural Net
    Latypov, Rustam
    Stolov, Evgeni
    [J]. PROCEEDINGS OF THE 2021 29TH CONFERENCE OF OPEN INNOVATIONS ASSOCIATION (FRUCT), VOL 1, 2021: 234 - 239
  • [27] CONVOLUTIONAL NEURAL NETWORK FOR SPEAKER CHANGE DETECTION IN TELEPHONE SPEAKER DIARIZATION SYSTEM
    Hruz, Marek
    Zajic, Zbynek
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017: 4945 - 4949
  • [28] TOWARDS END-TO-END SPEAKER DIARIZATION WITH GENERALIZED NEURAL SPEAKER CLUSTERING
    Zhang, Chunlei
    Shi, Jiatong
    Weng, Chao
    Yu, Meng
    Yu, Dong
    [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022: 8372 - 8376
  • [29] Combining speaker turn embedding and incremental structure prediction for low-latency speaker diarization
    Wisniewski, Guillaume
    Bredin, Herve
    Gelly, Gregory
    Barras, Claude
    [J]. 18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017: 3582 - 3586
  • [30] Speaker Diarization Using Gesture and Speech
    Gebre, Binyam Gebrekidan
    Wittenburg, Peter
    Drude, Sebastian
    Huijbregts, Marijn
    Heskes, Tom
    [J]. 15TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2014), VOLS 1-4, 2014: 582 - 586