Unsupervised Speaker Diarization in Distributed IoT Networks Using Federated Learning

被引:0
|
作者
Bhuyan, Amit Kumar [1 ]
Dutta, Hrishikesh [1 ]
Biswas, Subir [1 ]
机构
[1] Michigan State Univ, Dept Elect & Comp Engn, E Lansing, MI 48823 USA
关键词
Mel frequency cepstral coefficient; Computational modeling; Accuracy; Oral communication; Training; Bayes methods; Feature extraction; Data models; Computational intelligence; Unsupervised Learning; Bayesian methods; federated learning; distributed processing; Hotelling's t-squared statistic; Bayesian information criterion; cepstral analysis; SEGMENTATION;
D O I
10.1109/TETCI.2024.3482855
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents a computationally efficient and distributed speaker diarization framework for networked IoT-style audio devices. The work proposes a Federated Learning model which can identify the participants in a conversation without the requirement of a large audio database for training. An unsupervised online update mechanism is proposed for the Federated Learning model which depends on cosine similarity of speaker embeddings. Moreover, the proposed diarization system solves the problem of speaker change detection via. unsupervised segmentation techniques using Hotelling's t-squared Statistic and Bayesian Information Criterion. In this new approach, speaker change detection is biased around detected quasi-silences, which reduces the severity of the trade-off between the missed detection and false detection rates. Additionally, the computational overhead due to frame-by-frame identification of speakers is reduced via. unsupervised clustering of speech segments. The results demonstrate the effectiveness of the proposed training method in the presence of non-IID speech data. It also shows a considerable improvement in the reduction of false and missed detection at the segmentation stage, while reducing the computational overhead. Improved accuracy and reduced computational cost makes the mechanism suitable for real-time speaker diarization across a distributed IoT audio network.
引用
收藏
页数:13
相关论文
共 50 条
  • [31] Federated Learning for IoT Networks: Enhancing Efficiency and Privacy
    Zahri, Sofia
    Bennouri, Hajar
    Chehri, Abdellah
    Abdelmoniem, Ahmed M.
    2023 IEEE 9TH WORLD FORUM ON INTERNET OF THINGS, WF-IOT, 2023,
  • [32] SPEAKER DIARIZATION WITH PLDA I-VECTOR SCORING AND UNSUPERVISED CALIBRATION
    Sell, Gregory
    Garcia-Romero, Daniel
    2014 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY SLT 2014, 2014, : 413 - 417
  • [33] Federated Deep Learning for Intrusion Detection in IoT Networks
    Belarbi, Othmane
    Spyridopoulos, Theodoros
    Anthi, Eirini
    Mavromatis, Ioannis
    Carnelli, Pietro
    Khan, Aftab
    IEEE CONFERENCE ON GLOBAL COMMUNICATIONS, GLOBECOM, 2023, : 237 - 242
  • [34] Explainable Federated Learning for Botnet Detection in IoT Networks
    Kalakoti, Rajesh
    Bahsi, Hayretdin
    Nomm, Sven
    2024 IEEE INTERNATIONAL CONFERENCE ON CYBER SECURITY AND RESILIENCE, CSR, 2024, : 22 - 29
  • [35] Novel Architectures for Unsupervised Information Bottleneck Based Speaker Diarization of Meetings
    Dawalatabad, Nauman
    Madikeri, Srikanth
    Sekhar, C. Chandra
    Murthy, Hema A.
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2021, 29 : 14 - 27
  • [36] A federated learning approach to network intrusion detection using residual networks in industrial IoT networks
    Chaurasia, Nisha
    Ram, Munna
    Verma, Priyanka
    Mehta, Nakul
    Bharot, Nitesh
    JOURNAL OF SUPERCOMPUTING, 2024, 80 (13): : 18325 - 18346
  • [37] An Unsupervised Neural Prediction Framework for Learning Speaker Embeddings using Recurrent Neural Networks
    Jati, Arindam
    Georgiou, Panayiotis
    19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 1131 - 1135
  • [38] ACCELERATED UNSUPERVISED CLUSTERING IN ACOUSTIC SENSOR NETWORKS USING FEDERATED LEARNING AND A VARIATIONAL AUTOENCODER
    Becker, Luca
    Nelus, Alexandra
    Glitza, Rene
    Martin, Rainer
    2022 INTERNATIONAL WORKSHOP ON ACOUSTIC SIGNAL ENHANCEMENT (IWAENC 2022), 2022,
  • [39] Identifying Malicious Nodes in Multihop IoT Networks using Diversity and Unsupervised Learning
    Liu, Xin
    Abdelhakim, Mai
    Krishnamurthy, Prashant
    Tipper, David
    2018 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC), 2018,
  • [40] Federated Learning for Privacy-Preserving Machine Learning in IoT Networks
    Anitha, G.
    Jegatheesan, A.
    2024 SECOND INTERNATIONAL CONFERENCE ON INTELLIGENT CYBER PHYSICAL SYSTEMS AND INTERNET OF THINGS, ICOICI 2024, 2024, : 338 - 342