Unsupervised Audio Segmentation based on Restricted Boltzmann Machines

被引:0
|
作者
Pikrakis, Aggelos [1 ]
机构
[1] Univ Piraeus, Sch Informat & Commun Technol, Dept Informat, Piraeus, Greece
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper the Conditional Restricted Boltzmann Machine (CRBM) is employed in the context of unsupervised audio segmentation. The CRBM acts as a temporal modeling method and learns, from a maximum likelihood perspective, the temporal relationships of the feature vectors that have been extracted from a large corpus of training data. After the CRBM has been trained, we quantify the correlation of the activation of the neurons of the hidden layer for successive feature vectors by means of an appropriately defined similarity function. A simple thresholding scheme is then applied on the output of the similarity function to segment automatically the audio recording. Our experiments have been carried out on a large corpus of documentaries. We provide an interpretation of the segmentation results and comment on the segmentation efficiency of the method.
引用
收藏
页码:311 / 314
页数:4
相关论文
共 50 条
  • [31] Hyperparameters Adaptation for Restricted Boltzmann Machines Based on Free Energy
    Yao, Chengwei
    Chen, Gencai
    [J]. 2016 8TH INTERNATIONAL CONFERENCE ON INTELLIGENT HUMAN-MACHINE SYSTEMS AND CYBERNETICS (IHMSC), VOL. 2, 2016, : 243 - 248
  • [32] Collaborative Filtering Based on Dual Conditional Restricted Boltzmann Machines
    Li, Chunchun
    Li, Jun
    [J]. PROCEEDINGS OF THE 36TH CHINESE CONTROL CONFERENCE (CCC 2017), 2017, : 10871 - 10874
  • [33] Nonequilibrium thermodynamics of restricted Boltzmann machines
    Salazar, Domingos S. P.
    [J]. PHYSICAL REVIEW E, 2017, 96 (02)
  • [34] Generalising the Discriminative Restricted Boltzmann Machines
    Cherla, Srikanth
    Tran, Son N.
    Garcez, Artur d'Avila
    Weyde, Tillman
    [J]. ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, PT II, 2017, 10614 : 111 - 119
  • [35] Restricted Boltzmann machines in quantum physics
    Roger G. Melko
    Giuseppe Carleo
    Juan Carrasquilla
    J. Ignacio Cirac
    [J]. Nature Physics, 2019, 15 : 887 - 892
  • [36] INFORMATION AND REGULARIZATION IN RESTRICTED BOLTZMANN MACHINES
    Vera, Matias
    Rey Vega, Leonardo
    Piantanida, Pablo
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 3155 - 3159
  • [37] Restricted Boltzmann Machines for Gender Classification
    Mansanet, Jordi
    Albiol, Alberto
    Paredes, Roberto
    Villegas, Mauricio
    Albiol, Antonio
    [J]. IMAGE ANALYSIS AND RECOGNITION, ICIAR 2014, PT I, 2014, 8814 : 274 - 281
  • [38] Training restricted Boltzmann machines: An introduction
    Fischer, Asja
    Igel, Christian
    [J]. PATTERN RECOGNITION, 2014, 47 (01) : 25 - 39
  • [39] Restricted Boltzmann machines in quantum physics
    Melko, Roger G.
    Carleo, Giuseppe
    Carrasquilla, Juan
    Cirac, J. Ignacio
    [J]. NATURE PHYSICS, 2019, 15 (09) : 887 - 892
  • [40] Wasserstein Training of Restricted Boltzmann Machines
    Montavon, Gregoire
    Mueller, Klaus-Robert
    Cuturi, Marco
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 29 (NIPS 2016), 2016, 29