Unsupervised Bayesian Surprise Detection in Spatial Audio with Convolutional Variational Autoencoder and LSTM Model

被引:0
|
作者
Khah, Arman Nik [1 ]
Htun, Chitsein [1 ]
Prakash, Ravi [1 ]
机构
[1] Univ Texas Dallas, Richardson, TX 75083 USA
关键词
360 degrees video; spatial audio; visual attention; Bayesian surprise; unsupervised learning; VAE-LSTM; AMBISONICS;
D O I
10.1145/3672406.3672422
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Understanding user visual attention (VA) is crucial for Field-of-View (FoV) prediction and resultant bandwidth optimization for 360 degrees video streaming. The influence of spatial audio on VA has been largely overlooked. Traditional methods, using saliency, characterize important stimuli as statistical outliers [4] but fail to capture the Temporal Evolution of Attention (TEA), where initially salient stimuli become routine and less attention-grabbing due to continual exposure [2, 20]. This paper introduces a novel unsupervised deep learning approach using a Convolutional Variational Autoencoder and Long Short-Term Memory (CVAE-LSTM) model to detect Bayesian surprise [2] in spatial audio streams, considering factors such as time, context, and user expectations. Our findings highlight the importance of temporal context in determining the surprisal value of audio events and the selective nature of sensory processing and attention in complex environments.
引用
收藏
页码:116 / 121
页数:6
相关论文
共 50 条
  • [41] Hybrid robust convolutional autoencoder for unsupervised anomaly detection of machine tools under noises
    Yan, Shen
    Shao, Haidong
    Xiao, Yiming
    Liu, Bin
    Wan, Jiafu
    ROBOTICS AND COMPUTER-INTEGRATED MANUFACTURING, 2023, 79
  • [42] Unsupervised Convolutional Autoencoder-Based Feature Learning for Automatic Detection of Plant Diseases
    Pardede, Hilman F.
    Suryawati, Endang
    Sustika, Rika
    Zilvan, Vicky
    2018 INTERNATIONAL CONFERENCE ON COMPUTER, CONTROL, INFORMATICS AND ITS APPLICATIONS (IC3INA), 2018, : 158 - 162
  • [43] Satellite Unsupervised Anomaly Detection Based on Deconvolution-Reconstructed Temporal Convolutional Autoencoder
    Zhao, Haotian
    Liu, Ming
    Qiu, Shi
    Cao, Xibin
    IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 2024, 70 (01) : 2989 - 2998
  • [44] Bayesian finite element model updating with a variational autoencoder and polynomial chaos expansion
    Li, Qiang
    Ni, Pinghe
    Du, Xiuli
    Han, Qiang
    Xu, Kun
    Bai, Yulei
    ENGINEERING STRUCTURES, 2024, 316
  • [45] Full Bayesian Hidden Markov Model Variational Autoencoder for Acoustic Unit Discovery
    Glarner, Thomas
    Hanebrink, Patrick
    Ebbers, Janek
    Haeb-Umbach, Reinhold
    19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 2688 - 2692
  • [46] 3D CAD model retrieval based on sketch and unsupervised variational autoencoder
    Qin, Feiwei
    Qiu, Shi
    Gao, Shuming
    Bai, Jing
    ADVANCED ENGINEERING INFORMATICS, 2022, 51
  • [47] A deep-learning-based unsupervised model on esophageal manometry using variational autoencoder
    Kou, Wenjun
    Carlson, Dustin A.
    Baumann, Alexandra J.
    Donnan, Erica
    Luo, Yuan
    Pandolfino, John E.
    Etemadi, Mozziyar
    ARTIFICIAL INTELLIGENCE IN MEDICINE, 2021, 112
  • [48] VESC: a new variational autoencoder based model for anomaly detection
    Zhang, Chunkai
    Wang, Xinyu
    Zhang, Jiahua
    Li, Shaocong
    Zhang, Hanyu
    Liu, Chuanyi
    Han, Peiyi
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2023, 14 (03) : 683 - 696
  • [49] Video anomaly detection and localization via Gaussian Mixture Fully Convolutional Variational Autoencoder
    Fan, Yaxiang
    Wen, Gongjian
    Li, Deren
    Qiu, Shaohua
    Levine, Martin D.
    Xiao, Fei
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2020, 195
  • [50] VESC: a new variational autoencoder based model for anomaly detection
    Chunkai Zhang
    Xinyu Wang
    Jiahua Zhang
    Shaocong Li
    Hanyu Zhang
    Chuanyi Liu
    Peiyi Han
    International Journal of Machine Learning and Cybernetics, 2023, 14 : 683 - 696