Unsupervised Bayesian Surprise Detection in Spatial Audio with Convolutional Variational Autoencoder and LSTM Model

被引：0

作者：

Khah, Arman Nik ^{[1
]}

Htun, Chitsein ^{[1
]}

Prakash, Ravi ^{[1
]}

机构：

[1] Univ Texas Dallas, Richardson, TX 75083 USA

来源：

PROCEEDINGS OF THE 2024 ACM INTERNATIONAL CONFERENCE ON INTERACTIVE MEDIA EXPERIENCES WORKSHOPS, IMXW 2024 | 2024年

关键词：

360 degrees video; spatial audio; visual attention; Bayesian surprise; unsupervised learning; VAE-LSTM; AMBISONICS;

D O I：

10.1145/3672406.3672422

中图分类号：

TP39 [计算机的应用];

学科分类号：

081203 ; 0835 ;

摘要：

Understanding user visual attention (VA) is crucial for Field-of-View (FoV) prediction and resultant bandwidth optimization for 360 degrees video streaming. The influence of spatial audio on VA has been largely overlooked. Traditional methods, using saliency, characterize important stimuli as statistical outliers [4] but fail to capture the Temporal Evolution of Attention (TEA), where initially salient stimuli become routine and less attention-grabbing due to continual exposure [2, 20]. This paper introduces a novel unsupervised deep learning approach using a Convolutional Variational Autoencoder and Long Short-Term Memory (CVAE-LSTM) model to detect Bayesian surprise [2] in spatial audio streams, considering factors such as time, context, and user expectations. Our findings highlight the importance of temporal context in determining the surprisal value of audio events and the selective nature of sensory processing and attention in complex environments.

引用

页码：116 / 121

页数：6

共 50 条

[41] Hybrid robust convolutional autoencoder for unsupervised anomaly detection of machine tools under noises
Yan, Shen
Shao, Haidong
Xiao, Yiming
Liu, Bin
Wan, Jiafu
ROBOTICS AND COMPUTER-INTEGRATED MANUFACTURING, 2023, 79
[42] Unsupervised Convolutional Autoencoder-Based Feature Learning for Automatic Detection of Plant Diseases
Pardede, Hilman F.
Suryawati, Endang
Sustika, Rika
Zilvan, Vicky
2018 INTERNATIONAL CONFERENCE ON COMPUTER, CONTROL, INFORMATICS AND ITS APPLICATIONS (IC3INA), 2018, : 158 - 162
[43] Satellite Unsupervised Anomaly Detection Based on Deconvolution-Reconstructed Temporal Convolutional Autoencoder
Zhao, Haotian
Liu, Ming
Qiu, Shi
Cao, Xibin
IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 2024, 70 (01) : 2989 - 2998
[44] Bayesian finite element model updating with a variational autoencoder and polynomial chaos expansion
Li, Qiang
Ni, Pinghe
Du, Xiuli
Han, Qiang
Xu, Kun
Bai, Yulei
ENGINEERING STRUCTURES, 2024, 316
[45] Full Bayesian Hidden Markov Model Variational Autoencoder for Acoustic Unit Discovery
Glarner, Thomas
Hanebrink, Patrick
Ebbers, Janek
Haeb-Umbach, Reinhold
19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 2688 - 2692
[46] 3D CAD model retrieval based on sketch and unsupervised variational autoencoder
Qin, Feiwei
Qiu, Shi
Gao, Shuming
Bai, Jing
ADVANCED ENGINEERING INFORMATICS, 2022, 51
[47] A deep-learning-based unsupervised model on esophageal manometry using variational autoencoder
Kou, Wenjun
Carlson, Dustin A.
Baumann, Alexandra J.
Donnan, Erica
Luo, Yuan
Pandolfino, John E.
Etemadi, Mozziyar
ARTIFICIAL INTELLIGENCE IN MEDICINE, 2021, 112
[48] VESC: a new variational autoencoder based model for anomaly detection
Zhang, Chunkai
Wang, Xinyu
Zhang, Jiahua
Li, Shaocong
Zhang, Hanyu
Liu, Chuanyi
Han, Peiyi
INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2023, 14 (03) : 683 - 696
[49] Video anomaly detection and localization via Gaussian Mixture Fully Convolutional Variational Autoencoder
Fan, Yaxiang
Wen, Gongjian
Li, Deren
Qiu, Shaohua
Levine, Martin D.
Xiao, Fei
COMPUTER VISION AND IMAGE UNDERSTANDING, 2020, 195
[50] VESC: a new variational autoencoder based model for anomaly detection
Chunkai Zhang
Xinyu Wang
Jiahua Zhang
Shaocong Li
Hanyu Zhang
Chuanyi Liu
Peiyi Han
International Journal of Machine Learning and Cybernetics, 2023, 14 : 683 - 696

← 1 2 3 4 5 →