Audible Panorama: Automatic Spatial Audio Generation for Panorama Imagery

被引:0
|
作者
Huang, Haikun [1 ]
Solah, Michael [1 ]
Li, Dingzeyu [2 ]
Yu, Lap-Fai [3 ]
机构
[1] Univ Massachusetts, Boston, MA 02125 USA
[2] Columbia Univ, Adobe Res, New York, NY 10027 USA
[3] George Mason Univ, Fairfax, VA 22030 USA
基金
美国国家科学基金会;
关键词
immersive media; spatial audio; panorama images; virtual reality; augmented reality; VIRTUAL-REALITY; STATISTICS;
D O I
10.1145/3290605.3300851
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
As 360 degrees cameras and virtual reality headsets become more popular, panorama images have become increasingly ubiquitous. While sounds are essential in delivering immersive and interactive user experiences, most panorama images, however, do not come with native audio. In this paper, we propose an automatic algorithm to augment static panorama images through realistic audio assignment. We accomplish this goal through object detection, scene classification, object depth estimation, and audio source placement. We built an audio file database composed of over 500 audio files to facilitate this process. We designed and conducted a user study to verify the efficacy of various components in our pipeline. We run our method on a large variety of panorama images of indoor and outdoor scenes. By analyzing the statistics, we learned the relative importance of these components, which can be used in prioritizing for power-sensitive time-critical tasks like mobile augmented reality (AR) applications.
引用
收藏
页数:11
相关论文
共 50 条
  • [41] A novel temporal and spatial panorama stream processing engine on IoT applications
    Yin, Yifan
    Xu, Boyi
    Cai, Hongming
    Yu, Han
    JOURNAL OF INDUSTRIAL INFORMATION INTEGRATION, 2020, 18
  • [42] Automatic 3D Indoor Scene Modeling from Single Panorama
    Yang, Yang
    Jin, Shi
    Liu, Ruiyang
    Kang, Sing Bing
    Yu, Jingyi
    2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 3926 - 3934
  • [43] Development of remote automatic panorama VR imaging rig systems using smartphones
    Sang-Hyun Lee
    Sang-Joon Lee
    Cluster Computing, 2018, 21 : 1175 - 1185
  • [44] Development of remote automatic panorama VR imaging rig systems using smartphones
    Lee, Sang-Hyun
    Lee, Sang-Joon
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2018, 21 (01): : 1175 - 1185
  • [45] Panorama-Based Multilane Recognition for Advanced Navigation Map Generation
    Yang, Ming
    Gu, Xiaolin
    Lu, Hao
    Wang, Chunxiang
    Ye, Lei
    MATHEMATICAL PROBLEMS IN ENGINEERING, 2015, 2015
  • [46] High quality stereo panorama generation using a 3 camera system
    Yamada, K
    Ichikawa, T
    Naemura, T
    Aizawa, K
    Saito, T
    VISUAL COMMUNICATIONS AND IMAGE PROCESSING 2000, PTS 1-3, 2000, 4067 : 419 - 428
  • [47] Generation of a disparity panorama using a 3-camera capturing system
    Yamada, K
    Ichikawa, T
    Naemura, T
    Aizawa, K
    Saito, T
    2000 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOL II, PROCEEDINGS, 2000, : 772 - 775
  • [48] Semi-automatic methodology for augmented panorama development in industrial outdoor environments
    Gomes, Daniel Lima, Jr.
    Jansen dos Reis, Paulo Roberto
    de Paiva, Anselmo Cardoso
    Silva, Aristofanes Correa
    Braz, Geraldo, Jr.
    de Araujo, Antonio Sergio
    Gattass, Marcelo
    ADVANCES IN ENGINEERING SOFTWARE, 2017, 114 : 282 - 294
  • [49] Gaze-based detection of mind wandering during audio-guided panorama viewing
    Kwok, Tiffany C. K.
    Kiefer, Peter
    Schinazi, Victor R.
    Hoelscher, Christoph
    Raubal, Martin
    SCIENTIFIC REPORTS, 2024, 14 (01):
  • [50] A TCN-based Primary Ambient Extraction in Generating Ambisonics Audio from Panorama Video
    Lv, Zhuliang
    Zhou, Yi
    Liu, Hongqing
    Shu, Xiaofeng
    Zhang, Nannan
    2020 IEEE INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND INFORMATION TECHNOLOGY (ISSPIT 2020), 2020,