Attention-based video streaming

被引:8
|
作者
Dikici, Cagatay [1 ]
Bozma, H. Isil [1 ]
机构
[1] Bogazici Univ, Elect Elect Engn Dept, Intelligent Syst Lab, Istanbul, Turkey
关键词
Biologically motivated attentive vision; Foveation; Spatio-temporal pre-processing; Face tracking; Neural networks; Video streaming; FOVEATION; REGION;
D O I
10.1016/j.image.2010.08.002
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This paper considers the problem of video streaming in low bandwidth networks and presents a complete framework that is inspired by the fovea-periphery distinction of biological vision systems. First, an application specific attention function that serves to find the important small regions in a given frame is constructed a priori using a back-propagation neural network that is optimized combinatorially. Given a specific application, the respective attention function partitions each frame into foveal and periphery regions and then a spatial-temporal pre-processing algorithm encodes the foveal regions with high spatial resolution while the periphery regions are encoded with lower spatial and temporal resolution. Finally, the pre-processed video sequence is streamed using a standard streaming server. As an application, we consider the transmission of human face videos. Our experimental results indicate that even with limited amount of training, the constructed attention function is able to determine the foveal regions which have improved transmission quality while the peripheral regions have an acceptable degradation. Crown Copyright (C) 2010 Published by Elsevier B.V. All rights reserved.
引用
收藏
页码:745 / 760
页数:16
相关论文
共 50 条
  • [41] Complex event detection via attention-based video representation and classification
    Zhao, Zhicheng
    Xiang, Rui
    Su, Fei
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2018, 77 (03) : 3209 - 3227
  • [42] Attention-Based Neural Networks for Chroma Intra Prediction in Video Coding
    Blanch, Marc Gorriz
    Blasi, Saverio
    Smeaton, Alan F.
    O'Connor, Noel E.
    Mrak, Marta
    [J]. IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2021, 15 (02) : 366 - 377
  • [43] ECANet: Explicit cyclic attention-based network for video saliency prediction
    Xue, Hao
    Sun, Minghui
    Liang, Yanhua
    [J]. NEUROCOMPUTING, 2022, 468 : 233 - 244
  • [44] Attention-based deep supervised hashing for near duplicate video retrieval
    Shi, Naifei
    Fu, Chong
    Tie, Ming
    Zhang, Wenchao
    Wang, Xingwei
    Sham, Chiu-Wing
    [J]. NEURAL COMPUTING & APPLICATIONS, 2023, 36 (10): : 5217 - 5230
  • [45] A spatiotemporal attention-based method for geo-referenced video coding
    [J]. Feng, Jiangfan, 1600, Science and Engineering Research Support Society (07):
  • [46] Complex event detection via attention-based video representation and classification
    Zhicheng Zhao
    Rui Xiang
    Fei Su
    [J]. Multimedia Tools and Applications, 2018, 77 : 3209 - 3227
  • [47] Attention-Based Adaptive Intra Refresh Method for Robust Video Coding
    Xiaolong Wang
    [J]. Tsinghua Science and Technology, 2012, 17 (01) : 67 - 72
  • [48] An attention-based hybrid deep learning approach for bengali video captioning
    Zaoad, Md. Shahir
    Mannan, M. M. Rushadul
    Mandol, Angshu Bikash
    Rahman, Mostafizur
    Islam, Md Adnanul
    Rahman, Md. Mahbubur
    [J]. JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2023, 35 (01) : 257 - 269
  • [49] Hierarchical Attention-Based Multimodal Fusion Network for Video Emotion Recognition
    Liu, Xiaodong
    Li, Songyang
    Wang, Miao
    [J]. COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2021, 2021
  • [50] Visual attention-based approach for prediction of abnormalities in CCTV video surveillance
    Behera, A.
    Hogg, D.
    Howard, C.
    Gilchrist, I.
    Troscianko, T.
    [J]. PERCEPTION, 2012, 41 (03) : 367 - 367