VIDEO CODING BASED ON AUDIO-VISUAL ATTENTION

Cited by: 0
Authors
Lee, Jong-Seok [1]
De Simone, Francesca [1]
Ebrahimi, Touradj [1]
Affiliations
[1] Ecole Polytech Fed Lausanne, Multimedia Signal Proc Grp, CH-1015 Lausanne, Switzerland
Keywords
video coding; audio-visual attention; cross-modal interaction; source localization; H.264/AVC; perceived audio-visual quality
DOI
Not available
Chinese Library Classification (CLC) number
TP31 [Computer software]
Discipline classification code
081202; 0835
Abstract
This paper proposes an efficient video coding method based on audio-visual attention, which is motivated by the fact that cross-modal interaction significantly affects humans' perception of multimedia content. First, we propose an audio-visual source localization method to locate the sound source in a video sequence. Then, its result is used for applying spatial blurring to video frames in order to reduce redundant high-frequency information and achieve coding efficiency. We demonstrate the effectiveness of the proposed method for H.264/AVC coding along with the results of a subjective evaluation.
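The abstract gives no details of the localization or blurring steps, so the following is only a minimal illustrative sketch of the general idea, not the authors' method. It assumes the sound-source location is already known, and it uses a plain 3x3 box filter as a stand-in for whatever spatial blur the paper applies. The function names `box_blur_outside_region` and `horizontal_detail`, the `radius` parameter, and the synthetic checkerboard frame are all hypothetical.

```python
def box_blur_outside_region(frame, center, radius):
    """Blur pixels farther than `radius` from `center` with a 3x3 box filter.

    frame  : list of rows of luma values (0-255)
    center : (row, col) of the assumed sound-source location
    radius : pixels within this distance of the center are left untouched
    """
    height, width = len(frame), len(frame[0])
    cy, cx = center
    out = [row[:] for row in frame]
    for y in range(height):
        for x in range(width):
            if (y - cy) ** 2 + (x - cx) ** 2 <= radius ** 2:
                continue  # assumed attended region: keep full detail
            # Average the 3x3 neighbourhood, truncated at the frame border.
            total = 0
            count = 0
            for ny in range(max(0, y - 1), min(height, y + 2)):
                for nx in range(max(0, x - 1), min(width, x + 2)):
                    total += frame[ny][nx]
                    count += 1
            out[y][x] = total / count
    return out


def horizontal_detail(frame):
    """Sum of absolute horizontal differences: a crude proxy for high-frequency content."""
    return sum(abs(row[x + 1] - row[x]) for row in frame for x in range(len(row) - 1))


# Synthetic 8x8 checkerboard frame, chosen because it is rich in high frequencies.
frame = [[255 * ((x + y) % 2) for x in range(8)] for y in range(8)]
blurred = box_blur_outside_region(frame, center=(4, 4), radius=2)
```

In this toy setup the pixels near the assumed source keep their original values, while `horizontal_detail(blurred)` is lower than `horizontal_detail(frame)`. That reduction in detail outside the attended region is the kind of redundancy an encoder such as H.264/AVC could then spend fewer bits on.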
Pages: 57-60
Page count: 4
Related papers
50 in total
  • [1] Efficient video coding based on audio-visual focus of attention
    Lee, Jong-Seok
    De Simone, Francesca
    Ebrahimi, Touradj
    [J]. JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2011, 22 (08) : 704 - 711
  • [2] Attention-Based Audio-Visual Fusion for Video Summarization
    Fang, Yinghong
    Zhang, Junpeng
    Lu, Cewu
    [J]. NEURAL INFORMATION PROCESSING (ICONIP 2019), PT II, 2019, 11954 : 328 - 340
  • [3] Subjective Quality Evaluation of Foveated Video Coding Using Audio-Visual Focus of Attention
    Lee, Jong-Seok
    De Simone, Francesca
    Ebrahimi, Touradj
    [J]. IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2011, 5 (07) : 1322 - 1331
  • [4] Audio-visual and EEG-based Attention Modeling for Extraction of Affective Video Content
    Mehmood, Irfan
    Sajjad, Muhammad
    Baik, Sung Wook
    Rho, Seungmin
    [J]. 2015 INTERNATIONAL CONFERENCE ON PLATFORM TECHNOLOGY AND SERVICE (PLATCON), 2015, : 17 - 18
  • [5] VIDEO AND EDUCATIONAL ROBOTICS: AN INNOVATIVE INTEGRATION OF AUDIO-VISUAL LANGUAGE AND CODING
    Denicolai, Lorenzo
    Grimaldi, Renato
    Palmieri, Silvia
    [J]. INTED2016: 10TH INTERNATIONAL TECHNOLOGY, EDUCATION AND DEVELOPMENT CONFERENCE, 2016, : 2617 - 2624
  • [6] Audio-visual speech processing and attention
    Sams, M
    [J]. PSYCHOPHYSIOLOGY, 2003, 40 : S5 - S6
  • [7] Audio-Visual Salieny Network with Audio Attention Module
    Cheng, Shuaiyang
    Gao, Xing
    Song, Liang
    Xiahou, Jianbing
    [J]. PROCEEDINGS OF 2021 2ND INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND INFORMATION SYSTEMS (ICAIIS '21), 2021,
  • [8] Audio-Visual Fusion Based on Interactive Attention for Person Verification
    Jing, Xuebin
    He, Liang
    Song, Zhida
    Wang, Shaolei
    [J]. SENSORS, 2023, 23 (24)
  • [9] Audio-visual integration during overt visual attention
    Quigley, Cliodhna
    Onat, Selim
    Harding, Sue
    Cooke, Martin
    Koenig, Peter
    [J]. JOURNAL OF EYE MOVEMENT RESEARCH, 2007, 1 (02):
  • [10] Speaker dependent video indexing based on audio-visual interaction
    Tsekeridou, S
    Pitas, I
    [J]. 1998 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING - PROCEEDINGS, VOL 1, 1998, : 358 - 362