VIDEO EVENT DETECTION AND SUMMARIZATION USING AUDIO, VISUAL AND TEXT SALIENCY

被引:31
|
作者
Evangelopoulos, G. [1 ]
Zlatintsi, A. [1 ]
Skoumas, G. [2 ]
Rapantzikos, K. [1 ]
Potamianos, A. [2 ]
Maragos, P. [1 ]
Avrithis, Y. [1 ]
机构
[1] Natl Tech Univ Athens, Sch ECE, GR-15773 Athens, Greece
[2] Tech Univ Crete, Dept ECE, Khania EL-73100, Greece
关键词
multimodal saliency; audio; video; text processing; video abstraction; movie summarization; ATTENTION MODEL;
D O I
10.1109/ICASSP.2009.4960393
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Detection of perceptually important video events is formulated here on the basis of saliency models for the audio, visual and textual information conveyed in a video stream. Audio saliency is assessed by cues that quantify multifrequency waveform modulations, extracted through nonlinear operators and energy tracking. Visual saliency is measured through a spatiotemporal attention model driven by intensity, color and motion. Text saliency is extracted from part-of-speech tagging on the subtitles information available with most movie distributions. The various modality curves are integrated in a single attention curve, where the presence of an event may be signified in one or multiple domains. This multimodal saliency curve is the basis of a bottom-up video summarization algorithm, that refines results from unimodal or audiovisual-based skimming. The algorithm performs favorably for video summarization in terms of informativeness and enjoyability.
引用
收藏
页码:3553 / +
页数:2
相关论文
共 50 条
  • [1] AUDIO SALIENT EVENT DETECTION AND SUMMARIZATION USING AUDIO AND TEXT MODALITIES
    Zlatintsi, Athanasia
    Iosif, Elias
    Maragos, Petros
    Potamianos, Alexandros
    2015 23RD EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2015, : 2311 - 2315
  • [2] A SALIENCY-BASED APPROACH TO AUDIO EVENT DETECTION AND SUMMARIZATION
    Zlatintsi, A.
    Maragos, P.
    Potamianos, A.
    Evangelopoulos, G.
    2012 PROCEEDINGS OF THE 20TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2012, : 1294 - 1298
  • [3] An audio-visual saliency model for movie summarization
    Rapantzikos, Konstantinos
    Evangelopoulos, Georgios
    Maragos, Petros
    Avrithis, Yannis
    2007 IEEE NINTH WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING, 2007, : 320 - 323
  • [4] VISUAL SALIENCY DETECTION USING VIDEO DECOMPOSITION
    Bhattacharya, Saumik
    Gupta, Sumana
    Venkatesh, K. S.
    2016 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2016, : 684 - 688
  • [5] Event Detection on Roads Using Perceptual Video Summarization
    Thomas, Sinnu Susan
    Gupta, Sumana
    Subramanian, Venkatesh K.
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2018, 19 (09) : 2944 - 2954
  • [6] Enhanced On-Device Video Summarization Using Audio and Visual Features
    Nagaraju, Lokesh Kumar Thandaga
    Ranjitha, B.
    Shaik, Jani Basha
    COMPUTER VISION AND IMAGE PROCESSING, CVIP 2023, PT I, 2024, 2009 : 86 - 98
  • [7] Discovering joint audio–visual codewords for video event detection
    I-Hong Jhuo
    Guangnan Ye
    Shenghua Gao
    Dong Liu
    Yu-Gang Jiang
    D. T. Lee
    Shih-Fu Chang
    Machine Vision and Applications, 2014, 25 : 33 - 47
  • [8] A audio-visual model for efficient video summarization
    El-Nagar, Gamal
    El-Sawy, Ahmed
    Rashad, Metwally
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2024, 100
  • [9] Event detection and summarization in sports video
    Li, BX
    Sezan, MI
    IEEE WORKSHOP ON CONTENT-BASED ACCESS OF IMAGE AND VIDEO LIBRARIES, PROCEEDINGS, 2001, : 132 - 138
  • [10] AUTOMATIC CONSUMER VIDEO SUMMARIZATION BY AUDIO AND VISUAL ANALYSIS
    Jiang, Wei
    Cotton, Courtenay
    Loui, Alexander C.
    2011 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2011,