VIDEO EVENT DETECTION AND SUMMARIZATION USING AUDIO, VISUAL AND TEXT SALIENCY

被引:31
|
作者
Evangelopoulos, G. [1 ]
Zlatintsi, A. [1 ]
Skoumas, G. [2 ]
Rapantzikos, K. [1 ]
Potamianos, A. [2 ]
Maragos, P. [1 ]
Avrithis, Y. [1 ]
机构
[1] Natl Tech Univ Athens, Sch ECE, GR-15773 Athens, Greece
[2] Tech Univ Crete, Dept ECE, Khania EL-73100, Greece
关键词
multimodal saliency; audio; video; text processing; video abstraction; movie summarization; ATTENTION MODEL;
D O I
10.1109/ICASSP.2009.4960393
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Detection of perceptually important video events is formulated here on the basis of saliency models for the audio, visual and textual information conveyed in a video stream. Audio saliency is assessed by cues that quantify multifrequency waveform modulations, extracted through nonlinear operators and energy tracking. Visual saliency is measured through a spatiotemporal attention model driven by intensity, color and motion. Text saliency is extracted from part-of-speech tagging on the subtitles information available with most movie distributions. The various modality curves are integrated in a single attention curve, where the presence of an event may be signified in one or multiple domains. This multimodal saliency curve is the basis of a bottom-up video summarization algorithm, that refines results from unimodal or audiovisual-based skimming. The algorithm performs favorably for video summarization in terms of informativeness and enjoyability.
引用
收藏
页码:3553 / +
页数:2
相关论文
共 50 条
  • [31] A survey on event detection based video summarization for cricket
    Khushali R. Raval
    Mahesh M. Goyani
    Multimedia Tools and Applications, 2022, 81 : 29253 - 29281
  • [32] Event detection and summarization in American football broadcast video
    Li, BX
    Sezan, I
    STORAGE AND RETRIEVAL FOR MEDIA DATABASES 2002, 2002, 4676 : 202 - 213
  • [33] A survey on event detection based video summarization for cricket
    Raval, Khushali R.
    Goyani, Mahesh M.
    MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (20) : 29253 - 29281
  • [34] Saliency Attention Based Abnormal Event Detection in Video
    Huan, Wang
    Guo, Huiwen
    Wu, Xinyu
    2014 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND BIOMIMETICS IEEE-ROBIO 2014, 2014, : 1039 - 1043
  • [35] Visual Saliency Models for Text Detection in Real World
    Gao, Renwu
    Uchida, Seiichi
    Shahab, Asif
    Shafait, Faisal
    Frinken, Volkmar
    PLOS ONE, 2014, 9 (12):
  • [36] Video Saliency Detection Using Motion Saliency Filter
    Luo, Lei
    Jiang, Rongxin
    Tian, Xiang
    Chen, Yaowu
    2013 3RD INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND NETWORK TECHNOLOGY (ICCSNT), 2013, : 1045 - 1049
  • [37] An Audio-video Summarization Scheme Based on Audio and Video Analysis
    Furini, Marco
    Ghini, Vittorio
    2006 3RD IEEE CONSUMER COMMUNICATIONS AND NETWORKING CONFERENCE, VOLS 1-3, 2006, : 1209 - +
  • [38] Major cast detection in video using both audio and visual information
    Zhu, L
    Yao, W
    2001 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-VI, PROCEEDINGS: VOL I: SPEECH PROCESSING 1; VOL II: SPEECH PROCESSING 2 IND TECHNOL TRACK DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS NEURALNETWORKS FOR SIGNAL PROCESSING; VOL III: IMAGE & MULTIDIMENSIONAL SIGNAL PROCESSING MULTIMEDIA SIGNAL PROCESSING, 2001, : 1413 - 1416
  • [39] VASD: Video Action Scene Detection using Audio Visual Data
    Lili, N. A.
    PROCEEDINGS OF THE 2009 INTERNATIONAL CONFERENCE ON COMPUTER TECHNOLOGY AND DEVELOPMENT, VOL 2, 2009, : 303 - 307
  • [40] Automatic Text Summarization of Video Lectures Using Subtitles
    Garg, Shruti
    RECENT DEVELOPMENTS IN INTELLIGENT COMPUTING, COMMUNICATION AND DEVICES, ICCD 2016, 2017, 555 : 45 - 52