An audio-visual saliency model for movie summarization

被引:8
|
作者
Rapantzikos, Konstantinos [1 ]
Evangelopoulos, Georgios [1 ]
Maragos, Petros [1 ]
Avrithis, Yannis [1 ]
机构
[1] Natl Tech Univ Athens, Sch ECE, GR-15773 Athens, Greece
关键词
saliency; saliency curves; attention modeling; event detection; key-frame selection; video summarization; audiovisual;
D O I
10.1109/MMSP.2007.4412882
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A saliency-based method for generating video summaries is presented, which exploits coupled audiovisual information from both media streams. Efficient and advanced speech and image processing algorithms to detect key frames that are acoustically and visually salient are used. Promising results are shown from experiments on a movie database.
引用
收藏
页码:320 / 323
页数:4
相关论文
共 50 条
  • [21] Audio-visual collaborative representation learning for Dynamic Saliency Prediction
    Ning, Hailong
    Zhao, Bin
    Hu, Zhanxuan
    He, Lang
    Pei, Ercheng
    [J]. KNOWLEDGE-BASED SYSTEMS, 2022, 256
  • [22] Audio-visual Encoding of Multimedia Content for Enhancing Movie Recommendations
    Deldjoo, Yashar
    Constantin, Mihai Gabriel
    Eghbal-Zadeh, Hamid
    Ionescu, Bogdan
    Schedl, Markus
    Cremonesi, Paolo
    [J]. 12TH ACM CONFERENCE ON RECOMMENDER SYSTEMS (RECSYS), 2018, : 455 - 459
  • [23] Movie genre classification by exploiting audio-visual features of previews
    Rasheed, Z
    Shah, M
    [J]. 16TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL II, PROCEEDINGS, 2002, : 1086 - 1089
  • [24] YouTube Movie Reviews: Sentiment Analysis in an Audio-Visual Context
    Woellmer, Martin
    Weninger, Felix
    Knaup, Tobias
    Schuller, Bjoern
    Sun, Congkai
    Sagae, Kenji
    Morency, Louis-Philippe
    [J]. IEEE INTELLIGENT SYSTEMS, 2013, 28 (03) : 46 - 53
  • [25] VIDEO EVENT DETECTION AND SUMMARIZATION USING AUDIO, VISUAL AND TEXT SALIENCY
    Evangelopoulos, G.
    Zlatintsi, A.
    Skoumas, G.
    Rapantzikos, K.
    Potamianos, A.
    Maragos, P.
    Avrithis, Y.
    [J]. 2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 3553 - +
  • [26] Multimodal Saliency and Fusion for Movie Summarization Based on Aural, Visual, and Textual Attention
    Evangelopoulos, Georgios
    Zlatintsi, Athanasia
    Potamianos, Alexandros
    Maragos, Petros
    Rapantzikos, Konstantinos
    Skoumas, Georgios
    Avrithis, Yannis
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2013, 15 (07) : 1553 - 1568
  • [27] Summarization of Multiple News Videos Considering the Consistency of Audio-Visual Contents
    Zhang, Ye
    Tanishige, Ryunosuke
    Ide, Ichiro
    Doman, Keisuke
    Kawanishi, Yasutomo
    Deguchi, Daisuke
    Murase, Hiroshi
    [J]. INTERNATIONAL JOURNAL OF SEMANTIC COMPUTING, 2019, 13 (01) : 135 - 155
  • [28] Audio-Visual Saliency Map: Overview, Basic Models and Hardware Implementation
    Ramenahalli, Sudarshan
    Mendat, Daniel R.
    Dura-Bernal, Salvador
    Culurciello, Eugenio
    Niebur, Ernst
    Andreou, Andreas
    [J]. 2013 47TH ANNUAL CONFERENCE ON INFORMATION SCIENCES AND SYSTEMS (CISS), 2013,
  • [29] Towards multimodal saliency detection: an enhancement of audio-visual correlation estimation
    Rodriguez-Hidalgo, Antonio
    Pelaez-Moreno, Carmen
    Gallardo-Antolin, Ascension
    [J]. 2017 IEEE 16TH INTERNATIONAL CONFERENCE ON COGNITIVE INFORMATICS & COGNITIVE COMPUTING (ICCI*CC), 2017, : 438 - 443
  • [30] An Objective Model for Audio-Visual Quality
    Martinez, Helard Becerra
    Farias, Mylene C. Q.
    [J]. IMAGE QUALITY AND SYSTEM PERFORMANCE XI, 2014, 9016