An audio-visual saliency model for movie summarization

Cited by: 8

Authors:

Rapantzikos, Konstantinos [1]

Evangelopoulos, Georgios [1]

Maragos, Petros [1]

Avrithis, Yannis [1]

Affiliation:

[1] Natl Tech Univ Athens, Sch ECE, GR-15773 Athens, Greece

Source:

2007 IEEE NINTH WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING | 2007

Keywords:

saliency; saliency curves; attention modeling; event detection; key-frame selection; video summarization; audiovisual

DOI:

10.1109/MMSP.2007.4412882

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory]

Subject classification codes:

081104; 0812; 0835; 1405

Abstract:

A saliency-based method for generating video summaries is presented, which exploits coupled audiovisual information from both media streams. Efficient and advanced speech and image processing algorithms are used to detect key frames that are acoustically and visually salient. Promising results are shown from experiments on a movie database.


Pages: 320-323

Page count: 4

50 entries in total

[21] Audio-visual collaborative representation learning for Dynamic Saliency Prediction
Ning, Hailong
Zhao, Bin
Hu, Zhanxuan
He, Lang
Pei, Ercheng
[J]. KNOWLEDGE-BASED SYSTEMS, 2022, 256
[22] Audio-visual Encoding of Multimedia Content for Enhancing Movie Recommendations
Deldjoo, Yashar
Constantin, Mihai Gabriel
Eghbal-Zadeh, Hamid
Ionescu, Bogdan
Schedl, Markus
Cremonesi, Paolo
[J]. 12TH ACM CONFERENCE ON RECOMMENDER SYSTEMS (RECSYS), 2018, : 455 - 459
[23] Movie genre classification by exploiting audio-visual features of previews
Rasheed, Z
Shah, M
[J]. 16TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL II, PROCEEDINGS, 2002, : 1086 - 1089
[24] YouTube Movie Reviews: Sentiment Analysis in an Audio-Visual Context
Woellmer, Martin
Weninger, Felix
Knaup, Tobias
Schuller, Bjoern
Sun, Congkai
Sagae, Kenji
Morency, Louis-Philippe
[J]. IEEE INTELLIGENT SYSTEMS, 2013, 28 (03) : 46 - 53
[25] VIDEO EVENT DETECTION AND SUMMARIZATION USING AUDIO, VISUAL AND TEXT SALIENCY
Evangelopoulos, G.
Zlatintsi, A.
Skoumas, G.
Rapantzikos, K.
Potamianos, A.
Maragos, P.
Avrithis, Y.
[J]. 2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 3553 - +
[26] Multimodal Saliency and Fusion for Movie Summarization Based on Aural, Visual, and Textual Attention
Evangelopoulos, Georgios
Zlatintsi, Athanasia
Potamianos, Alexandros
Maragos, Petros
Rapantzikos, Konstantinos
Skoumas, Georgios
Avrithis, Yannis
[J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2013, 15 (07) : 1553 - 1568
[27] Summarization of Multiple News Videos Considering the Consistency of Audio-Visual Contents
Zhang, Ye
Tanishige, Ryunosuke
Ide, Ichiro
Doman, Keisuke
Kawanishi, Yasutomo
Deguchi, Daisuke
Murase, Hiroshi
[J]. INTERNATIONAL JOURNAL OF SEMANTIC COMPUTING, 2019, 13 (01) : 135 - 155
[28] Audio-Visual Saliency Map: Overview, Basic Models and Hardware Implementation
Ramenahalli, Sudarshan
Mendat, Daniel R.
Dura-Bernal, Salvador
Culurciello, Eugenio
Niebur, Ernst
Andreou, Andreas
[J]. 2013 47TH ANNUAL CONFERENCE ON INFORMATION SCIENCES AND SYSTEMS (CISS), 2013,
[29] Towards multimodal saliency detection: an enhancement of audio-visual correlation estimation
Rodriguez-Hidalgo, Antonio
Pelaez-Moreno, Carmen
Gallardo-Antolin, Ascension
[J]. 2017 IEEE 16TH INTERNATIONAL CONFERENCE ON COGNITIVE INFORMATICS & COGNITIVE COMPUTING (ICCI*CC), 2017, : 438 - 443
[30] An Objective Model for Audio-Visual Quality
Martinez, Helard Becerra
Farias, Mylene C. Q.
[J]. IMAGE QUALITY AND SYSTEM PERFORMANCE XI, 2014, 9016
