Multimodal Video-to-Video Linking: Turning to the Crowd for Insight and Evaluation

被引：0

作者：

Eskevich, Maria ^{[1
]}

Larson, Martha ^{[1
,2
]}

Aly, Robin ^{[3
]}

Sabetghadam, Serwah ^{[4
]}

Jones, Gareth J. F. ^{[5
]}

Ordelman, Roeland ^{[3
]}

Huet, Benoit ^{[6
]}

机构：

[1] Radboud Univ Nijmegen, CLS, Nijmegen, Netherlands

[2] Delft Univ Technol, Delft, Netherlands

[3] Univ Twente, Enschede, Netherlands

[4] TU Vienna, Vienna, Austria

[5] Dublin City Univ, Sch Comp, ADAPT Ctr, Dublin, Ireland

[6] EURECOM, Sophia Antipolis, France

来源：

MULTIMEDIA MODELING, MMM 2017, PT II | 2017年 / 10133卷

基金：

爱尔兰科学基金会;

关键词：

Crowdsourcing; Video-to-video linking; Link evaluation; Verbal-visual information;

D O I：

10.1007/978-3-319-51814-524

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Video-to-video linking systems allow users to explore and exploit the content of a large-scale multimedia collection interactively and without the need to formulate specific queries. We present a short introduction to video-to-video linking (also called 'video hyperlinking'), and describe the latest edition of the Video Hyperlinking (LNK) task at TRECVid 2016. The emphasis of the LNK task in 2016 is on multi-modality as used by videomakers to communicate their intended message. Crowdsourcing makes three critical contributions to the LNK task. First, it allows us to verify the multimodal nature of the anchors (queries) used in the task. Second, it enables us to evaluate the performance of video-to-video linking systems at large scale. Third, it gives us insights into how people understand the relevance relationship between two linked video segments. These insights are valuable since the relationship between video segments can manifest itself at different levels of abstraction.

引用

页码：280 / 292

页数：13

共 50 条

[41] Crowd detection in video sequences
Reisman, P
Mano, O
Avidan, S
Shashua, A
2004 IEEE INTELLIGENT VEHICLES SYMPOSIUM, 2004, : 66 - 71
[42] An In-Depth Evaluation of Multimodal Video Genre Categorization
Mironica, Ionut
Ionescu, Bogdan
Knees, Peter
Lambert, Patrick
2013 11TH INTERNATIONAL WORKSHOP ON CONTENT-BASED MULTIMEDIA INDEXING (CBMI 2013), 2013, : 11 - 16
[43] Multimodal perception in subjective quality evaluation of compressed video
Ostaszewska-Lizewska, Anna
Kloda, Rafal
Zebrowska-Lucyki, Sabina
ADVANCED MECHATRONICS SOLUTIONS, 2016, 393 : 569 - 574
[44] Multimodal Video Analysis for Crowd Anomaly Detection Using Open Access Tourism Cameras
Dionis-Ros, Alejandro
Vila-Frances, Joan
Magdalena-Benedito, Rafael
Mateo, Fernando
Serrano-Lopez, Antonio J.
APPLIED SCIENCES-BASEL, 2024, 14 (23):
[45] Evaluation of a Video Generation Method Linking Dance and Scenes
Yoshikawa, Yui
Shishido, Hidehiko
Kameda, Yoshinari
Kitahara, Itaru
INTERNATIONAL WORKSHOP ON ADVANCED IMAGING TECHNOLOGY, IWAIT 2023, 2023, 12592
[46] A pioneering video-to-video super-resolution reconstruction algorithm based on segmentation and space-time regularisation
Guo, L.
He, X. H.
Chen, W. L.
Qing, L. B.
Luo, D. S.
IMAGING SCIENCE JOURNAL, 2014, 62 (04): : 236 - 250
[47] Crowd size estimation for video surveillance
Lee, Gwang-Gook
Song, Su Han
Kim, Whoi-Yul
INT CONF ON CYBERNETICS AND INFORMATION TECHNOLOGIES, SYSTEMS AND APPLICATIONS/INT CONF ON COMPUTING, COMMUNICATIONS AND CONTROL TECHNOLOGIES, VOL II, 2007, : 196 - 199
[48] Crowd Behavior Recognition for Video Surveillance
Saxena, Shobhit
Bremond, Francois
Thonnat, Monnique
Ma, Ruihua
ADVANCED CONCEPTS FOR INTELLIGENT VISION SYSTEMS, PROCEEDINGS, 2008, 5259 : 970 - +
[49] Shortcut-V2V: Compression Framework for Video-to-Video Translation based on Temporal Redundancy Reduction
Chung, Chaeyeon
Park, Yeojeong
Choi, Seunghwan
Ganbat, Munkhsoyol
Choo, Jaegul
2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023, : 7578 - 7588
[50] Bilinear dynamics for crowd video analysis
Wu, Shuang
Su, Hang
Yang, Hua
Zheng, Shibao
Fan, Yawen
Zhou, Qin
JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2017, 48 : 461 - 470

← 1 2 3 4 5 →