Multimodal Video-to-Video Linking: Turning to the Crowd for Insight and Evaluation

被引：0

作者：

Eskevich, Maria ^{[1
]}

Larson, Martha ^{[1
,2
]}

Aly, Robin ^{[3
]}

Sabetghadam, Serwah ^{[4
]}

Jones, Gareth J. F. ^{[5
]}

Ordelman, Roeland ^{[3
]}

Huet, Benoit ^{[6
]}

机构：

[1] Radboud Univ Nijmegen, CLS, Nijmegen, Netherlands

[2] Delft Univ Technol, Delft, Netherlands

[3] Univ Twente, Enschede, Netherlands

[4] TU Vienna, Vienna, Austria

[5] Dublin City Univ, Sch Comp, ADAPT Ctr, Dublin, Ireland

[6] EURECOM, Sophia Antipolis, France

来源：

MULTIMEDIA MODELING, MMM 2017, PT II | 2017年 / 10133卷

基金：

爱尔兰科学基金会;

关键词：

Crowdsourcing; Video-to-video linking; Link evaluation; Verbal-visual information;

D O I：

10.1007/978-3-319-51814-524

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Video-to-video linking systems allow users to explore and exploit the content of a large-scale multimedia collection interactively and without the need to formulate specific queries. We present a short introduction to video-to-video linking (also called 'video hyperlinking'), and describe the latest edition of the Video Hyperlinking (LNK) task at TRECVid 2016. The emphasis of the LNK task in 2016 is on multi-modality as used by videomakers to communicate their intended message. Crowdsourcing makes three critical contributions to the LNK task. First, it allows us to verify the multimodal nature of the anchors (queries) used in the task. Second, it enables us to evaluate the performance of video-to-video linking systems at large scale. Third, it gives us insights into how people understand the relevance relationship between two linked video segments. These insights are valuable since the relationship between video segments can manifest itself at different levels of abstraction.

引用

页码：280 / 292

页数：13

共 50 条

[31] On the Evaluation of Video-Based Crowd Counting Models
Ledda, Emanuele
Putzu, Lorenzo
Delussu, Rita
Fumera, Giorgio
Roli, Fabio
Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2022, 13233 LNCS : 301 - 311
[32] On the Evaluation of Video-Based Crowd Counting Models
Ledda, Emanuele
Putzu, Lorenzo
Delussu, Rita
Fumera, Giorgio
Roli, Fabio
IMAGE ANALYSIS AND PROCESSING, ICIAP 2022, PT III, 2022, 13233 : 301 - 311
[33] Video Game Design as a Multimodal Heuristic: Turning the Tide of Composition Studies
Yacono, Candice
CEA CRITIC, 2021, 83 (01) : 94 - 101
[34] Evaluation of a Multimodal Video Annotator for Contemporary Dance
Cabral, Diogo
Valente, Joao G.
Aragao, Urandia
Fernandes, Carla
Correia, Nuno
PROCEEDINGS OF THE INTERNATIONAL WORKING CONFERENCE ON ADVANCED VISUAL INTERFACES, 2012, : 572 - 579
[35] MeDM: Mediating Image Diffusion Models for Video-to-Video Translation with Temporal Correspondence Guidance
Chu, Ernie
Huang, Tzuhsuan
Lin, Shuo-Yen
Chen, Jun-Cheng
THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 2, 2024, : 1353 - 1361
[36] Video-to-video e-Health applications supporting medical use cases for remote patients
1600, Springer Science and Business Media, LLC (437):
[37] Fast-Vid2Vid: Spatial-Temporal Compression for Video-to-Video Synthesis
Zhuo, Long
Wang, Guangcong
Li, Shikai
Wu, Wayne
Liu, Ziwei
COMPUTER VISION - ECCV 2022, PT XV, 2022, 13675 : 289 - 305
[38] 2D Fingertip Localization on Depth Videos Using Paired Video-to-Video Translation
Farahanipad, Farnaz
Nasr, Mohammad Sadegh
Rezaei, Mohammad
Kamangar, Farhad
Athitsos, Vassilis
Huber, Manfred
ADVANCES IN VISUAL COMPUTING, ISVC 2022, PT II, 2022, 13599 : 381 - 392
[39] FastPicker: Adaptive independent two-stage video-to-video summarization for efficient action recognition
Alfasly, Saghir
Lu, Jian
Xu, Chen
Al-Huda, Zaid
Jiang, Qingtang
Lu, Zhaosong
Chui, Charles K.
NEUROCOMPUTING, 2023, 516 : 231 - 244
[40] Defining Key Performance Indicators for Evaluating the Use of High Definition Video-to-Video Services in eHealth
Molnar, Andreea
Weerakkody, Vishanth
ARTIFICIAL INTELLIGENCE APPLICATIONS AND INNOVATIONS, AIAI 2013, 2013, 412 : 452 - 461

← 1 2 3 4 5 →