Multimodal Video-to-Video Linking: Turning to the Crowd for Insight and Evaluation

Cited by: 0
Authors
Eskevich, Maria [1]
Larson, Martha [1, 2]
Aly, Robin [3]
Sabetghadam, Serwah [4]
Jones, Gareth J. F. [5]
Ordelman, Roeland [3]
Huet, Benoit [6]
Affiliations
[1] Radboud Univ Nijmegen, CLS, Nijmegen, Netherlands
[2] Delft Univ Technol, Delft, Netherlands
[3] Univ Twente, Enschede, Netherlands
[4] TU Vienna, Vienna, Austria
[5] Dublin City Univ, Sch Comp, ADAPT Ctr, Dublin, Ireland
[6] EURECOM, Sophia Antipolis, France
Source
Funding
Science Foundation Ireland;
Keywords
Crowdsourcing; Video-to-video linking; Link evaluation; Verbal-visual information;
DOI
10.1007/978-3-319-51814-5_24
Chinese Library Classification
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
Video-to-video linking systems allow users to explore and exploit the content of a large-scale multimedia collection interactively and without the need to formulate specific queries. We present a short introduction to video-to-video linking (also called 'video hyperlinking'), and describe the latest edition of the Video Hyperlinking (LNK) task at TRECVid 2016. The emphasis of the LNK task in 2016 is on multi-modality as used by videomakers to communicate their intended message. Crowdsourcing makes three critical contributions to the LNK task. First, it allows us to verify the multimodal nature of the anchors (queries) used in the task. Second, it enables us to evaluate the performance of video-to-video linking systems at large scale. Third, it gives us insights into how people understand the relevance relationship between two linked video segments. These insights are valuable since the relationship between video segments can manifest itself at different levels of abstraction.
Pages: 280 - 292
Page count: 13
Related papers
50 entries in total
  • [31] On the Evaluation of Video-Based Crowd Counting Models
    Ledda, Emanuele
    Putzu, Lorenzo
    Delussu, Rita
    Fumera, Giorgio
    Roli, Fabio
    Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2022, 13233 LNCS : 301 - 311
  • [32] On the Evaluation of Video-Based Crowd Counting Models
    Ledda, Emanuele
    Putzu, Lorenzo
    Delussu, Rita
    Fumera, Giorgio
    Roli, Fabio
    IMAGE ANALYSIS AND PROCESSING, ICIAP 2022, PT III, 2022, 13233 : 301 - 311
  • [33] Video Game Design as a Multimodal Heuristic: Turning the Tide of Composition Studies
    Yacono, Candice
    CEA CRITIC, 2021, 83 (01) : 94 - 101
  • [34] Evaluation of a Multimodal Video Annotator for Contemporary Dance
    Cabral, Diogo
    Valente, Joao G.
    Aragao, Urandia
    Fernandes, Carla
    Correia, Nuno
    PROCEEDINGS OF THE INTERNATIONAL WORKING CONFERENCE ON ADVANCED VISUAL INTERFACES, 2012, : 572 - 579
  • [35] MeDM: Mediating Image Diffusion Models for Video-to-Video Translation with Temporal Correspondence Guidance
    Chu, Ernie
    Huang, Tzuhsuan
    Lin, Shuo-Yen
    Chen, Jun-Cheng
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 2, 2024, : 1353 - 1361
  • [36] Video-to-video e-Health applications supporting medical use cases for remote patients
    1600, Springer Science and Business Media, LLC (437):
  • [37] Fast-Vid2Vid: Spatial-Temporal Compression for Video-to-Video Synthesis
    Zhuo, Long
    Wang, Guangcong
    Li, Shikai
    Wu, Wayne
    Liu, Ziwei
    COMPUTER VISION - ECCV 2022, PT XV, 2022, 13675 : 289 - 305
  • [38] 2D Fingertip Localization on Depth Videos Using Paired Video-to-Video Translation
    Farahanipad, Farnaz
    Nasr, Mohammad Sadegh
    Rezaei, Mohammad
    Kamangar, Farhad
    Athitsos, Vassilis
    Huber, Manfred
    ADVANCES IN VISUAL COMPUTING, ISVC 2022, PT II, 2022, 13599 : 381 - 392
  • [39] FastPicker: Adaptive independent two-stage video-to-video summarization for efficient action recognition
    Alfasly, Saghir
    Lu, Jian
    Xu, Chen
    Al-Huda, Zaid
    Jiang, Qingtang
    Lu, Zhaosong
    Chui, Charles K.
    NEUROCOMPUTING, 2023, 516 : 231 - 244
  • [40] Defining Key Performance Indicators for Evaluating the Use of High Definition Video-to-Video Services in eHealth
    Molnar, Andreea
    Weerakkody, Vishanth
    ARTIFICIAL INTELLIGENCE APPLICATIONS AND INNOVATIONS, AIAI 2013, 2013, 412 : 452 - 461