Multimodal Video-to-Video Linking: Turning to the Crowd for Insight and Evaluation

被引:0
|
作者
Eskevich, Maria [1 ]
Larson, Martha [1 ,2 ]
Aly, Robin [3 ]
Sabetghadam, Serwah [4 ]
Jones, Gareth J. F. [5 ]
Ordelman, Roeland [3 ]
Huet, Benoit [6 ]
机构
[1] Radboud Univ Nijmegen, CLS, Nijmegen, Netherlands
[2] Delft Univ Technol, Delft, Netherlands
[3] Univ Twente, Enschede, Netherlands
[4] TU Vienna, Vienna, Austria
[5] Dublin City Univ, Sch Comp, ADAPT Ctr, Dublin, Ireland
[6] EURECOM, Sophia Antipolis, France
来源
基金
爱尔兰科学基金会;
关键词
Crowdsourcing; Video-to-video linking; Link evaluation; Verbal-visual information;
D O I
10.1007/978-3-319-51814-524
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Video-to-video linking systems allow users to explore and exploit the content of a large-scale multimedia collection interactively and without the need to formulate specific queries. We present a short introduction to video-to-video linking (also called 'video hyperlinking'), and describe the latest edition of the Video Hyperlinking (LNK) task at TRECVid 2016. The emphasis of the LNK task in 2016 is on multi-modality as used by videomakers to communicate their intended message. Crowdsourcing makes three critical contributions to the LNK task. First, it allows us to verify the multimodal nature of the anchors (queries) used in the task. Second, it enables us to evaluate the performance of video-to-video linking systems at large scale. Third, it gives us insights into how people understand the relevance relationship between two linked video segments. These insights are valuable since the relationship between video segments can manifest itself at different levels of abstraction.
引用
收藏
页码:280 / 292
页数:13
相关论文
共 50 条
  • [41] Crowd detection in video sequences
    Reisman, P
    Mano, O
    Avidan, S
    Shashua, A
    2004 IEEE INTELLIGENT VEHICLES SYMPOSIUM, 2004, : 66 - 71
  • [42] An In-Depth Evaluation of Multimodal Video Genre Categorization
    Mironica, Ionut
    Ionescu, Bogdan
    Knees, Peter
    Lambert, Patrick
    2013 11TH INTERNATIONAL WORKSHOP ON CONTENT-BASED MULTIMEDIA INDEXING (CBMI 2013), 2013, : 11 - 16
  • [43] Multimodal perception in subjective quality evaluation of compressed video
    Ostaszewska-Lizewska, Anna
    Kloda, Rafal
    Zebrowska-Lucyki, Sabina
    ADVANCED MECHATRONICS SOLUTIONS, 2016, 393 : 569 - 574
  • [44] Multimodal Video Analysis for Crowd Anomaly Detection Using Open Access Tourism Cameras
    Dionis-Ros, Alejandro
    Vila-Frances, Joan
    Magdalena-Benedito, Rafael
    Mateo, Fernando
    Serrano-Lopez, Antonio J.
    APPLIED SCIENCES-BASEL, 2024, 14 (23):
  • [45] Evaluation of a Video Generation Method Linking Dance and Scenes
    Yoshikawa, Yui
    Shishido, Hidehiko
    Kameda, Yoshinari
    Kitahara, Itaru
    INTERNATIONAL WORKSHOP ON ADVANCED IMAGING TECHNOLOGY, IWAIT 2023, 2023, 12592
  • [46] A pioneering video-to-video super-resolution reconstruction algorithm based on segmentation and space-time regularisation
    Guo, L.
    He, X. H.
    Chen, W. L.
    Qing, L. B.
    Luo, D. S.
    IMAGING SCIENCE JOURNAL, 2014, 62 (04): : 236 - 250
  • [47] Crowd size estimation for video surveillance
    Lee, Gwang-Gook
    Song, Su Han
    Kim, Whoi-Yul
    INT CONF ON CYBERNETICS AND INFORMATION TECHNOLOGIES, SYSTEMS AND APPLICATIONS/INT CONF ON COMPUTING, COMMUNICATIONS AND CONTROL TECHNOLOGIES, VOL II, 2007, : 196 - 199
  • [48] Crowd Behavior Recognition for Video Surveillance
    Saxena, Shobhit
    Bremond, Francois
    Thonnat, Monnique
    Ma, Ruihua
    ADVANCED CONCEPTS FOR INTELLIGENT VISION SYSTEMS, PROCEEDINGS, 2008, 5259 : 970 - +
  • [49] Shortcut-V2V: Compression Framework for Video-to-Video Translation based on Temporal Redundancy Reduction
    Chung, Chaeyeon
    Park, Yeojeong
    Choi, Seunghwan
    Ganbat, Munkhsoyol
    Choo, Jaegul
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023, : 7578 - 7588
  • [50] Bilinear dynamics for crowd video analysis
    Wu, Shuang
    Su, Hang
    Yang, Hua
    Zheng, Shibao
    Fan, Yawen
    Zhou, Qin
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2017, 48 : 461 - 470