The 2nd YouTube-8M Large-Scale Video Understanding Challenge

被引：9

作者：

Lee, Joonseok ^{[1
]}

Natsev, Apostol ^{[1
]}

Reade, Walter ^{[1
]}

Sukthankar, Rahul ^{[1
]}

Toderici, George ^{[1
]}

机构：

[1] Google Res, Mountain View, CA 94043 USA

来源：

COMPUTER VISION - ECCV 2018 WORKSHOPS, PT IV | 2019年 / 11132卷

关键词：

YouTube; Video Classification; Video Understanding;

D O I：

10.1007/978-3-030-11018-5_18

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

We hosted the 2nd YouTube-8M Large-Scale Video Understanding Kaggle Challenge and Workshop at ECCV'18, with the task of classifying videos from frame-level and video-level audio-visual features. In this year's challenge, we restricted the final model size to 1GB or less, encouraging participants to explore representation learning or better architecture, instead of heavy ensembles of multiple models. In this paper, we briefly introduce the YouTube-8M dataset and challenge task, followed by participants statistics and result analysis. We summarize proposed ideas by participants, including architectures, temporal aggregation methods, ensembling and distillation, data augmentation, and more.

引用

页码：193 / 205

页数：13

共 50 条

[21] LARGE-SCALE NEAR-DUPLICATE WEB VIDEO SEARCH: CHALLENGE AND OPPORTUNITY
Zhao, Wan-Lei
Tan, Song
Ngo, Chong-Wah
ICME: 2009 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOLS 1-3, 2009, : 1624 - 1627
[22] Student cultural diversity: Understanding and meeting the challenge, 2nd edition
Carrington, V
JOURNAL OF ADOLESCENT & ADULT LITERACY, 2000, 43 (04) : 386 - 387
[23] EVALUATION OF THE CHARACTERISTIC FEATURES OF A LARGE-SCALE TURBULENCE FIELD (2ND REPORT, ON THE STATISTICAL QUANTITIES OF THE TURBULENCE).
Makita, Hideharu
Sassa, Koji
Iwasaki, Takao
Iida, Akiyoshi
Nippon Kikai Gakkai Ronbunshu, B Hen/Transactions of the Japan Society of Mechanical Engineers, Part B, 1987, 53 (495): : 3180 - 3186
[24] Large-scale 2nd to 3rd century AD bloomery iron smelting in Korea
Park, Jang-Sik
Rehren, Thilo
JOURNAL OF ARCHAEOLOGICAL SCIENCE, 2011, 38 (06) : 1180 - 1190
[25] A study on smooth surface reconstruction from large-scale noisy point-clouds (2nd report) - Streaming processing for smoothing large-scale point-clouds
Masuda H.
Murakami K.
Seimitsu Kogaku Kaishi/Journal of the Japan Society for Precision Engineering, 2010, 76 (06): : 689 - 693
[26] Teaching Machines to Understand Baseball Games: Large-Scale Baseball Video Database for Multiple Video Understanding Tasks
Shim, Minho
Kim, Young Hwi
Kim, Kyungmin
Kim, Seon Joo
COMPUTER VISION - ECCV 2018, PT 15, 2018, 11219 : 420 - 437
[27] Proceedings of the 2nd KDD Workshop on Large-Scale Recommender Systems and the Netflix Prize Competition, NETFLIX '08: Foreword
Bennett, Jim
Elkan, Charles
Koren, Yehuda
Lemire, Daniel
Tuzhilin, Alex
Proceedings of the 2nd KDD Workshop on Large-Scale Recommender Systems and the Netflix Prize Competition, NETFLIX '08, 2008,
[28] THE CHALLENGE OF AGING - A MULTIDISCIPLINARY APPROACH TO EXTENDED CARE, 2ND EDITION - SHAW,M
LEMAY, A
INTERNATIONAL JOURNAL OF NURSING STUDIES, 1993, 30 (05) : 465 - 465
[29] 2MASS constraints on the local large-scale structure:: a challenge to ΛCDM?
Frith, WJ
Shanks, T
Outram, PJ
MONTHLY NOTICES OF THE ROYAL ASTRONOMICAL SOCIETY, 2005, 361 (02) : 701 - 709
[30] NONLINEAR - A GUIDE TO ELECTRONIC FILM AND VIDEO EDITING, 2ND EDITION - RUBIN,M
BOWERS, RA
CD-ROM PROFESSIONAL, 1995, 8 (08): : 114 - 115

← 1 2 3 4 5 →