Large-Scale Training Framework for Video Annotation

Authors
Hwang, Seong Jae [1, 2]
Lee, Joonseok [2 ]
Varadarajan, Balakrishnan [2 ]
Gordon, Ariel [2 ]
Xu, Zheng [2 ]
Natsev, Apostol [2 ]
Affiliations
[1] Univ Wisconsin, Madison, WI 53706 USA
[2] Google Res, Mountain View, CA USA
Keywords
Scalability; Distributed framework; Video annotation; MapReduce
DOI
10.1145/3292500.3330653
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Video is one of the richest sources of information available online, but extracting deep insights from video content at internet scale is still an open problem, in terms of both the depth and the breadth of understanding, as well as scale. Over the last few years, the field of video understanding has made great strides due to the availability of large-scale video datasets and core advances in image, audio, and video modeling architectures. However, the state-of-the-art architectures on small-scale datasets are frequently impractical to deploy at internet scale, both in terms of the ability to train such deep networks on hundreds of millions of videos and to deploy them for inference on billions of videos. In this paper, we present a MapReduce-based training framework that exploits both data parallelism and model parallelism to scale the training of complex video models. The proposed framework uses alternating optimization and full-batch fine-tuning, and supports large Mixture-of-Experts classifiers with hundreds of thousands of mixtures. This enables a trade-off between model depth and breadth, and the ability to shift model capacity between shared (generalization) layers and per-class (specialization) layers. We demonstrate that the proposed framework reaches state-of-the-art performance on the largest public video datasets, YouTube-8M and Sports-1M, and can scale to datasets 100 times larger.
Pages: 2394-2402
Page count: 9
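
The abstract mentions MapReduce-style data parallelism combined with full-batch updates. The sketch below is only an illustration of that general idea, not the authors' implementation: it assumes a tiny binary logistic-regression model as a stand-in for a video classifier, runs the "workers" as ordinary in-process function calls, and uses hypothetical names (`map_shard`, `reduce_pair`, `full_batch_step`) that do not come from the paper. Each map call computes a partial gradient on one data shard, the reduce step sums the partial results, and a single update is applied using the gradient over the full dataset.

```python
import math
from functools import reduce


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def map_shard(weights, shard):
    """Map step (hypothetical): gradient sum and example count for one shard."""
    grad = [0.0] * len(weights)
    for features, label in shard:
        pred = sigmoid(sum(w * x for w, x in zip(weights, features)))
        for i, x in enumerate(features):
            grad[i] += (pred - label) * x
    return grad, len(shard)


def reduce_pair(a, b):
    """Reduce step (hypothetical): add two partial (gradient, count) results."""
    return [ga + gb for ga, gb in zip(a[0], b[0])], a[1] + b[1]


def full_batch_step(weights, shards, lr=0.5):
    """One full-batch gradient step built from per-shard partial gradients."""
    grad, n = reduce(reduce_pair, (map_shard(weights, s) for s in shards))
    return [w - lr * g / n for w, g in zip(weights, grad)]


def mean_loss(weights, shards):
    """Mean logistic loss over all shards, used only to monitor progress."""
    total, n = 0.0, 0
    for shard in shards:
        for features, label in shard:
            p = sigmoid(sum(w * x for w, x in zip(weights, features)))
            total -= label * math.log(p) + (1 - label) * math.log(1 - p)
            n += 1
    return total / n


# Toy data split into two shards; the first feature is a constant bias term.
shards = [
    [([1.0, 2.0], 1), ([1.0, -1.0], 0)],
    [([1.0, 1.5], 1), ([1.0, -2.0], 0)],
]
weights = [0.0, 0.0]
initial_loss = mean_loss(weights, shards)
for _ in range(20):
    weights = full_batch_step(weights, shards)
final_loss = mean_loss(weights, shards)
```

Because the reduce step only adds partial sums, the aggregated gradient matches what a single machine would compute over the whole dataset, up to floating-point rounding. That property is what lets a full-batch update be spread across many shards in a sketch like this one.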