XDL: An Industrial Deep Learning Framework for High-dimensional Sparse Data

被引:22
|
作者
Jiang, Biye [1 ]
Deng, Chao [1 ]
Yi, Huimin [1 ]
Hu, Zelin [1 ]
Zhou, Guorui [1 ]
Zheng, Yang [1 ]
Huang, Sui [1 ]
Guo, Xinyang [1 ]
Wang, Dongyue [1 ]
Song, Yue [1 ]
Zhao, Liqin [1 ]
Wang, Zhi [1 ]
Sun, Peng [1 ]
Zhang, Yu [1 ]
Zhang, Di [1 ]
Li, Jinhui [1 ]
Xu, Jian [1 ]
Zhu, Xiaoqiang [1 ]
Gai, Kun [1 ]
机构
[1] Alibaba Inc, Beijing, Peoples R China
关键词
Deep learning; High-dimension sparse data; XDL;
D O I
10.1145/3326937.3341255
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
With the rapid growth of data and computing power, deep learning based approaches have become the main solution for many artificial intelligence problems such as image classification, speech recognition and computer vision. Several excellent deep learning (DL) frameworks including Tensorflow, MxNet and PyTorch have been made open-sourced, further accelerating the advance of the community. However, existing DL frameworks are not designed for applications involving high-dimensional sparse data, which exists widely in many successful online businesses such as search engine, recommender systems and online advertising. In these industrial scenarios, deep models are typically trained on large scale datasets with up to billions of sparse features and hundreds of billions of samples, bringing great challenges to DL framework. In this paper, we introduce a high-performance, large-scale and distributed DL framework named XDL which provides an elegant solution to fill the gap between general design of existing DL frameworks and industrial requirements arising from high-dimensional sparse data. Since 2016, XDL has been successfully deployed in Alibaba, serving many productions such as online advertising and recommender system. Running on hundreds of GPU cards in parallel, XDL can train deep models with tens of billions parameters within only several hours. Besides its excellent performance and flexibility, XDL is also friendly to developers. Algorithm scientists in Alibaba can develop and deploy new deep models with only several lines of simple codes. The XDL API and a reference implementation were released as an open-source package under the Apache 2.0 license in December, 2018 and are available at https://github.com/alibaba/xdeeplearning.
引用
收藏
页数:9
相关论文
共 50 条
  • [1] PCA learning for sparse high-dimensional data
    Hoyle, DC
    Rattray, M
    [J]. EUROPHYSICS LETTERS, 2003, 62 (01): : 117 - 123
  • [2] Similarity Learning for High-Dimensional Sparse Data
    Liu, Kuan
    Bellet, Aurelien
    Sha, Fei
    [J]. ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 38, 2015, 38 : 653 - 662
  • [3] Group Learning for High-Dimensional Sparse Data
    Cherkassky, Vladimir
    Chen, Hsiang-Han
    Shiao, Han-Tai
    [J]. 2019 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2019,
  • [4] Efficient Sparse Representation for Learning With High-Dimensional Data
    Chen, Jie
    Yang, Shengxiang
    Wang, Zhu
    Mao, Hua
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (08) : 4208 - 4222
  • [5] International Workshop on Deep Learning Practice for High-Dimensional Sparse Data with RecSys 2023
    Tang, Ruiming
    Zhu, Xiaoqiang
    Ge, Junfeng
    Lee, Kuang-chih
    Jiang, Biye
    Wang, Xingxing
    Zhu, Han
    Tao, Zhuang
    Liu, Weiwen
    Kan, Ren
    Zhang, Weinan
    Zhao, Xiangyu
    [J]. PROCEEDINGS OF THE 17TH ACM CONFERENCE ON RECOMMENDER SYSTEMS, RECSYS 2023, 2023, : 1276 - 1280
  • [6] Sparse Learning of the Disease Severity Score for High-Dimensional Data
    Stojkovic, Ivan
    Obradovic, Zoran
    [J]. COMPLEXITY, 2017,
  • [7] On the challenges of learning with inference networks on sparse, high-dimensional data
    Krishnan, Rahul G.
    Liang, Dawen
    Hoffman, Matthew D.
    [J]. INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 84, 2018, 84
  • [8] Distributed Learning of Deep Sparse Neural Networks for High-dimensional Classification
    Garg, Shweta
    Krishnan, R.
    Jagannathan, S.
    Samaranayake, V. A.
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2018, : 1587 - 1592
  • [9] On the anonymization of sparse high-dimensional data
    Ghinita, Gabriel
    Tao, Yufei
    Kalnis, Panos
    [J]. 2008 IEEE 24TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, VOLS 1-3, 2008, : 715 - +
  • [10] A Deep Reinforcement Learning Framework for High-Dimensional Circuit Linearization
    Rong, Chao
    Paramesh, Jeyanandh
    Carley, L. Richard
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2022, 69 (09) : 3665 - 3669