XDL: An Industrial Deep Learning Framework for High-dimensional Sparse Data

被引：22

作者：

Jiang, Biye ^{[1
]}

Deng, Chao ^{[1
]}

Yi, Huimin ^{[1
]}

Hu, Zelin ^{[1
]}

Zhou, Guorui ^{[1
]}

Zheng, Yang ^{[1
]}

Huang, Sui ^{[1
]}

Guo, Xinyang ^{[1
]}

Wang, Dongyue ^{[1
]}

Song, Yue ^{[1
]}

Zhao, Liqin ^{[1
]}

Wang, Zhi ^{[1
]}

Sun, Peng ^{[1
]}

Zhang, Yu ^{[1
]}

Zhang, Di ^{[1
]}

Li, Jinhui ^{[1
]}

Xu, Jian ^{[1
]}

Zhu, Xiaoqiang ^{[1
]}

Gai, Kun ^{[1
]}

机构：

[1] Alibaba Inc, Beijing, Peoples R China

来源：

1ST INTERNATIONAL WORKSHOP ON DEEP LEARNING PRACTICE FOR HIGH-DIMENSIONAL SPARSE DATA WITH KDD (DLP-KDD 2019) | 2019年

关键词：

Deep learning; High-dimension sparse data; XDL;

D O I：

10.1145/3326937.3341255

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

With the rapid growth of data and computing power, deep learning based approaches have become the main solution for many artificial intelligence problems such as image classification, speech recognition and computer vision. Several excellent deep learning (DL) frameworks including Tensorflow, MxNet and PyTorch have been made open-sourced, further accelerating the advance of the community. However, existing DL frameworks are not designed for applications involving high-dimensional sparse data, which exists widely in many successful online businesses such as search engine, recommender systems and online advertising. In these industrial scenarios, deep models are typically trained on large scale datasets with up to billions of sparse features and hundreds of billions of samples, bringing great challenges to DL framework. In this paper, we introduce a high-performance, large-scale and distributed DL framework named XDL which provides an elegant solution to fill the gap between general design of existing DL frameworks and industrial requirements arising from high-dimensional sparse data. Since 2016, XDL has been successfully deployed in Alibaba, serving many productions such as online advertising and recommender system. Running on hundreds of GPU cards in parallel, XDL can train deep models with tens of billions parameters within only several hours. Besides its excellent performance and flexibility, XDL is also friendly to developers. Algorithm scientists in Alibaba can develop and deploy new deep models with only several lines of simple codes. The XDL API and a reference implementation were released as an open-source package under the Apache 2.0 license in December, 2018 and are available at https://github.com/alibaba/xdeeplearning.

引用

页数：9

共 50 条

[1] PCA learning for sparse high-dimensional data
Hoyle, DC
Rattray, M
[J]. EUROPHYSICS LETTERS, 2003, 62 (01): : 117 - 123
[2] Similarity Learning for High-Dimensional Sparse Data
Liu, Kuan
Bellet, Aurelien
Sha, Fei
[J]. ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 38, 2015, 38 : 653 - 662
[3] Group Learning for High-Dimensional Sparse Data
Cherkassky, Vladimir
Chen, Hsiang-Han
Shiao, Han-Tai
[J]. 2019 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2019,
[4] Efficient Sparse Representation for Learning With High-Dimensional Data
Chen, Jie
Yang, Shengxiang
Wang, Zhu
Mao, Hua
[J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (08) : 4208 - 4222
[5] International Workshop on Deep Learning Practice for High-Dimensional Sparse Data with RecSys 2023
Tang, Ruiming
Zhu, Xiaoqiang
Ge, Junfeng
Lee, Kuang-chih
Jiang, Biye
Wang, Xingxing
Zhu, Han
Tao, Zhuang
Liu, Weiwen
Kan, Ren
Zhang, Weinan
Zhao, Xiangyu
[J]. PROCEEDINGS OF THE 17TH ACM CONFERENCE ON RECOMMENDER SYSTEMS, RECSYS 2023, 2023, : 1276 - 1280
[6] Sparse Learning of the Disease Severity Score for High-Dimensional Data
Stojkovic, Ivan
Obradovic, Zoran
[J]. COMPLEXITY, 2017,
[7] On the challenges of learning with inference networks on sparse, high-dimensional data
Krishnan, Rahul G.
Liang, Dawen
Hoffman, Matthew D.
[J]. INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 84, 2018, 84
[8] Distributed Learning of Deep Sparse Neural Networks for High-dimensional Classification
Garg, Shweta
Krishnan, R.
Jagannathan, S.
Samaranayake, V. A.
[J]. 2018 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2018, : 1587 - 1592
[9] On the anonymization of sparse high-dimensional data
Ghinita, Gabriel
Tao, Yufei
Kalnis, Panos
[J]. 2008 IEEE 24TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, VOLS 1-3, 2008, : 715 - +
[10] A Deep Reinforcement Learning Framework for High-Dimensional Circuit Linearization
Rong, Chao
Paramesh, Jeyanandh
Carley, L. Richard
[J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2022, 69 (09) : 3665 - 3669

← 1 2 3 4 5 →