Traffic Control for Data Center Network: State of the Art and Future Research

被引:0
|
作者
Du X.-L. [1 ,2 ]
Xu K. [1 ,2 ,3 ]
Li T. [4 ]
Zheng K. [4 ]
Fu S.-T. [1 ,2 ]
Shen M. [5 ]
机构
[1] Department of Computer Science and Technology, Tsinghua University, Beijing
[2] Beijing National Research Center for Information Science and Technology, Beijing
[3] Peng Cheng Laboratory, Shenzhen
[4] 2012 Labs, Huawei Technology Co. Ltd., Beijing
[5] School of Computer, Beijing Institute of Technology, Beijing
来源
基金
中国国家自然科学基金;
关键词
Congestion control; Data center; Flow schedule; Load balance; Traffic control; Traffic engineering;
D O I
10.11897/SP.J.1016.2021.01287
中图分类号
学科分类号
摘要
As a strong foundation for the rapid storage and efficient processing of massive data, the data center has become a hot spot in academia and industry in recent years. Traditional TCP is difficult to meet the demand for data center transmission in high throughput, low latency and loss-free aspects. Based on the comparison between the traditional TCP design target and the transmission target in the data center network, this paper summarizes the research status of data center traffic control. Traffic control refers to the control of traffic rates and sending rules. Therefore, this paper introduces the traffic control technology based on congestion control and traffic engineering and makes a comparative analysis of the above technology from the aspects of control mechanism, expansibility and technical feasibility. Finally, this paper summarizes and looks forward to the future research trend of data center traffic control technology. According to the existing researches, we find that: (1) Considering the cost and performance, the most suitable traffic control algorithm for TCP/IP data center is DCTCP, and the most suitable traffic control algorithm for RDMA data center is DCQCN. Other researches require expensive custom hardware, which is difficult to deploy. (2) The traffic control technology is a technology of fair utilization of limited resources. Therefore, the performance of the technology can be improved by acquiring more relevant information or exchanging with other resources. E.g. ECN, RTT, traffic size, flow deadline. (3) Among the three main research points of congestion control, flow scheduling and load balancing, the mainstream algorithm system only focuses on one or two of them. (4) Smart NICs and programmable switches are widely used in the research of the data center network. The programmability of smart devices can bring new features to new technologies. In the end, the research directions are prospected. (1) A unified flow control test platform is needed. Different algorithms use different test environments, so it is difficult to evaluate them together. (2) Congestion control, flow scheduling, and load balancing studies need to be considered together. (3) Traditional distributed traffic control cannot be accurately scheduled due to insufficient information. As data centers grow in size, centralized controllers become network bottlenecks. The tradeoff between centralized and distributed control requires careful consideration. (4) High-performance programmable smart devices need to be developed and deployed. RDMA has become a hot topic in industry and academia. At the same time, programmable network devices greatly enhance the flexibility and rapid deployment of the network. (5) Traffic control design for specific application scenarios. The performance of the algorithm is improved by acquiring more relevant information or by exchanging other related resources. More resources are available in specific application scenarios. (6) With its strong self-adaptability and self-learning ability, artificial intelligence provides a set of effective decision-making tools for various research fields. The combination of artificial intelligence technology and network transmission technology is also a hot topic in the future. In summary, with the in-depth study of the data center, traffic control will become the most important basic performance tool for the data center, especially for the future high throughput, low latency requirements. © 2021, Science Press. All right reserved.
引用
收藏
页码:1287 / 1309
页数:22
相关论文
共 74 条
  • [1] Greenberg A, Hamilton J R, Jain N, Et al., VL2: A scalable and flexible data center network, Proceedings of the Special Interest Group on Data Communication(SIGCOMM), pp. 95-104, (2009)
  • [2] Guo C., DCell: A scalable and fault-tolerant network structure for data centers, Proceedings of the Special Interest Group on Data Communication(SIGCOMM), pp. 75-86, (2008)
  • [3] Zhang Yuchao, Xu Ke, Wang Haiyang, Et al., Going fast and fair: Latency optimization for cloud-based service chains, IEEE Network, 32, 2, pp. 138-143, (2017)
  • [4] Al-Fares M, Loukissas A, Vahdat A., A scalable, commodity data center network architecture, Proceedings of the Special Interest Group on Data Communication(SIGCOMM), pp. 63-74, (2008)
  • [5] Alizadeh M, Greenberg A, Maltz D A, Et al., Data center TCP(DCTCP), Proceedings of the Special Interest Group on Data Communication(SIGCOMM), pp. 63-74, (2010)
  • [6] Shen M, Liu H, Xu K, Et al., Routing on demand: Toward the energy-aware traffic engineering with OSPF, Proceedings of the International Conference on Research in Networking, pp. 232-246, (2012)
  • [7] Xu Ke, Shen Meng, Liu Hongying, Et al., Achieving optimal traffic engineering using a generalized routing framework, IEEE Transactions on Parallel and Distributed Systems, 27, 1, pp. 51-65, (2016)
  • [8] Shen Meng, Wei Mingwei, Zhu Liehuang, Et al., Classification of encrypted traffic with second-order Markov chains and application attribute Bigrams, IEEE Transactions on Information Forensics and Security, 12, 8, pp. 1830-1843, (2017)
  • [9] Zhang Y, Ansari N., On architecture design, congestion notification, TCP Incast and power consumption in data centers, IEEE Communication Surveys & Tutorials, 15, 1, pp. 39-64, (2013)
  • [10] Li Dan, Chen Gui-Hai, Ren Feng-Yuan, Et al., Data center network research progress and trends, Chinese Journal of Computers, 37, 2, pp. 259-274, (2014)