A measurement-based study of big-data movement

被引:0
|
作者
Addanki, Ranjana [1 ]
Maji, Sourav [1 ]
Veeraraghavan, Malathi [1 ]
Tracy, Chris [2 ]
机构
[1] Univ Virginia, Dept Elect & Comp Engn, Charlottesville, VA 22904 USA
[2] LBNL, Energy Sci Network ESnet, Berkeley, CA USA
关键词
Measurements; elephant flows; data movement;
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Parallel TCP connections are used for large scientific dataset transfers to increase throughput. Therefore, to accurately characterize big-data movement, it is important to reconstruct parallel flowsets from traffic measurements. In this work, we start with NetFlow records collected in an operational research-and-education network across which large scientific datasets are moved routinely, reconstruct individual elephant flows from the NetFlow records, and assemble parallel flowsets from elephant flows. Our findings are as follows. The top 1% of flowset sizes were in the hundreds of GBs to low TBs range, 95% of flowsets had rates less than 2.5 Gbps, and 99% of flowsets had durations shorter than 4 hours. Median flowset rate increases and rate variance decreases with increasing number of per-flowset component flows. Such findings are useful for network planning, traffic engineering, and for improving user performance, since large dataset transfers are among the most demanding of network applications.
引用
收藏
页码:445 / 449
页数:5
相关论文
共 50 条
  • [1] Analysis of Big-Data Based Data Mining Engine
    Huang, Xinxin
    Gong, Shu
    [J]. 2017 13TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND SECURITY (CIS), 2017, : 164 - 168
  • [2] Application Study of Big-data Mining Based on Campus Card Platform
    Li, Shanna
    [J]. 2016 2ND INTERNATIONAL CONFERENCE ON FUTURE COMPUTER SUPPORTED EDUCATION (FCSE 2016), 2016, : 58 - 60
  • [3] A Study on Construction CALS Big-Data Service
    Kim, Jinuk
    Kim, Namgon
    [J]. 2018 FIFTH INTERNATIONAL CONFERENCE ON SOCIAL NETWORKS ANALYSIS, MANAGEMENT AND SECURITY (SNAMS), 2018, : 309 - 314
  • [4] Big-Data Visualization
    Keim, Daniel
    Qu, Huamin
    Ma, Kwan-Liu
    [J]. IEEE COMPUTER GRAPHICS AND APPLICATIONS, 2013, 33 (04) : 20 - 21
  • [5] Big-data approaches lead to an increased understanding of the ecology of animal movement
    Nathan, Ran
    Monk, Christopher T.
    Arlinghaus, Robert
    Adam, Timo
    Alos, Josep
    Assaf, Michael
    Baktoft, Henrik
    Beardsworth, Christine E.
    Bertram, Michael G.
    Bijleveld, Allert, I
    Brodin, Tomas
    Brooks, Jill L.
    Campos-Candela, Andrea
    Cooke, Steven J.
    Gjelland, Karl O.
    Gupte, Pratik R.
    Harel, Roi
    Hellstrom, Gustav
    Jeltsch, Florian
    Killen, Shaun S.
    Klefoth, Thomas
    Langrock, Roland
    Lennox, Robert J.
    Lourie, Emmanuel
    Madden, Joah R.
    Orchan, Yotam
    Pauwels, Ine S.
    Riha, Milan
    Roeleke, Manuel
    Schlagel, Ulrike E.
    Shohami, David
    Signer, Johannes
    Toledo, Sivan
    Vilk, Ohad
    Westrelin, Samuel
    Whiteside, Mark A.
    Jaric, Ivan
    [J]. SCIENCE, 2022, 375 (6582) : 734 - +
  • [6] A study of operational cycle of terminal distributed power supply based on Big-data
    Nie, Erbao
    Liu, Zhoubin
    He, Jinhong
    Li, Chao
    [J]. 2017 3RD INTERNATIONAL CONFERENCE ON ENVIRONMENTAL SCIENCE AND MATERIAL APPLICATION (ESMA2017), VOLS 1-4, 2018, 108
  • [7] Big-data based infrastructure management: toward Assetmetrics
    Kobayashi, K.
    Kaito, K.
    [J]. LIFE-CYCLE OF STRUCTURAL SYSTEMS: DESIGN, ASSESSMENT, MAINTENANCE AND MANAGEMENT, 2015, : 70 - 80
  • [8] Analysis of Computer Science Based on Big-Data Mining
    Xuan, Liu
    Chang, Liu
    [J]. 2020 INTERNATIONAL CONFERENCE ON BIG DATA & ARTIFICIAL INTELLIGENCE & SOFTWARE ENGINEERING (ICBASE 2020), 2020, : 94 - 97
  • [9] Big-data platform based on open source ecosystem
    [J]. 1600, Science Press (54):
  • [10] Big-Data analysis used NGS and study of evolution
    Ikeo, Kazuho
    [J]. GENES & GENETIC SYSTEMS, 2015, 90 (06) : 360 - 360