Determining the optimal file size on tertiary storage systems based on the distribution of query sizes

Cited by: 0
Authors
Bernardo, LM [1]
Nordberg, H [1]
Rotem, D [1]
Shoshani, A [1]
Affiliations
[1] Univ Calif Berkeley, Lawrence Berkeley Lab, Sci Data Management Res Grp, NERSC Div, Berkeley, CA 94720 USA
Keywords
DOI
10.1109/SSDM.1998.688108
CLC number
TP [Automation technology, computer technology];
Subject classification code
0812;
Abstract
In tertiary storage systems, data is stored on multiple tape volumes, where each tape is further divided into files. Since in many such systems the minimum unit of data transfer is a file, it is an important problem to match file sizes with the access patterns to the data. In general, if the file size is large relative to the query size, it will lead to the transfer of a large amount of irrelevant data, whereas small file sizes will incur an overhead penalty associated with reading each new file. In this work, we analyze the relationship between file sizes and query response times and provide a methodology to compute the optimal file size given information about the distribution of query sizes. Exact closed-form solutions for the cost function are given for two common distributions.
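The trade-off described in the abstract can be illustrated with a toy cost model. The sketch below is not the paper's cost function; it assumes that each query reads one contiguous range starting at a uniformly random offset within a file, that every touched file is transferred in full, and that each file read pays a fixed overhead. The names `expected_cost`, `best_file_size`, `closed_form_optimum`, `overhead`, and `rate` are hypothetical and chosen only for illustration.

```python
import math


def expected_cost(file_size, query_sizes, overhead, rate):
    """Mean time per query under the assumed toy model (not the paper's).

    A contiguous query of size q with a uniformly random start offset
    touches q / file_size + 1 files on average. Each touched file costs
    a fixed overhead plus the time to transfer the whole file.
    """
    per_file = overhead + file_size / rate
    total = 0.0
    for q in query_sizes:
        files_touched = q / file_size + 1.0
        total += files_touched * per_file
    return total / len(query_sizes)


def best_file_size(candidates, query_sizes, overhead, rate):
    """Pick the candidate file size with the lowest mean cost."""
    return min(
        candidates,
        key=lambda f: expected_cost(f, query_sizes, overhead, rate),
    )


def closed_form_optimum(query_sizes, overhead, rate):
    """Minimizer of the toy cost: sqrt(overhead * rate * mean query size)."""
    mean_q = sum(query_sizes) / len(query_sizes)
    return math.sqrt(overhead * rate * mean_q)


if __name__ == "__main__":
    # Assumed example values: sizes in MB, overhead in seconds, rate in MB/s.
    queries = [100.0, 200.0, 300.0]
    print(best_file_size([25.0, 50.0, 100.0, 200.0, 400.0], queries, 10.0, 5.0))
    print(closed_form_optimum(queries, 10.0, 5.0))
```

Under these assumptions only the mean query size affects the optimum. A model in which the full distribution matters, as the abstract indicates for the paper's analysis, would need a different cost function.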
Pages: 22-31
Page count: 10
Related papers
  • [1] Benchmarking tertiary storage systems with file fragmentation
    Nikolow, D
    Slota, R
    Kitowski, J
    PARALLEL PROCESSING APPLIED MATHEMATICS, 2002, 2328 : 162 - 169
  • [2] File size distribution on UNIX systems - Then and now
    Dept. of Computer Science, Vrije Universiteit, Amsterdam, Netherlands
    Oper Syst Rev ACM, 2006, 1 (100-104):
  • [3] Advantages of Optimal Storage Location and Size on the Economic Dispatch in Distribution Systems
    Bizuayehu, Abebe W.
    Fitiwi, Desta Z.
    Catalao, Joao P. S.
    2016 IEEE POWER AND ENERGY SOCIETY GENERAL MEETING (PESGM), 2016,
  • [4] Optimal File-Distribution in Heterogeneous and Asymmetric Storage Networks
    Langner, Tobias
    Schindelhauer, Christian
    Souza, Alexander
    SOFSEM 2011: THEORY AND PRACTICE OF COMPUTER SCIENCE, 2011, 6543 : 368 - 381
  • [5] Determining Optimal Battery Storage along a Distribution Feeder
    Thakar, Sushrut
    Vittal, Vijay
    Ayyanar, Raja
    Palomino, Ernest
    Brown, Kenneth
    2020 52ND NORTH AMERICAN POWER SYMPOSIUM (NAPS), 2021,
  • [6] A novel method based on fuzzy logic to evaluate the storage and backup systems in determining the optimal size of a hybrid renewable energy system
    Mahmoudi, Sayyed Mostafa
    Maleki, Akbar
    Ochbelagh, Dariush Rezaei
    JOURNAL OF ENERGY STORAGE, 2022, 49
  • [7] Sensitivity-Based Pricing and Optimal Storage Utilization in Distribution Systems
    Sathyanarayana, Bharadwaj R.
    Heydt, Gerald Thomas
    IEEE TRANSACTIONS ON POWER DELIVERY, 2013, 28 (02) : 1073 - 1082
  • [8] A generalized storage model for tertiary storage based systems
    Tikekar, RV
    Fotouhi, F
    Ragan, D
    IDEAS '97 - INTERNATIONAL DATABASE ENGINEERING AND APPLICATIONS SYMPOSIUM, PROCEEDINGS, 1997, : 161 - 170
  • [9] SELECTING OPTIMAL PIPE SIZES FOR WATER DISTRIBUTION-SYSTEMS
    WALSKI, TM
    GESSLER, J
    SJOSTROM, JW
    JOURNAL AMERICAN WATER WORKS ASSOCIATION, 1988, 80 (02): : 35 - 39
  • [10] A simulated annealing based hyperheuristic for determining shipper sizes for storage and transportation
    Dowsland, Kathryn A.
    Soubeiga, Eric
    Burke, Edmund
    EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2007, 179 (03) : 759 - 774