Determining the optimal file size on tertiary storage systems based on the distribution of query sizes

被引:0
|
作者
Bernardo, LM [1 ]
Nordberg, H [1 ]
Rotem, D [1 ]
Shoshani, A [1 ]
机构
[1] Univ Calif Berkeley, Lawrence Berkeley Lab, Sci Data Management Res Grp, NERSC Div, Berkeley, CA 94720 USA
关键词
D O I
10.1109/SSDM.1998.688108
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In tertiary storage systems, the data is stored on multiple tape volumes where each tape is further divided into files. Since in many such systems the minimum unit of data transfer is a file, it is an important problem to march file sizes with the access patterns to the data. In general, if the file size is large relative to the query size it will lend to the transfer of large amount of irrelevant data whereas small file sizes will incur an overhead penalty associated with reading each new file. In this work, we analyze the relationship between file sizes and query response times and provide a methodology to compute the optimal file size given information about the distribution of query sizes. Exact closed form solutions for the cost function are given for two common distributions.
引用
收藏
页码:22 / 31
页数:10
相关论文
共 50 条
  • [21] General methods for determining the droplet size distribution in emulsion systems
    Ambrosone, L
    Ceglie, A
    Colafemmina, G
    Palazzo, G
    JOURNAL OF CHEMICAL PHYSICS, 1999, 110 (02): : 797 - 804
  • [22] Fountain Code Based Cloud Storage Mechanism For Optimal File Retrieval Delay
    Janet, J.
    Balakrishnan, S.
    Somasekhara, Kesani
    2016 INTERNATIONAL CONFERENCE ON INFORMATION COMMUNICATION AND EMBEDDED SYSTEMS (ICICES), 2016,
  • [23] Guest Editorial: Optimal Utilisation of Storage Systems in Transmission and Distribution Systems
    Chung, C. Y.
    Wen, Fushuan
    Ledwich, Gerard
    Venkatesh, Bala
    IET GENERATION TRANSMISSION & DISTRIBUTION, 2016, 10 (03) : 563 - 565
  • [24] Age Distribution Convergence Mechanisms for Flash Based File Systems
    McEwan, Alistair A.
    Mir, Irfan F.
    JOURNAL OF COMPUTERS, 2012, 7 (04) : 988 - 997
  • [25] Integrating Parallel File Systems with Object-Based Storage Devices
    Devulapalli, Ananth
    Dalessandro, Dennis
    Wyckoff, Pete
    Ali, Nawab
    Sadayappan, P.
    2007 ACM/IEEE SC07 CONFERENCE, 2010, : 583 - 592
  • [26] Optimal Sizing and Placement of Battery Energy Storage in Distribution System Based on Solar Size for Voltage Regulation
    Nazaripouya, H.
    Wang, Y.
    Chu, P.
    Pota, H. R.
    Gadh, R.
    2015 IEEE POWER & ENERGY SOCIETY GENERAL MEETING, 2015,
  • [27] Methodology for the Optimal Siting and Sizing of Storage Systems in Distribution Networks
    Vigano, Giacomo
    Rossi, Marco
    Moneta, Diana
    Carlini, Claudio
    2015 AEIT INTERNATIONAL ANNUAL CONFERENCE (AEIT), 2015,
  • [28] DETERMINING OPTIMAL REORDER INTERVALS IN CAPACITATED PRODUCTION-DISTRIBUTION SYSTEMS
    JACKSON, PL
    MAXWELL, WL
    MUCKSTADT, JA
    MANAGEMENT SCIENCE, 1988, 34 (08) : 938 - 958
  • [29] Optimal location-allocation of storage devices and renewable-based DG in distribution systems
    Home-Ortiz, Juan M.
    Pourakbari-Kasmaei, Mahdi
    Lehtonen, Matti
    Sanches Mantovani, Jose Roberto
    ELECTRIC POWER SYSTEMS RESEARCH, 2019, 172 : 11 - 21
  • [30] Multi-Agent Optimal Allocation of Energy Storage Systems in Distribution Systems
    Zheng, Yu
    Hill, David J.
    Dong, Zhao Yang
    IEEE TRANSACTIONS ON SUSTAINABLE ENERGY, 2017, 8 (04) : 1715 - 1725