Mining-based File Caching in a Hybrid Storage System

被引:0
|
作者
Lee, Seongjin [1 ]
Won, Youjip [1 ]
Hong, Sungwoo [1 ]
机构
[1] Hanyang Univ, Dept Elect & Comp Engn, Seoul 133791, South Korea
关键词
HDD; SSD; hybrid storage; pattern mining; application launch time; PERFORMANCE;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this work, we propose a new mining-based file caching scheme for a hybrid storage disk system. In particular, we focus our efforts on reducing the latency of launching applications. The proposed scheme identifies correlated file accesses in a file access sequence via sequential pattern mining algorithm. Our scheme caches correlated files together to maximize the caching efficiency. The correlated files are extracted from the access patterns through the proposed mining scheme, which consists of three steps: frequent pattern based file extraction, cluster moving gap based file sort, and frequency and size based file prioritization. The extracted correlated files are relocated to an SSD during idle time. DiskSim and NANDSim are used to evaluate the proposed scheme, called Informed Mining. The proposed scheme is compared with a disk only scheme and five other mining based file relocation schemes: Mining based file relocation scheme (Miner), minimum distance based file relocation scheme (Min_Dist), frequency-based relocation scheme (Fre), size-based relocation scheme (Size), and one that relocates files with highest value of (file size * file access number) first to the SSD (Fr*Sz). From the simulation based experiment, launch time is reduced by about 50% using only 10% of sum of all file sizes accessed during a launch of an application.
引用
收藏
页码:1733 / 1754
页数:22
相关论文
共 50 条
  • [31] HasFS: optimizing file system consistency mechanism on NVM-based hybrid storage architecture
    Yubo Liu
    Hongbo Li
    Yutong Lu
    Zhiguang Chen
    Nong Xiao
    Ming Zhao
    [J]. Cluster Computing, 2020, 23 : 2501 - 2515
  • [32] HasFS: optimizing file system consistency mechanism on NVM-based hybrid storage architecture
    Liu, Yubo
    Li, Hongbo
    Lu, Yutong
    Chen, Zhiguang
    Xiao, Nong
    Zhao, Ming
    [J]. CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2020, 23 (04): : 2501 - 2515
  • [33] The Storage System of PCM Based on Random Access File System
    Han, Wenbing
    Chen, Xiaogang
    Zhou, Mi
    Li, Shunfen
    Li, Gezi
    Song, Zhitang
    [J]. 2016 INTERNATIONAL WORKSHOP ON INFORMATION DATA STORAGE AND TENTH INTERNATIONAL SYMPOSIUM ON OPTICAL STORAGE, 2016, 9818
  • [34] File system design for object-based storage system
    Feng, Dan
    Shi, Wei
    Qin, Lingjun
    Guan, Qing
    [J]. Huazhong Keji Daxue Xuebao (Ziran Kexue Ban)/Journal of Huazhong University of Science and Technology (Natural Science Edition), 2006, 34 (12): : 47 - 49
  • [35] A data-mining-based prefetching approach to caching for network storage systems
    Fang, Xiao
    Sheng, Olivia R. Liu
    Gao, Wei
    Iyer, Balakrishna R.
    [J]. INFORMS JOURNAL ON COMPUTING, 2006, 18 (02) : 267 - 282
  • [36] Data Mining-based Smart Industrial Park Energy Efficiency Management System
    Song, Ningxi
    Wan, Diming
    Sun, Qian
    Yue, Jianfeng
    [J]. GREEN POWER, MATERIALS AND MANUFACTURING TECHNOLOGY AND APPLICATIONS III, PTS 1 AND 2, 2014, 484-485 : 585 - +
  • [37] File Deduplication with Cloud Storage File System
    Ku, Chan-I
    Luo, Guo-Heng
    Chang, Che-Pin
    Yuan, Shyan-Ming
    [J]. 2013 IEEE 16TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND ENGINEERING (CSE 2013), 2013, : 280 - 287
  • [38] File Repair with Cluster Based Distributed Storage System
    Haytaoglu, Elif
    Dalkilic, Mehmet Emin
    [J]. 2017 25TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2017,
  • [39] A data mining-based interruptible load contract model for the modern power system
    Hui, Zou
    Jun, Yang
    Qi, Meng
    [J]. IET GENERATION TRANSMISSION & DISTRIBUTION, 2024, : 3161 - 3169