A similarity study of I/O traces via string kernels

被引:0
|
作者
Raul Torres
Julian M. Kunkel
Manuel F. Dolz
Thomas Ludwig
机构
[1] Universität Hamburg,Department of Informatics
[2] University of Reading,Department of Computer Science
[3] Universidad Carlos III de Madrid,Department of Computer Science
来源
关键词
Kernel functions; Kast2 spectrum kernel; I/O access pattern comparison; String kernels;
D O I
暂无
中图分类号
学科分类号
摘要
Understanding I/O for data-intense applications is the foundation for the optimization of these applications. The classification of the applications according to the expressed I/O access pattern eases the analysis. An access pattern can be seen as fingerprint of an application. In this paper, we address the classification of traces. Firstly, we convert them first into a weighted string representation. Due to the fact that string objects can be easily compared using kernel methods, we explore their use for fingerprinting I/O patterns. To improve accuracy, we propose a novel string kernel function called kast2 spectrum kernel. The similarity matrices, obtained after applying the mentioned kernel over a set of examples from a real application, were analyzed using kernel principal component analysis and hierarchical clustering. The evaluation showed that two out of four I/O access pattern groups were completely identified, while the other two groups conformed a single cluster due to the intrinsic similarity of their members. The proposed strategy can be promisingly applied to other similarity problems involving tree-like structured data.
引用
收藏
页码:7814 / 7826
页数:12
相关论文
共 50 条
  • [21] Dancing with Giants: Wimpy Kernels for On-Demand I/O Isolation
    Zhou, Zongwei
    Yu, Miao
    Gligor, Virgil D.
    IEEE SECURITY & PRIVACY, 2015, 13 (02) : 38 - 46
  • [22] Efficient Similarity Search by Reducing I/O with Compressed Sketches
    Mueller-Molina, Arnoldo Jose
    Shinohara, Takeshi
    SISAP 2009: 2009 SECOND INTERNATIONAL WORKSHOP ON SIMILARITY SEARCH AND APPLICATIONS, PROCEEDINGS, 2009, : 30 - 38
  • [23] A Preprocessing of Service Registry: Based on I/O Parameter Similarity
    Lakshmi, H. N.
    Mohanty, Hrushikesha
    DISTRIBUTED COMPUTING AND INTERNET TECHNOLOGY, ICDCIT 2015, 2015, 8956 : 220 - 232
  • [24] Survey of studies on self-similarity in I/O workloads
    Department of Computer Science and Technology, Tsinghua University, Beijing 100084, China
    Jisuanji Yanjiu yu Fazhan, 2008, 6 (1072-1084): : 1072 - 1084
  • [25] TraceRAR: An I/O Performance Evaluation Tool for Replaying, Analyzing, and Regenerating Traces
    Li, Bingzhe
    Toussi, Farnaz
    Anderson, Clark
    Lilja, David J.
    Du, David H. C.
    2017 INTERNATIONAL CONFERENCE ON NETWORKING, ARCHITECTURE, AND STORAGE (NAS), 2017, : 11 - 20
  • [26] An Empirical Method for Processing I/O Traces to Analyze the Performance of DL Applications
    Parraga, Edixon
    Leon, Betzabeth
    Mendez, Sandra
    Rexachs, Dolores
    Suppi, Remo
    Luque, Emilio
    CLOUD COMPUTING, BIG DATA AND EMERGING TOPICS, JCC-BD&ET 2024, 2025, 2189 : 74 - 90
  • [27] FriSM: Malicious Exploit Kit Detection via Feature-Based String-Similarity Matching
    Kim, Sungjin
    Kang, Brent ByungHoon
    SECURITY AND PRIVACY IN COMMUNICATION NETWORKS, SECURECOMM 2018, PT I, 2018, 254 : 416 - 432
  • [28] No-Ghost Theorem for Neveu-Schwarz String in 0-Picture via Similarity Transformation
    Kohriki, Maiko
    Kunitomo, Hiroshi
    Murata, Masaki
    PROGRESS OF THEORETICAL PHYSICS SUPPLEMENT, 2011, (188): : 254 - 262
  • [29] On using physico-chemical properties of amino acids in string kernels for protein classification via support vector machines
    Limin Li
    Kiyoko F. Aoki-Kinoshita
    Wai-Ki Ching
    Hao Jiang
    Journal of Systems Science and Complexity, 2015, 28 : 504 - 516
  • [30] On using physico-chemical properties of amino acids in string kernels for protein classification via support vector machines
    Li Limin
    Aoki-Kinoshita, Kiyoko F.
    Ching Wai-Ki
    Jiang Hao
    JOURNAL OF SYSTEMS SCIENCE & COMPLEXITY, 2015, 28 (02) : 504 - 516