Accelerating Regular Expression Matching Over Compressed HTTP

被引:0
|
作者
Becchi, Michela [1 ]
Bremler-Barr, Anat [2 ]
Hay, David [3 ]
Kochba, Omer [2 ]
Koral, Yaron [3 ]
机构
[1] Univ Missouri, Columbia, MO 65211 USA
[2] Interdisciplinary Ctr Herzliya, Herzliyya, Israel
[3] Hebrew Univ Jerusalem, IL-91905 Jerusalem, Israel
关键词
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
This paper focuses on regular expression matching over compressed traffic. The need for such matching arises from two independent trends. First, the volume and share of compressed IITTP traffic is constantly increasing. Second, due to their superior expressibility, current Deep Packet Inspection engines use regular expressions more and more frequently. We present an algorithmic framework to accelerate such matching, taking advantage of information gathered when the traffic was initially compressed. HTTP compression is typically performed through the GZIP protocol, which uses back references to repeated strings. Our algorithm is based on calculating (for every byte) the minimum number of (previous) bytes that can be part of a future regular expression matching. When inspecting a back -reference, only these bytes should be taken into account, thus enabling one to skip repeated strings almost entirely without missing a match. We show that our generic framework works with either NFA-based or DFA-based implementations and gains performance boosts of more than 70%. Moreover, it can be readily adapted to most existing regular expression matching algorithms, which usually are based either on NFA, DFA or combinations of the two. Finally, we discuss other applications in which calculating the number of relevant bytes becomes handy, even when the traffic is not compressed.
引用
收藏
页数:9
相关论文
共 50 条
  • [1] Accelerating Multipattern Matching on Compressed HTTP Traffic
    Bremler-Barr, Anat
    Koral, Yaron
    IEEE-ACM TRANSACTIONS ON NETWORKING, 2012, 20 (03) : 970 - 983
  • [2] Efficient regular expression matching over compressed traffic
    Sun, Xiuwen
    Li, Hao
    Zhao, Dan
    Lu, Xingxing
    Peng, Zheng
    Hu, Chengchen
    COMPUTER NETWORKS, 2020, 168 (168)
  • [3] Accelerating Multi-Patterns Matching on Compressed HTTP Traffic
    Bremler-Barr, Anat
    Koral, Yaron
    IEEE INFOCOM 2009 - IEEE CONFERENCE ON COMPUTER COMMUNICATIONS, VOLS 1-5, 2009, : 397 - 405
  • [4] Towards a Fast Regular Expression Matching Method over Compressed Traffic
    Sun, Xiuwen
    Li, Hao
    Lu, Xingxing
    Zhao, Dan
    Peng, Zheng
    Hu, Chengchen
    2018 IEEE/ACM 26TH INTERNATIONAL SYMPOSIUM ON QUALITY OF SERVICE (IWQOS), 2018,
  • [5] Efficient Regular Expression Matching on Compressed Strings
    Han, Yutong
    Wang, Bin
    Yang, Xiaochun
    Zhu, Huaijie
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS (DASFAA 2017), PT II, 2017, 10178 : 219 - 234
  • [6] Efficient regular expression matching over hybrid dictionary-based compressed data
    Sun, Xiuwen
    Mo, Da
    Wu, Di
    Ye, Chunhui
    Yu, Qingying
    Cui, Jie
    Zhong, Hong
    JOURNAL OF NETWORK AND COMPUTER APPLICATIONS, 2023, 215
  • [7] DFA-Based Regular Expression Matching on Compressed Traffic
    Sun, Yan
    Kim, Min Sik
    2011 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC), 2011,
  • [8] Accelerating Regular Expression Matching Using Hierarchical Parallel Machines on GPU
    Lin, Cheng-Hung
    Liu, Chen-Hsiung
    Chang, Shih-Chieh
    2011 IEEE GLOBAL TELECOMMUNICATIONS CONFERENCE (GLOBECOM 2011), 2011,
  • [9] Improved approximate string matching and regular expression matching on Ziv-Lempel compressed texts
    Bille, Philip
    Fagerberg, Rolf
    Gortz, Inge Li
    COMBINATORIAL PATTERN MATCHING, PROCEEDINGS, 2007, 4580 : 52 - +
  • [10] Improved Approximate String Matching and Regular Expression Matching on Ziv-Lempel Compressed Texts
    Bille, Philip
    Fagerberg, Rolf
    Gortz, Inge Li
    ACM TRANSACTIONS ON ALGORITHMS, 2009, 6 (01)