Fuzzing Java']JavaScript engines with a syntax-aware neural program model

被引:0
|
作者
Xu, Haoran [1 ]
Wang, Yongjun [1 ]
Jiang, Zhiyuan [1 ]
Fan, Shuhui [1 ]
Fu, Shaojing [1 ]
Xie, Peidai [1 ]
机构
[1] Natl Univ Def Technol, Coll Comp, Changsha, Peoples R China
基金
中国国家自然科学基金;
关键词
Fuzzing; !text type='Java']Java[!/text]Script engines; Language model; Neural network; Grammar; Vocabulary;
D O I
10.1016/j.cose.2024.103947
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Neural network language modeling has become a remarkable approach in the generation of test cases for fuzzing JavaScript engines. Fuzzers built upon neural language models offer several advantages. They obviate the need for manually developing code generation rules, enable the extraction of patterns from high -quality seed sets, and exhibit commendable portability. Nevertheless, existing works confront challenges in three key aspects: diminished language modeling performance attributable to extensive vocabularies, potential semantic errors within generated test cases, and the limitation of black -box fuzzing, which fails to leverage the internal feedback from the target engine. This paper proposes an innovative neural model -based grey -box fuzzing approach for JavaScript engines. We incorporate the context -free grammar of JavaScript into the neural language model to mitigate the challenges associated with extensive vocabularies, thereby enhancing the model's performance. Furthermore, to enhance the semantic validity of the generated test cases, we introduce semantic constraints into the mutation process. Notably, this work pioneers the integration of grey -box testing into a fuzzer built upon a neural language model, thereby enhancing the exploration of deep paths. Our prototype, PMFuzz, surpasses NNLMbased counterparts in both language modeling performance and test case generation capabilities. PMFuzz demonstrates a high level of competitiveness in exploring the software state space when compared to traditional coverage -guided grey -box fuzzers. In our evaluation, PMFuzz successfully identified 20 new defects within mainstream JS engines. Eight of them have been confirmed and fixed. Moreover, upon applying our method to C compilers, PMFuzz has revealed 11 new defects.
引用
下载
收藏
页数:14
相关论文
共 27 条
  • [11] Syntax-Enhanced Neural Machine Translation with Syntax-Aware Word Representations
    Zhang, Meishan
    Li, Zhenghua
    Fu, Guohong
    Zhang, Min
    2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, 2019, : 1151 - 1161
  • [12] Improved Neural Machine Translation with a Syntax-Aware Encoder and Decoder
    Chen, Huadong
    Huang, Shujian
    Chiang, David
    Chen, Jiajun
    PROCEEDINGS OF THE 55TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2017), VOL 1, 2017, : 1936 - 1945
  • [13] Recurrent graph encoder for syntax-aware neural machine translation
    Liang Ding
    Longyue Wang
    Siyou Liu
    International Journal of Machine Learning and Cybernetics, 2023, 14 : 1053 - 1062
  • [14] Recurrent graph encoder for syntax-aware neural machine translation
    Ding, Liang
    Wang, Longyue
    Liu, Siyou
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2023, 14 (04) : 1053 - 1062
  • [15] SEE: Syntax-Aware Entity Embedding for Neural Relation Extraction
    He, Zhengqiu
    Chen, Wenliang
    Li, Zhenghua
    Zhang, Meishan
    Zhang, Wei
    Zhang, Min
    THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 5795 - 5802
  • [16] Syntax-Aware Complex-Valued Neural Machine Translation
    Liu, Yang
    Hou, Yuexian
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2023, PT V, 2023, 14258 : 474 - 485
  • [17] Syntax-aware neural machine translation directed by syntactic dependency degree
    Ru Peng
    Tianyong Hao
    Yi Fang
    Neural Computing and Applications, 2021, 33 : 16609 - 16625
  • [18] GRAPHSPEECH: SYNTAX-AWARE GRAPH ATTENTION NETWORK FOR NEURAL SPEECH SYNTHESIS
    Liu, Rui
    Sisman, Berrak
    Li, Haizhou
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 6059 - 6063
  • [19] Syntax-aware neural machine translation directed by syntactic dependency degree
    Peng, Ru
    Hao, Tianyong
    Fang, Yi
    NEURAL COMPUTING & APPLICATIONS, 2021, 33 (23): : 16609 - 16625
  • [20] Favocado: Fuzzing the Binding Code of Java']JavaScript Engines Using Semantically Correct Test Cases
    Sung Ta Dinh
    Cho, Haehyun
    Martin, Kyle
    Oest, Adam
    Zeng, Kyle
    Kapravelos, Alexandros
    Ahn, Gail-Joon
    Bao, Tiffany
    Wang, Ruoyu
    Doupe, Adam
    Shoshitaishvili, Yan
    28TH ANNUAL NETWORK AND DISTRIBUTED SYSTEM SECURITY SYMPOSIUM (NDSS 2021), 2021,