TIM: Enabling Large-Scale White-Box Testing on In-App Deep Learning Models

Authors
Wu, Hao [1]
Gong, Yuhang [1]
Ke, Xiaopeng [1]
Liang, Hanzhong [1]
Xu, Fengyuan [1]
Liu, Yunxin [2]
Zhong, Sheng [1]
Affiliations
[1] Nanjing Univ, Natl Key Lab Novel Software Technol, Nanjing 210023, Peoples R China
[2] Tsinghua Univ, Inst AI Ind Res, Beijing 100083, Peoples R China
Keywords
AI model testing; program slicing; program analysis; intelligent application security
DOI
10.1109/TIFS.2024.3455761
Chinese Library Classification (CLC) number
TP301 [Theory, Methods]
Discipline classification code
081202
Abstract
Intelligent Applications (iApps), equipped with in-App deep learning (DL) models, are emerging to provide reliable DL inference services. However, in-App DL models are typically compiled into inference-only versions to improve system performance, which impedes their evaluation. Specifically, in-App models can currently be assessed only with black-box testing methods rather than direct white-box testing approaches. In this work, we propose TIM, an automated tool for conducting large-scale white-box testing of in-App models. Taking an iApp as input, TIM lifts the black-box (i.e., inference-only) in-App DL model into a backpropagation-enabled one and packages it together, enabling comprehensive DL model testing and the detection of security issues. TIM proposes two reconstruction techniques: one converts the inference-only model to a backpropagation-enabled version, and the other reconstructs the DL-related IO processing code. In our experiments, we use TIM to extract 100 unique commercial in-App models and convert them to white-box models with backpropagation enabled. Experimental results show that TIM's reconstruction techniques exhibit high accuracy. We open-source our prototype and part of the experimental data at https://zenodo.org/record/7548141.
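To illustrate why a backpropagation-enabled model matters for white-box testing, the following is a minimal sketch and not TIM's implementation. It assumes a hypothetical stand-in model (a three-input logistic regression with made-up weights) and shows a gradient-sign perturbation of the input, a kind of test that needs the gradient of the loss with respect to the input and therefore cannot be run directly on an inference-only model. All names (`WEIGHTS`, `BIAS`, `forward`, `input_gradient`, `gradient_sign_step`) are illustrative assumptions.

```python
import math

# Hypothetical stand-in for a lifted model: logistic regression, made-up weights.
WEIGHTS = [1.5, -2.0, 0.5]
BIAS = 0.1


def forward(x):
    """Inference path: the only thing a black-box tester can call."""
    z = sum(w * xi for w, xi in zip(WEIGHTS, x)) + BIAS
    return 1.0 / (1.0 + math.exp(-z))


def loss(x, label):
    """Binary cross-entropy of the prediction against a 0/1 label."""
    p = forward(x)
    return -(label * math.log(p) + (1 - label) * math.log(1 - p))


def input_gradient(x, label):
    """Backward path: d(loss)/dx, which for sigmoid + cross-entropy is (p - label) * w."""
    p = forward(x)
    return [(p - label) * w for w in WEIGHTS]


def gradient_sign_step(x, label, eps):
    """Move each input feature by eps in the direction that increases the loss."""
    grad = input_gradient(x, label)
    return [xi + eps * ((g > 0) - (g < 0)) for xi, g in zip(x, grad)]


x = [0.2, -0.4, 1.0]
y = 1
x_adv = gradient_sign_step(x, y, eps=0.25)
print("loss before:", loss(x, y))
print("loss after: ", loss(x_adv, y))
```

In this toy setting the perturbed input yields a higher loss than the original one, which is the signal a gradient-based robustness test looks for. A real in-App model would be far larger, and the gradient would come from a DL framework's automatic differentiation rather than a hand-derived formula.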
Pages: 8188-8203
Page count: 16