Fault Injection and Detection for Artificial Intelligence Applications in Container-Based Clouds

Cited by: 13
Authors
Ye, Kejiang [1]
Liu, Yangyang [2]
Xu, Guoyao [1, 3]
Xu, Cheng-Zhong [1]
Affiliations
[1] Chinese Acad Sci, Shenzhen Inst Adv Technol, Shenzhen, Peoples R China
[2] Xidian Univ, Sch Comp Sci & Technol, Xian, Peoples R China
[3] Wayne State Univ, Dept Elect & Comp Engn, Detroit, MI USA
Source
CLOUD COMPUTING - CLOUD 2018 | 2018 / Vol. 10967
Funding
National Natural Science Foundation of China
Keywords
DOI
10.1007/978-3-319-94295-7_8
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Container technology is increasingly used to build modern cloud computing systems, achieving higher efficiency and lower resource costs than traditional virtual machine technology. Artificial intelligence (AI) is a mainstream method for processing big data and is used in many areas to improve effectiveness. Attacks are known to happen every day in production cloud systems; however, the fault behaviors and interference of up-to-date AI applications in container-based cloud systems are still not clear. This paper studies the reliability of container-based clouds. We first propose a fault injection framework for container-based cloud systems. We build a Docker container environment with the TensorFlow deep learning framework installed, and develop four typical attack programs: a CPU attack, a memory attack, a disk attack, and a DDoS attack. We then inject the attack programs into containers running AI applications (CNN, RNN, BRNN, and DRNN) to observe fault behaviors and interference phenomena. After that, we design fault detection models based on the quantile regression method to detect potential faults in containers. Experimental results show that the proposed fault detection models can effectively detect the injected faults, with more than 60% precision, more than 90% recall, and nearly 100% accuracy.
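The abstract reports three metrics that can look inconsistent at first glance (moderate precision alongside near-perfect accuracy). The following minimal Python sketch illustrates how that pattern can arise when faults are rare. It is not the paper's method: the abstract gives no model details, so a plain empirical upper-quantile threshold on a baseline metric stands in for the quantile regression models, and all data, function names, and the 0.99 quantile level are hypothetical choices made for illustration.

```python
import math


def empirical_quantile(values, q):
    """Return the q-th empirical quantile using the nearest-rank method."""
    ordered = sorted(values)
    # Nearest rank: smallest value with at least a fraction q of data at or below it.
    rank = min(len(ordered), max(1, math.ceil(q * len(ordered))))
    return ordered[rank - 1]


def detect(samples, threshold):
    """Flag every sample that exceeds the threshold as a suspected fault."""
    return [x > threshold for x in samples]


def confusion_metrics(predicted, actual):
    """Compute (precision, recall, accuracy) from boolean predictions and labels."""
    pairs = list(zip(predicted, actual))
    tp = sum(1 for p, a in pairs if p and a)
    fp = sum(1 for p, a in pairs if p and not a)
    fn = sum(1 for p, a in pairs if not p and a)
    tn = sum(1 for p, a in pairs if not p and not a)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    accuracy = (tp + tn) / len(pairs)
    return precision, recall, accuracy


# Hypothetical fault-free baseline: CPU utilisation (%) cycling between 40 and 49.
baseline = [40 + (i % 10) for i in range(100)]
threshold = empirical_quantile(baseline, 0.99)

# Hypothetical monitored stream: 10 injected faults and 4 benign spikes in 1000 samples.
monitored, labels = [], []
for i in range(1000):
    if i % 100 == 0:        # injected fault (synthetic)
        monitored.append(95)
        labels.append(True)
    elif i % 250 == 1:      # benign spike that the detector will wrongly flag
        monitored.append(60)
        labels.append(False)
    else:                   # normal load
        monitored.append(40 + (i % 10))
        labels.append(False)

precision, recall, accuracy = confusion_metrics(detect(monitored, threshold), labels)
print(f"precision={precision:.3f} recall={recall:.3f} accuracy={accuracy:.3f}")
```

On this synthetic stream every fault is caught, but four benign spikes are also flagged, so precision is noticeably lower than recall while accuracy stays close to 100% because the vast majority of samples are correctly classified as normal.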
Pages: 112-127
Number of pages: 16
Related papers
50 in total
  • [1] Minimizing Communication Overheads in Container-based Clouds for HPC Applications
    Maliszewski, Anderson M.
    Vogel, Adriano
    Griebler, Dalvan
    Roloff, Eduardo
    Fernandes, Luiz G.
    Navaux, Philippe O. A.
    [J]. 2019 IEEE SYMPOSIUM ON COMPUTERS AND COMMUNICATIONS (ISCC), 2019, : 474 - 479
  • [2] ADGS: Anomaly Detection and Localization based on Graph Similarity in Container-based Clouds
    Lu, Chengzhi
    Ye, Kejiang
    Chen, Wenyan
    Xu, Cheng-Zhong
    [J]. 2019 IEEE 25TH INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED SYSTEMS (ICPADS), 2019, : 53 - 60
  • [3] CMonitor: A Monitoring and Alarming Platform for Container-Based Clouds
    Ji, Shujian
    Ye, Kejiang
    Xu, Cheng-Zhong
    [J]. CLOUD COMPUTING - CLOUD 2019, 2019, 11513 : 324 - 339
  • [4] Flexible Network Address Mapping for Container-based Clouds
    Kim, Kyung-Hwa
    Lee, Jae Woo
    Ben-Ami, Michael
    Nam, Hyunwoo
    Janak, Jan
    Schulzrinne, Henning
    [J]. 2015 1st IEEE Conference on Network Softwarization (NetSoft), 2015,
  • [5] Performance Impact of IEEE 802.3ad in Container-Based Clouds for HPC Applications
    Maliszewski, Anderson M.
    Roloff, Eduardo
    Griebler, Dalvan
    Gaspary, Luciano P.
    Navaux, Philippe O. A.
    [J]. COMPUTATIONAL SCIENCE AND ITS APPLICATIONS - ICCSA 2020, PT VI, 2020, 12254 : 158 - 167
  • [6] Using Attack Injection to Evaluate Intrusion Detection Effectiveness in Container-based Systems
    Flora, Jose
    Goncalves, Paulo
    Antunes, Nuno
    [J]. 2020 IEEE 25TH PACIFIC RIM INTERNATIONAL SYMPOSIUM ON DEPENDABLE COMPUTING (PRDC 2020), 2020, : 60 - 69
  • [7] A Container-Based Framework for Developing ROS Applications
    Melo, Pedro
    Arrais, Rafael
    Teixeira, Sergio
    Veiga, Germano
    [J]. 2022 IEEE 20TH INTERNATIONAL CONFERENCE ON INDUSTRIAL INFORMATICS (INDIN), 2022, : 280 - 285
  • [8] Container-based Microservice Architecture for Cloud Applications
    Singh, Vindeep
    Peddoju, Sateesh K.
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATION AND AUTOMATION (ICCCA), 2017, : 847 - 852
  • [9] Securing Container-based Clouds with Syscall-aware Scheduling
    Le, Michael V.
    Ahmed, Salman
    Williams, Dan
    Jamjoom, Hani
    [J]. PROCEEDINGS OF THE 2023 ACM ASIA CONFERENCE ON COMPUTER AND COMMUNICATIONS SECURITY, ASIA CCS 2023, 2023, : 812 - 826
  • [10] Evaluating, Estimating, and Improving Network Performance in Container-based Clouds
    Rista, Cassiano
    Teixeira, Marcelo
    Griebler, Dalvan
    Fernandes, Luiz Gustavo
    [J]. 2018 IEEE SYMPOSIUM ON COMPUTERS AND COMMUNICATIONS (ISCC), 2018, : 519 - 525