On the use of big data frameworks in big service management

Cited by: 0

Authors:

Ghedass, Fedia [1]

Ben Charrada, Faouzi [1]

Affiliations:

[1] Univ Tunis El Manar, Dept Comp Sci, Tunis, Tunisia

Source:

JOURNAL OF SOFTWARE-EVOLUTION AND PROCESS | 2024 / Vol. 36 / No. 07

Keywords:

autonomic computing; big service; big service management; distributed representation learning; knowledge graph; MapReduce; algorithm

DOI:

10.1002/smr.2642

Chinese Library Classification (CLC) number:

TP31 [Computer software]

Discipline classification codes:

081202; 0835

Abstract:

Over the last few years, big data has emerged as a paradigm for processing and analyzing large volumes of data. Coupled with other paradigms, such as cloud computing, service computing, and the Internet of Things, big data processing takes advantage of the underlying cloud infrastructure, which allows hosting and managing massive amounts of data, while service computing allows processing and delivering various data sources as on-demand services. This synergy between multiple paradigms has led to the emergence of big services as a cross-domain, large-scale, and big data-centric service model. Apart from the adaptation issues (e.g., the need for fast reaction to changes) inherited from other service models, the massiveness and heterogeneity of big services add a new factor of complexity to the way such a large-scale service ecosystem is managed in case of execution deviations. Indeed, big services are often subject to frequent deviations at both the functional level (e.g., service failure, QoS degradation, and IoT resource unavailability) and the data level (e.g., data source unavailability or access restrictions). Handling these execution problems is beyond the capacity of traditional web/cloud service management tools, and the majority of big service approaches have targeted specific management operations, such as selection and composition. To maintain a moderate state and high quality of their cross-domain execution, big services should be continuously monitored and managed in a scalable and autonomous way. To cope with the absence of self-management frameworks for large-scale services, the goal of this work is to design an autonomic management solution that takes full control of big services through an autonomous and distributed lifecycle process. We combine the autonomic computing and big data processing paradigms to endow big services with self-* and parallel processing capabilities.
The proposed management framework takes advantage of the well-known MapReduce programming model and Apache Spark, and manages big service-related data using knowledge graph technology. We also define a scalable embedding model that allows processing and learning latent big service knowledge in a distributed manner. Finally, a cooperative decision mechanism is defined to trigger non-conflicting management policies in response to the captured deviations of the running big service. The big service management tasks (monitoring, embedding, and decision), as well as the core modules (the autonomic managers' controller, the embedding module, and the coordinator), are implemented on top of Apache Spark as MapReduce jobs, while the processed data are represented as resilient distributed dataset (RDD) structures. To exploit the information exchanged between the workers and the master node (coordinator), and to further resolve conflicts between management policies, we endowed the proposed framework with a lightweight communication mechanism that transfers useful knowledge between the running MapReduce tasks and filters out inappropriate intermediate data (e.g., conflicting actions). The experimental results showed an increased quality of embeddings and a high performance of the autonomic managers in a parallel and cooperative setting, thanks to the shared knowledge.
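As a rough illustration of the decision step described in the abstract, the following is a minimal sketch in plain Python of a map/shuffle/reduce pass that turns captured deviations into at most one management action per service. It is not the authors' implementation and does not use Apache Spark. The service names, the policy table, the priorities, and the rule that the highest-priority action wins are all assumptions made for this example; the deviation types are the ones the abstract lists.

```python
from collections import defaultdict
from functools import reduce

# Hypothetical deviation events captured by monitoring: (service_id, deviation_type).
EVENTS = [
    ("weather-feed", "data_source_unavailable"),
    ("route-planner", "qos_degradation"),
    ("route-planner", "service_failure"),
    ("traffic-sensor", "iot_resource_unavailable"),
]

# Hypothetical policy table: deviation type -> (management action, priority).
POLICIES = {
    "service_failure": ("substitute_service", 3),
    "qos_degradation": ("scale_out", 2),
    "data_source_unavailable": ("switch_data_source", 2),
    "iot_resource_unavailable": ("rebind_resource", 1),
}


def map_phase(event):
    """Map one deviation event to a (service_id, candidate_action) pair."""
    service_id, deviation = event
    return service_id, POLICIES[deviation]


def shuffle(pairs):
    """Group candidate actions by service, as a MapReduce shuffle would."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(candidates):
    """Keep one action per service: the candidate with the highest priority.

    This stands in for conflict filtering; the real conflict rule is not
    given in the abstract, so "highest priority wins" is an assumption.
    """
    return reduce(lambda best, cand: cand if cand[1] > best[1] else best, candidates)


def decide(events):
    """Run map, shuffle, and reduce, returning {service_id: chosen_action}."""
    grouped = shuffle(map(map_phase, events))
    return {service: reduce_phase(cands)[0] for service, cands in grouped.items()}


if __name__ == "__main__":
    for service, action in decide(EVENTS).items():
        print(service, "->", action)
```

In a Spark setting, the same shape would typically be expressed over an RDD of key-value pairs, with the grouping and reduction merged into a single by-key reduction. The plain-Python version is used here only so the sketch runs without a cluster.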


Pages: 28

