On the use of big data frameworks in big service management

Cited by: 0

Authors:

Ghedass, Fedia [1]

Ben Charrada, Faouzi [1]

Affiliations:

[1] Univ Tunis El Manar, Dept Comp Sci, Tunis, Tunisia

Source:

JOURNAL OF SOFTWARE-EVOLUTION AND PROCESS | 2024 / Vol. 36 / Issue 07

Keywords:

autonomic computing; big service; big service management; distributed representation learning; knowledge graph; MapReduce; algorithm

DOI:

10.1002/smr.2642

Chinese Library Classification (CLC) number:

TP31 [Computer Software]

Discipline classification code:

081202; 0835

Abstract:

Over the last few years, big data has emerged as a paradigm for processing and analyzing large volumes of data. Coupled with other paradigms, such as cloud computing, service computing, and the Internet of Things, big data processing takes advantage of the underlying cloud infrastructure, which allows hosting and managing massive amounts of data, while service computing allows processing and delivering various data sources as on-demand services. This synergy between multiple paradigms has led to the emergence of big services as a cross-domain, large-scale, and big data-centric service model. Apart from the adaptation issues (e.g., the need for rapid reaction to changes) inherited from other service models, the massiveness and heterogeneity of big services add a new factor of complexity to the way such a large-scale service ecosystem is managed in case of execution deviations. Indeed, big services are often subject to frequent deviations at both the functional level (e.g., service failure, QoS degradation, and IoT resource unavailability) and the data level (e.g., data source unavailability or access restrictions). Handling these execution problems is beyond the capacity of traditional web/cloud service management tools, and the majority of big service approaches have targeted specific management operations, such as selection and composition. To maintain a moderate state and high quality of their cross-domain execution, big services should be continuously monitored and managed in a scalable and autonomous way. To cope with the absence of self-management frameworks for large-scale services, the goal of this work is to design an autonomic management solution that takes full control of big services in an autonomous and distributed lifecycle process. We combine the autonomic computing and big data processing paradigms to endow big services with self-* and parallel processing capabilities.
The proposed management framework takes advantage of the well-known MapReduce programming model and Apache Spark, and manages big service-related data using knowledge graph technology. We also define a scalable embedding model that allows processing and learning latent big service knowledge in a distributed manner. Finally, a cooperative decision mechanism is defined to trigger non-conflicting management policies in response to the captured deviations of the running big service. The big service management tasks (monitoring, embedding, and decision), as well as the core modules (autonomic managers' controller, embedding module, and coordinator), are implemented on top of Apache Spark as MapReduce jobs, while the processed data are represented as resilient distributed dataset (RDD) structures. To exploit the information exchanged between the workers and the master node (coordinator), and to further resolve conflicts between management policies, we endowed the proposed framework with a lightweight communication mechanism that allows transferring useful knowledge between the running map-reduce tasks and filtering out inappropriate intermediate data (e.g., conflicting actions). The experimental results showed increased embedding quality and high performance of the autonomic managers in a parallel and cooperative setting, thanks to the shared knowledge.
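The monitoring and cooperative decision steps described in the abstract can be pictured with a minimal sketch. It is not the authors' implementation: it uses plain Python in place of Apache Spark, and every name, threshold, priority, and conflict rule below (`map_monitor`, `resolve`, `MAX_LATENCY_MS`, `CONFLICTS`, and so on) is a hypothetical chosen only for illustration. A map step turns monitoring events into proposed actions, a grouping step collects them per service, and a coordinator step drops any action that conflicts with a higher-priority one.

```python
from collections import defaultdict
from itertools import chain

# Hypothetical thresholds and conflict rules (assumptions, not from the paper).
MAX_LATENCY_MS = 200
MAX_ERROR_RATE = 0.1
CONFLICTS = {
    frozenset({"replace", "restart"}),
    frozenset({"scale_up", "scale_down"}),
}


def map_monitor(event):
    """Map step: emit (service, (action, priority)) pairs for one event."""
    proposals = []
    if not event["available"]:
        proposals.append((event["service"], ("replace", 3)))
    if event["error_rate"] > MAX_ERROR_RATE:
        proposals.append((event["service"], ("restart", 2)))
    if event["latency_ms"] > MAX_LATENCY_MS:
        proposals.append((event["service"], ("scale_up", 1)))
    return proposals


def group_by_service(pairs):
    """Shuffle step: collect all proposed actions under their service key."""
    grouped = defaultdict(list)
    for service, action in pairs:
        grouped[service].append(action)
    return grouped


def resolve(actions):
    """Coordinator step: visit actions by descending priority and keep an
    action only if it does not conflict with one already kept."""
    kept = []
    for name, _priority in sorted(set(actions), key=lambda a: (-a[1], a[0])):
        if name in kept:
            continue
        if any(frozenset({name, k}) in CONFLICTS for k in kept):
            continue
        kept.append(name)
    return kept


def manage(events):
    """Run map, group, and resolve over a batch of monitoring events."""
    pairs = chain.from_iterable(map(map_monitor, events))
    return {svc: resolve(acts) for svc, acts in group_by_service(pairs).items()}


events = [
    {"service": "s1", "latency_ms": 350, "available": True, "error_rate": 0.0},
    {"service": "s1", "latency_ms": 50, "available": False, "error_rate": 0.5},
    {"service": "s2", "latency_ms": 120, "available": True, "error_rate": 0.0},
]
plan = manage(events)
print(plan)
```

In this toy run, service `s1` receives three proposals; `restart` is filtered out because it conflicts with the higher-priority `replace`, while `scale_up` is kept. On Spark, the same shape would presumably map onto RDD transformations, but that mapping is an assumption here.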


Pages: 28
