On the use of big data frameworks in big service management

被引:0
|
作者
Ghedass, Fedia [1 ]
Ben Charrada, Faouzi [1 ]
机构
[1] Univ Tunis El Manar, Dept Comp Sci, Tunis, Tunisia
关键词
autonomic computing; big service; big service management; distributed representation learning; knowledge graph; MapReduce; MAPREDUCE; ALGORITHM;
D O I
10.1002/smr.2642
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Over the last few years, big data have emerged as a paradigm for processing and analyzing a large volume of data. Coupled with other paradigms, such as cloud computing, service computing, and Internet of Things, big data processing takes advantage of the underlying cloud infrastructure, which allows hosting and managing massive amounts of data, while service computing allows to process and deliver various data sources as on-demand services. This synergy between multiple paradigms has led to the emergence of big services, as a cross-domain, large-scale, and big data-centric service model. Apart from the adaptation issues (e.g., need of high reaction to changes) inherited from other service models, the massiveness and heterogeneity of big services add a new factor of complexity to the way such a large-scale service ecosystem is managed in case of execution deviations. Indeed, big services are often subject to frequent deviations at both the functional (e.g., service failure, QoS degradation, and IoT resource unavailability) and data (e.g., data source unavailability or access restrictions) levels. Handling these execution problems is beyond the capacity of traditional web/cloud service management tools, and the majority of big service approaches have targeted specific management operations, such as selection and composition. To maintain a moderate state and high quality of their cross-domain execution, big services should be continuously monitored and managed in a scalable and autonomous way. To cope with the absence of self-management frameworks for large-scale services, the goal of this work is to design an autonomic management solution that takes the whole control of big services in an autonomous and distributed lifecycle process. We combine autonomic computing and big data processing paradigms to endow big services with self-* and parallel processing capabilities. The proposed management framework takes advantage of the well-known MapReduce programming model and Apache Spark and manages big service's related data using knowledge graph technology. We also define a scalable embedding model that allows processing and learning latent big service knowledge in a distributed manner. Finally, a cooperative decision mechanism is defined to trigger non-conflicting management policies in response to the captured deviations of the running big service. Big services' management tasks (monitoring, embedding, and decision), as well as the core modules (autonomic managers' controller, embedding module, and coordinator), are implemented on top of Apache Spark as MapReduce jobs, while the processed data are represented as resilient distributed dataset (RDD) structures. To exploit the shared information exchanged between the workers and the master node (coordinator), and for further resolution of conflicts between management policies, we endowed the proposed framework with a lightweight communication mechanism that allows transferring useful knowledge between the running map-reduce tasks and filtering inappropriate intermediate data (e.g., conflicting actions). The experimental results proved the increased quality of embeddings and the high performance of autonomic managers in a parallel and cooperative setting, thanks to the shared knowledge. We combine autonomic computing and big data processing to endow big services with self-management capabilities. The proposed management framework takes advantage of MapReduce programming model and Apache Spark. It manages big service's related data using knowledge graph technology. We also define a scalable embedding model that allows processing and learning latent big service knowledge in a distributed manner. A cooperative decision mechanism is defined to trigger non-conflicting management policies in response to the captured deviations of the running big service. image
引用
收藏
页数:28
相关论文
共 50 条
  • [1] On the use of big data frameworks for big service composition
    Sellami, Mokhtar
    Mezni, Haithem
    Hacid, Mohand Said
    JOURNAL OF NETWORK AND COMPUTER APPLICATIONS, 2020, 166
  • [2] From Big Data to Big Service
    Xu, Xiaofei
    Sheng, Quan Z.
    Zhang, Liang-Jie
    Fan, Yushun
    Dustdar, Schahram
    COMPUTER, 2015, 48 (07) : 80 - 83
  • [3] Service Oriented Big Data Management for Transport
    Kemp, Gavin
    Vargas-Solar, Genoveva
    Da Silva, Catarina Ferreira
    Ghodous, Parisa
    Collet, Christine
    SMART CITIES, GREEN TECHNOLOGIES, AND INTELLIGENT TRANSPORT SYSTEMS (SMARTGREENS 2015), 2015, 579 : 267 - 281
  • [4] Big data analytics for network and service management
    Diao, Yixin
    Zincir-Heywood, A. Nur
    INTERNATIONAL JOURNAL OF NETWORK MANAGEMENT, 2017, 27 (04)
  • [5] Demonstration of the Myria Big Data Management Service
    Halperin, Daniel
    de Almeida, Victor Teixeira
    Choo, Lee Lee
    Chu, Shumo
    Koutris, Paraschos
    Moritz, Dominik
    Ortiz, Jennifer
    Ruamviboonsuk, Vaspol
    Wang, Jingjing
    Whitaker, Andrew
    Xu, Shengliang
    Balazinska, Magdalena
    Howe, Bill
    Suciu, Dan
    SIGMOD'14: PROCEEDINGS OF THE 2014 ACM SIGMOD INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2014, : 881 - 884
  • [6] Big Data Analytics Frameworks
    Chandarana, Parth
    Vijayalakshmi, M.
    2014 INTERNATIONAL CONFERENCE ON CIRCUITS, SYSTEMS, COMMUNICATION AND INFORMATION TECHNOLOGY APPLICATIONS (CSCITA), 2014, : 430 - 434
  • [7] Big Data in Stroke: How to Use Big Data to Make the Next Management Decision
    Yuzhe Liu
    Yuan Luo
    Andrew M. Naidech
    Neurotherapeutics, 2023, 20 : 744 - 757
  • [8] Big Data in Stroke: How to Use Big Data to Make the Next Management Decision
    Liu, Yuzhe
    Luo, Yuan
    Naidech, Andrew M.
    NEUROTHERAPEUTICS, 2023, 20 (03) : 744 - 757
  • [9] Service-generated Big Data and Big Data-as-a-Service: An Overview
    Zheng, Zibin
    Zhu, Jieming
    Lyu, Michael R.
    2013 IEEE INTERNATIONAL CONGRESS ON BIG DATA, 2013, : 403 - 410
  • [10] Access Control Management as a Service for NoSQL Big Data
    Habiba, Mansura
    Islam, Md Rafiqul
    Ali, A. B. M. Shawkat
    2015 2ND ASIA-PACIFIC WORLD CONGRESS ON COMPUTER SCIENCE AND ENGINEERING (APWC ON CSE 2015), 2015,