On the use of big data frameworks in big service management

Cited by: 0
Authors
Ghedass, Fedia [1]
Ben Charrada, Faouzi [1]
Affiliation
[1] Univ Tunis El Manar, Dept Comp Sci, Tunis, Tunisia
Keywords
autonomic computing; big service; big service management; distributed representation learning; knowledge graph; MapReduce; algorithm
DOI
10.1002/smr.2642
Chinese Library Classification (CLC) number
TP31 [Computer Software]
Discipline classification codes
081202; 0835
Abstract
Over the last few years, big data has emerged as a paradigm for processing and analyzing large volumes of data. Coupled with other paradigms, such as cloud computing, service computing, and the Internet of Things, big data processing takes advantage of the underlying cloud infrastructure, which allows hosting and managing massive amounts of data, while service computing allows various data sources to be processed and delivered as on-demand services. This synergy between multiple paradigms has led to the emergence of big services as a cross-domain, large-scale, and big data-centric service model. Apart from the adaptation issues (e.g., the need for rapid reaction to changes) inherited from other service models, the massiveness and heterogeneity of big services add a new factor of complexity to the way such a large-scale service ecosystem is managed in case of execution deviations. Indeed, big services are often subject to frequent deviations at both the functional level (e.g., service failure, QoS degradation, and IoT resource unavailability) and the data level (e.g., data source unavailability or access restrictions). Handling these execution problems is beyond the capacity of traditional web/cloud service management tools, and the majority of big service approaches have targeted specific management operations, such as selection and composition. To maintain a moderate state and high quality of their cross-domain execution, big services should be continuously monitored and managed in a scalable and autonomous way. To cope with the absence of self-management frameworks for large-scale services, the goal of this work is to design an autonomic management solution that takes full control of big services in an autonomous and distributed lifecycle process. We combine autonomic computing and big data processing paradigms to endow big services with self-* and parallel processing capabilities.
The proposed management framework takes advantage of the well-known MapReduce programming model and Apache Spark, and manages big service-related data using knowledge graph technology. We also define a scalable embedding model that allows processing and learning latent big service knowledge in a distributed manner. Finally, a cooperative decision mechanism is defined to trigger non-conflicting management policies in response to the captured deviations of the running big service. The big service management tasks (monitoring, embedding, and decision), as well as the core modules (autonomic managers' controller, embedding module, and coordinator), are implemented on top of Apache Spark as MapReduce jobs, while the processed data are represented as resilient distributed dataset (RDD) structures. To exploit the information exchanged between the workers and the master node (coordinator), and to further resolve conflicts between management policies, we endowed the proposed framework with a lightweight communication mechanism that transfers useful knowledge between the running MapReduce tasks and filters out inappropriate intermediate data (e.g., conflicting actions). The experimental results showed an increase in embedding quality and high performance of the autonomic managers in a parallel and cooperative setting, thanks to the shared knowledge.
Pages: 28