Spark-Based Parallel Method for Prediction of Events

被引:0
|
作者
B. S. A. S. Rajita
Yash Ranjan
Chandekar Tanmay Umesh
Subhrakanta Panda
机构
[1] BITS-Pilani-Hyderabad Campus,Department of CSIS
关键词
Social networks (SN); Community detection and prediction; Spark; Ensemble ML methods;
D O I
暂无
中图分类号
学科分类号
摘要
Prediction of events is imperative in many areas of social network (SN) applications. These events influence the temporal evolutionary characteristic of social networks. A study of these events can give better insights to understand the evolutionary patterns (communities) in social networks. One of the major challenges in such implementation is the processing and structuring of large datasets to suit ML models. This paper proposes a Spark-based parallel method for detection, mining, and prediction of these events that influence the evolution of communities in a temporal SN. The proposed framework processes large temporal data (taken from the DBLP dataset), uses parallel algorithms to detect the structural changes, and applies ML techniques to predict the future structural changes (events). The proposed methodology uses ensemble ML methods in the Spark ML pipeline to achieve the desired performance and accuracy. The experimental results justify that the proposed framework can predict future events with an accuracy of 82% and saves 99% of computational time.
引用
收藏
页码:3437 / 3453
页数:16
相关论文
共 50 条
  • [31] Improve Spark-based Application Performance Using Minimizer
    Wu, Jinda
    Deng, Li
    Wang, Lili
    Li, Kexue
    Lu, Yakang
    Song, Yang
    PROCEEDINGS OF 2020 IEEE 9TH DATA DRIVEN CONTROL AND LEARNING SYSTEMS CONFERENCE (DDCLS'20), 2020, : 595 - 599
  • [32] Feature selection from disaster tweets using Spark-based parallel meta-heuristic optimizers
    Noori, Mohammed Ahsan Raza
    Sharma, Bharti
    Mehra, Ritika
    SOCIAL NETWORK ANALYSIS AND MINING, 2022, 12 (01)
  • [33] A Spark-based Incremental Algorithm for Frequent Itemset Mining
    Wen, Haoxing
    Li, Xiaoguang
    Kou, Mingdong
    Tou, Huaixiao
    He, Hengyi
    Yang, Yulu
    BDIOT 2018: PROCEEDINGS OF THE 2018 2ND INTERNATIONAL CONFERENCE ON BIG DATA AND INTERNET OF THINGS, 2018, : 53 - 58
  • [34] CHARACTERIZATION OF A SPARK-BASED, NONTHERMAL PLASMA FOR GERMICIDAL CAPABILITIES
    Ferrell, J. R.
    Fulton, J. A.
    Woolverton, C. J.
    WOUND REPAIR AND REGENERATION, 2011, 19 (02) : A22 - A22
  • [35] Feature selection from disaster tweets using Spark-based parallel meta-heuristic optimizers
    Mohammed Ahsan Raza Noori
    Bharti Sharma
    Ritika Mehra
    Social Network Analysis and Mining, 2022, 12
  • [36] Spark-Based Classification Algorithms for Daily Living Activities
    Moldovan, Dorin
    Antal, Marcel
    Pop, Claudia
    Olosutean, Adrian
    Cioara, Tudor
    Anghel, Ionut
    Salomie, Ioan
    ARTIFICIAL INTELLIGENCE AND ALGORITHMS IN INTELLIGENT SYSTEMS, 2019, 764 : 69 - 78
  • [37] An efficient spark-based adaptive windowing for entity matching
    Mestre, Demetrio Gomes
    Santos Pires, Carlos Eduardo
    Nascimento, Dimas Cassimiro
    Monteiro de Queiroz, Andreza Raquel
    Santos, Veruska Borges
    Araujo, Tiago Brasileiro
    JOURNAL OF SYSTEMS AND SOFTWARE, 2017, 128 : 1 - 10
  • [38] A Spark-based Apriori algorithm with reduced shuffle overhead
    Raj, Shashi
    Ramesh, Dharavath
    Sethi, Krishan Kumar
    JOURNAL OF SUPERCOMPUTING, 2021, 77 (01): : 133 - 151
  • [39] A spark-based big data analysis framework for real-time sentiment prediction on streaming data
    Kilinc, Deniz
    SOFTWARE-PRACTICE & EXPERIENCE, 2019, 49 (09): : 1352 - 1364
  • [40] Lemonade: A scalable and efficient Spark-based platform for data analytics
    dos Santos, Walter
    Carvalho, Luiz F. M.
    Avelar, Gustavo de P.
    Silva, Atila, Jr.
    2017 17TH IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND GRID COMPUTING (CCGRID), 2017, : 745 - 748