An empirical framework for event prediction in massive datasets

被引:0
|
作者
Rajita, B. S. A. S. [1 ]
Soni, Samarth [1 ]
Kumari, Deepa [1 ]
Panda, Subhrakanta [1 ]
机构
[1] BITS Pilani Hyderabad Campus, CSIS Dept, Hyderabad, India
关键词
Social networks (SN); Evolution; Machine learning model; Event prediction; Computer science digital bibliography & library project (DBLP); NETWORK; CLASSIFICATION; SELECTION; EVOLUTION; MACHINE;
D O I
10.1007/s13198-024-02302-1
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Certain events always trigger evolutionary changes in temporal Social Networks (SNs) communities. Machine Learning models make predictions for such events. The performance of these ML models largely depends on the dataset's features. Existing literature shows that the community features of the datasets have helped ML models predict the events with some accuracy. However, a temporal dataset has temporal and community features owing to its evolving structures. These temporal features also aid in improving the performance of the ML models. Thus, this work aims to compare the effectiveness of temporal and community features in improving the accuracy of ML models. This paper proposes a framework to extract the detected communities' community- and temporal- features in temporal data. This research also analyses ML models suitable for predicting events based on features and compares their performance. The experimental research shows that adding temporal features improves the prediction accuracy from 79.51 to 81.47% and saves 59.37% of the computational time of ML models.
引用
收藏
页码:2880 / 2901
页数:22
相关论文
共 50 条
  • [1] Statistical inference in massive datasets by empirical likelihood
    Ma, Xuejun
    Wang, Shaochen
    Zhou, Wang
    [J]. COMPUTATIONAL STATISTICS, 2022, 37 (03) : 1143 - 1164
  • [2] Statistical inference in massive datasets by empirical likelihood
    Xuejun Ma
    Shaochen Wang
    Wang Zhou
    [J]. Computational Statistics, 2022, 37 : 1143 - 1164
  • [3] A Framework for Accelerating Graph Convolutional Networks on Massive Datasets
    Li, Xiang
    Jin, Ruoming
    Ramnath, Rajiv
    Agrawal, Gagan
    [J]. COMPUTATIONAL DATA AND SOCIAL NETWORKS, CSONET 2021, 2021, 13116 : 79 - 92
  • [4] A framework for analyzing massive astrophysical datasets on a distributed grid
    Gardner, Jeffrey P.
    Connolly, Andrew
    McBride, Cameron
    [J]. ASTRONOMICAL DATA ANALYSIS SOFTWARE AND SYSTEMS XVI, 2007, 376 : 69 - +
  • [5] A fast, optimal spatial-prediction method for massive datasets
    Tzeng, SL
    Huang, HC
    Cressie, N
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2005, 100 (472) : 1343 - 1357
  • [6] Massive datasets
    Kettenring, Jon R.
    [J]. WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL STATISTICS, 2009, 1 (01): : 25 - 32
  • [7] Massive, accurate empirical datasets: New tools for drug discovery and computational chemistry
    Hodge, C. Nicholas
    [J]. ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2007, 233 : 197 - 197
  • [8] Mining of Massive Datasets
    Richter, Lothar
    [J]. BIOMETRICS, 2018, 74 (04) : 1520 - 1521
  • [9] Modeling massive spatial datasets using a conjugate Bayesian linear modeling framework
    Banerjee, Sudipto
    [J]. SPATIAL STATISTICS, 2020, 37
  • [10] Workshop on Massive Datasets
    Wren, Christopher R.
    Ivanov, Yuri A.
    [J]. ICMI'07: PROCEEDINGS OF THE NINTH INTERNATIONAL CONFERENCE ON MULTIMODAL INTERFACES, 2007, : 385 - 385