An empirical framework for event prediction in massive datasets

被引:0
|
作者
Rajita, B. S. A. S. [1 ]
Soni, Samarth [1 ]
Kumari, Deepa [1 ]
Panda, Subhrakanta [1 ]
机构
[1] BITS Pilani Hyderabad Campus, CSIS Dept, Hyderabad, India
关键词
Social networks (SN); Evolution; Machine learning model; Event prediction; Computer science digital bibliography & library project (DBLP); NETWORK; CLASSIFICATION; SELECTION; EVOLUTION; MACHINE;
D O I
10.1007/s13198-024-02302-1
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Certain events always trigger evolutionary changes in temporal Social Networks (SNs) communities. Machine Learning models make predictions for such events. The performance of these ML models largely depends on the dataset's features. Existing literature shows that the community features of the datasets have helped ML models predict the events with some accuracy. However, a temporal dataset has temporal and community features owing to its evolving structures. These temporal features also aid in improving the performance of the ML models. Thus, this work aims to compare the effectiveness of temporal and community features in improving the accuracy of ML models. This paper proposes a framework to extract the detected communities' community- and temporal- features in temporal data. This research also analyses ML models suitable for predicting events based on features and compares their performance. The experimental research shows that adding temporal features improves the prediction accuracy from 79.51 to 81.47% and saves 59.37% of the computational time of ML models.
引用
收藏
页码:2880 / 2901
页数:22
相关论文
共 50 条
  • [31] Distributed Subtrajectory Join on Massive Datasets
    Tampakis, Panagiotis
    Doulkeridis, Christos
    Pelekis, Nikos
    Theodoridis, Yannis
    [J]. ACM TRANSACTIONS ON SPATIAL ALGORITHMS AND SYSTEMS, 2020, 6 (02)
  • [32] Imbalanced Learning in Massive Phishing Datasets
    Azari, Ali
    Namayanja, Josephine M.
    Kaur, Navneet
    Misal, Vasundhara
    Shukla, Suraksha
    [J]. 2020 IEEE 6TH INT CONFERENCE ON BIG DATA SECURITY ON CLOUD (BIGDATASECURITY) / 6TH IEEE INT CONFERENCE ON HIGH PERFORMANCE AND SMART COMPUTING, (HPSC) / 5TH IEEE INT CONFERENCE ON INTELLIGENT DATA AND SECURITY (IDS), 2020, : 127 - 132
  • [33] Robust clustering of massive tractography datasets
    Guevara, P.
    Poupon, C.
    Riviere, D.
    Cointepas, Y.
    Descoteaux, M.
    Thirion, B.
    Mangin, J. -F.
    [J]. NEUROIMAGE, 2011, 54 (03) : 1975 - 1993
  • [34] Approximate Search on Massive Spatiotemporal Datasets
    Brugere, Ivan
    Steinhaeuser, Karsten
    Boriah, Shyam
    Kumar, Vipin
    [J]. 12TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS (ICDMW 2012), 2012, : 773 - 780
  • [35] Scalable Affinity Propagation for Massive Datasets
    Shiokawa, Hiroaki
    [J]. THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 9639 - 9646
  • [36] Composite quantile regression for massive datasets
    Jiang, Rong
    Hu, Xueping
    Yu, Keming
    Qian, Weimin
    [J]. STATISTICS, 2018, 52 (05) : 980 - 1004
  • [37] Data mining of massive datasets in healthcare
    Goodall, CR
    [J]. JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 1999, 8 (03) : 620 - 634
  • [38] Adaptive quantile regressions for massive datasets
    Jiang, Rong
    Chen, Wei-wei
    Liu, Xin
    [J]. STATISTICAL PAPERS, 2021, 62 (04) : 1981 - 1995
  • [39] Introduction to the special section on massive datasets
    Buja, A
    Keller-McNulty, S
    [J]. JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 1999, 8 (03) : 544 - 544
  • [40] Structural inferences from massive datasets
    Yip, K
    [J]. IJCAI-97 - PROCEEDINGS OF THE FIFTEENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOLS 1 AND 2, 1997, : 534 - 539