Data reduction in big data: a survey of methods, challenges and future directions

被引:0
|
作者
Khoei, Tala Talaei [1 ]
Singh, Aditi [2 ]
机构
[1] Northeastern Univ, Khoury Coll Comp Sci, Roux Inst, Portland, ME 04101 USA
[2] Cleveland State Univ, Washkewicz Coll Engn, Cleveland, OH USA
关键词
Artificial intelligence; Biometrics; Crime; Detection; Emotions; Facial recognition; Prediction; Policing; CLASSIFICATION; COMPRESSION;
D O I
10.1007/s41060-024-00603-z
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Data reduction plays a pivotal role in managing and analyzing big data, which is characterized by its volume, velocity, variety, veracity, value, variability, and visibility. However, several surveys have been conducted to summarize these techniques in the field of big data, and there are several concerns that require attention, such as limited discussions of reduction techniques. Also, most of these studies focused on applications and only described their techniques. In contrast, this survey provides a comprehensive overview of data reduction methods, challenges, and future directions in the context of big data analytics in general concepts. This survey begins discussing the significance of data reduction in addressing the scalability and complexity issues inherent in big data processing. Subsequently, a classification data reduction method in big data is provided. For each category, the underlying principles, popular algorithms, and applications in big data analytics are highlighted. Moreover, the key challenges associated with data reduction in the era of big data, such as scalability, computational complexity, quality preservation, and interpretability, are found and discussed, while the importance of addressing these challenges to ensure the effectiveness and reliability of data reduction techniques in large-scale data analytics are reviewed. This survey can serve as a comprehensive reference for researchers, practitioners, and stakeholders interested in understanding and using data reduction techniques to address the challenges and opportunities posed by big data. Finally, tangible results of this study can be listed as introducing techniques for improving storage efficiency and faster computational processing by minimizing dataset size, while these techniques can enhance data analysis by removing redundancy and noise, leading to more accurate and actionable insights.
引用
收藏
页数:40
相关论文
共 50 条
  • [1] Big Data Reduction Methods: A Survey
    Rehman, Muhammad Habib ur
    Liew, Chee Sun
    Abbas, Assad
    Jayaraman, Prem Prakash
    Wah, Teh Ying
    Khan, Samee U.
    [J]. DATA SCIENCE AND ENGINEERING, 2016, 1 (04) : 265 - 284
  • [2] Handling big data: research challenges and future directions
    I. Anagnostopoulos
    S. Zeadally
    E. Exposito
    [J]. The Journal of Supercomputing, 2016, 72 : 1494 - 1516
  • [3] Handling big data: research challenges and future directions
    Anagnostopoulos, I.
    Zeadally, S.
    Exposito, E.
    [J]. JOURNAL OF SUPERCOMPUTING, 2016, 72 (04): : 1494 - 1516
  • [4] Orchestrating Big Data Analysis Workflows in the Cloud: Research Challenges, Survey, and Future Directions
    Barika, Mutaz
    Garg, Saurabh
    Zomaya, Albert Y.
    Wang, Lizhe
    Van Moorsel, Aad
    Ranjan, Rajiv
    [J]. ACM COMPUTING SURVEYS, 2019, 52 (05)
  • [5] Evolutionary Computation and Big Data: Key Challenges and Future Directions
    Cheng, Shi
    Liu, Bin
    Shi, Yuhui
    Jin, Yaochu
    Li, Bin
    [J]. DATA MINING AND BIG DATA, DMBD 2016, 2016, 9714 : 3 - 14
  • [6] Utilization of Big Data Analysis in HRM - Challenges and Future Directions
    Al-Ahmadi, Haneen Hassan
    [J]. INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2020, 20 (09): : 36 - 40
  • [7] Challenges and Future Directions of Big Data and Artificial Intelligence in Education
    Luan, Hui
    Geczy, Peter
    Lai, Hollis
    Gobert, Janice
    Yang, Stephen J. H.
    Ogata, Hiroaki
    Baltes, Jacky
    Guerra, Rodrigo
    Li, Ping
    Tsai, Chin-Chung
    [J]. FRONTIERS IN PSYCHOLOGY, 2020, 11
  • [8] A Survey on IoT Big Data: Current Status, 13 V's Challenges, and Future Directions
    Bansal, Maggi
    Chana, Inderveer
    Clarke, Siobhan
    [J]. ACM COMPUTING SURVEYS, 2021, 53 (06)
  • [9] A survey on blockchain for big data: Approaches, opportunities, and future directions
    Deepa, N.
    Pham, Quoc-Viet
    Nguyen, Dinh C.
    Bhattacharya, Sweta
    Prabadevi, B.
    Fang, Fang
    Pathirana, Pubudu N.
    Gadekallu, Thippa Reddy
    Maddikunta, Praveen Kumar Reddy
    [J]. FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2022, 131 : 209 - 226
  • [10] Access methods for Big Data: current status and future directions
    Rashid A.N.M.B.
    [J]. EAI Endorsed Transactions on Scalable Information Systems, 2017, 4 (15) : 1 - 14