The Impact of Distributed Data in Big Data Platforms on Organizations

Cited by: 1
Authors
Koren, Oded [1]
Binyaminov, Matan [1]
Perel, Nir [1]
Affiliations
[1] Shenkar Engn Design Art, Sch Ind Engn & Management, 12 Anne Frank St, Ramat Gan, Israel
Keywords
Big data; Architecture; HDFS; MANAGEMENT; EFFICIENT; SET;
DOI
10.1007/978-3-030-02683-7_76
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Big data is an established platform used worldwide by many organizations for exploring and analyzing business inputs in order to reach better understanding and capabilities. Our research is focused on how organizations' data-accumulating procedures may influence the processing of data in a big data environment. In this paper, we present a use case which examines the impact of data structure, due to big data architecture characteristics (specifically on HDFS), and how it can reflect on business processes and performance. The main contribution of this research is to point out why an organization that uses big data platforms needs to take into consideration the big data storage architecture.
Pages: 1024 - 1036
Page count: 13
Related papers
50 in total
  • [1] Implementation of Data Preprocessing Techniques on Distributed Big Data Platforms
    Celik, Oguz
    Hasanbasoglu, Muruvvet
    Aktas, Mehmet S.
    Kalipsiz, Oya
    Kanli, Alper Nebi
    [J]. 2019 4TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND ENGINEERING (UBMK), 2019: 73 - 78
  • [2] Big Data in organizations: Exploring the adoption of Big Data applications and their impact on organizations in China and the Netherlands
    Raab, Jorg
    Pang, Yuting
    Baaijens, Joan
    Zhou, Honggeng
    [J]. BIG DATA RESEARCH, 2024, 36
  • [3] Data Feature Selection Methods on Distributed Big Data Processing Platforms
    Catalkaya, Mehmet Burak
    Kalipsiz, Oya
    Aktas, Mehmet S.
    Turgut, Umut Orcun
    [J]. 2018 3RD INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND ENGINEERING (UBMK), 2018: 133 - 138
  • [4] A Distributed Decision Tree Algorithm and Its Implementation on Big Data Platforms
    Chen, Jingxiang
    Wang, Tao
    Abbey, Ralph
    Pingenot, Joseph
    [J]. PROCEEDINGS OF 3RD IEEE/ACM INTERNATIONAL CONFERENCE ON DATA SCIENCE AND ADVANCED ANALYTICS (DSAA 2016), 2016: 752 - 761
  • [5] sPCA: Scalable Principal Component Analysis for Big Data on Distributed Platforms
    Elgamal, Tarek
    Yabandeh, Maysam
    Aboulnaga, Ashraf
    Mustafa, Waleed
    Hefeeda, Mohamed
    [J]. SIGMOD'15: PROCEEDINGS OF THE 2015 ACM SIGMOD INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2015: 79 - 91
  • [6] Big data for the comprehensive data analysis of IT organizations
    Madugula, Sujatha
    Pratapagiri, Sreenivas
    Phridviraj, M.S.B.
    Rao, V. Chandra Shekhar
    Polala, Niranjan
    Kumaraswamy, P.
    [J]. Journal of High Technology Management Research, 2023, 34 (02)
  • [7] Big Data and Privacy: Why Public Organizations Adopt Big Data
    Prince, Christopher
    [J]. CANADIAN JOURNAL OF INFORMATION AND LIBRARY SCIENCE-REVUE CANADIENNE DES SCIENCES DE L INFORMATION ET DE BIBLIOTHECONOMIE, 2017, 41 (04): 233 - 244
  • [8] Resilient Distributed Computing Platforms for Big Data Analysis Using Spark and Hadoop
    Chang, Bao Rong
    Tsai, Hsiu-Fen
    Wang, Yo-Ai
    Huang, Chien-Feng
    [J]. PROCEEDINGS OF 2016 INTERNATIONAL CONFERENCE ON APPLIED SYSTEM INNOVATION (ICASI), 2016
  • [9] AN ANALYSIS OF THE IMPACT OF DISTRIBUTED DATA PROCESSING ON ORGANIZATIONS IN THE 1980's
    Davis, Charles K.
    Wetherbe, James C.
    [J]. MIS QUARTERLY, 1979, 3 (04): 47 - 56
  • [10] Big Data and Smart City Platforms
    Oktug, Sema F.
    Yaslan, Yusuf
    [J]. 2017 25TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2017