Comparative Evaluation of Big-Data Systems on Scientific Image Analytics Workloads

被引:29
|
作者
Mehta, Parmita [1 ]
Dorkenwald, Sven [1 ]
Zhao, Dongfang [1 ]
Kaftan, Tomer [1 ]
Cheung, Alvin [1 ]
Balazinska, Magdalena [1 ]
Rokem, Ariel [1 ]
Connolly, Andrew [1 ]
Vanderplas, Jacob [1 ]
AlSayyad, Yusra [1 ]
机构
[1] Univ Washington, Seattle, WA 98195 USA
来源
PROCEEDINGS OF THE VLDB ENDOWMENT | 2017年 / 10卷 / 11期
基金
美国国家科学基金会;
关键词
D O I
10.14778/3137628.3137634
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Scientific discoveries are increasingly driven by analyzing large volumes of image data. Many new libraries and specialized database management systems (DBMSs) have emerged to support such tasks. It is unclear how well these systems support real-world image analysis use cases, and how performant the image analytics tasks implemented on top of such systems are. In this paper, we present the first comprehensive evaluation of large-scale image analysis systems using two real-world scientific image data processing use cases. We evaluate five representative systems (SciDB, Myria, Spark, Dask, and TensorFlow) and find that each of them has shortcomings that complicate implementation or hurt performance. Such shortcomings lead to new research opportunities in making large-scale image analysis both efficient and easy to use.
引用
收藏
页码:1226 / 1237
页数:12
相关论文
共 50 条
  • [41] Big Data Analytics in Healthcare Systems
    Wang, Lidong
    Alexander, Cheryl Ann
    [J]. INTERNATIONAL JOURNAL OF MATHEMATICAL ENGINEERING AND MANAGEMENT SCIENCES, 2019, 4 (01) : 17 - 26
  • [42] Adopt Big-Data Analytics to Explore and Exploit the New Value for Service Innovation
    Thuethongchai, Nopsaran
    Taiphapoon, Tatri
    Chandrachai, Achara
    Triukose, Sipat
    [J]. SOCIAL SCIENCES-BASEL, 2020, 9 (03):
  • [43] Big-data analytics framework for incorporating smallholders in sustainable palm oil production
    Shukla, Manish
    Tiwari, Manoj Kumar
    [J]. PRODUCTION PLANNING & CONTROL, 2017, 28 (16) : 1365 - 1377
  • [44] ARI CAROLINE THIS BIG-DATA GURU MINES ANALYTICS TO HELP CANCER PATIENTS
    Nordrum, Amy
    [J]. IEEE SPECTRUM, 2016, 53 (05) : 23 - 23
  • [45] Characterizing big data analytics workloads on POWER8 SMT processors
    贾禛
    Zhan Jianfeng
    Wang Lei
    Zhang Lixin
    [J]. High Technology Letters, 2017, 23 (03) : 245 - 251
  • [46] On-Line Big-Data Processing for Visual Analytics with Argus-Panoptes
    Vlantis, Panayiotis, I
    Delis, Alex
    [J]. ALGORITHMIC ASPECTS OF CLOUD COMPUTING (ALGOCLOUD 2018), 2019, 11409 : 102 - 117
  • [47] Development of a Semi-Synthetic Dataset as a Testbed for Big-Data Semantic Analytics
    Techentin, Robert
    Foti, Daniel
    Li, Peter
    Daniel, Erik
    Gilbert, Barry
    Holmes, David
    Al-Saffar, Sinan
    [J]. 2014 IEEE INTERNATIONAL CONFERENCE ON SEMANTIC COMPUTING (ICSC), 2014, : 252 - +
  • [48] A Secure and Intelligent Framework for Vehicle Health Monitoring Exploiting Big-Data Analytics
    Rahman, Md Arafatur
    Rahim, Md Abdur
    Rahman, Md Mustafizur
    Moustafa, Nour
    Razzak, Imran
    Ahmad, Tanvir
    Patwary, Mohammad N.
    [J]. IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (10) : 19727 - 19742
  • [49] Cross-Platform Aviation Analytics Using Big-Data Integration Methods
    Larsen, Tulinda
    [J]. 2013 INTEGRATED COMMUNICATIONS, NAVIGATION AND SURVEILLANCE CONFERENCE (ICNS), 2013,
  • [50] Editorial: Big scientific data analytics on HPC and cloud
    Wang, Jianwu
    Yin, Junqi
    Nguyen, Mai H.
    Wang, Jingbo
    Xu, Weijia
    [J]. FRONTIERS IN BIG DATA, 2024, 7