SAIR: significance-aware approach to improve QoR of big data processing in case of budget constraint

Cited by: 0
Authors
Hossein Ahmadvand
Maziar Goudarzi
Affiliations
[1] Sharif University of Technology, Department of Computer Engineering
Source
Journal of Supercomputing, 2019, 75 (09)
Keywords
Big data; Significance; Quality of Result; Data variety; Budget constraint
DOI
Not available
Abstract
Enterprises in many domains now face big data processing workloads such as transaction operations, business calculations, and analytical computations. Large-scale computing is a common way to handle these workloads, but because of its cost and the limits of enterprise budgets, it is often infeasible to process all of the input data, and the Quality of Result (QoR) may suffer as a result. SAIR is an approach that improves the QoR of big data processing for aggregative usages under a budget constraint by exploiting the variety in significance across the data. The most significant data portions are assigned to the most efficient resources in terms of time and cost; if budget remains, the other data portions are assigned to the remaining resources. Statistical methods and a sampling technique with a 95% confidence level and a 5% margin of error are used to identify the most and least significant data portions. With this method, users can improve QoR while respecting the budget constraint and their preferred finishing time. The evaluation uses applications from different domains, including documents and text, transaction data, and system logs. The results indicate that SAIR improves QoR while meeting the budget constraint for the considered usages, by up to 15% compared with the state of the art.
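The two ideas in the abstract, sampling at a 95% confidence level with a 5% margin of error and assigning the most significant portions to the most efficient resources first, can be illustrated with a minimal sketch. This is not the authors' implementation. The sample size uses Cochran's formula with a finite population correction, which is one common choice and an assumption here. The names `Portion`, `Resource`, `assign_portions`, the per-GB cost model, and the significance scores are all hypothetical.

```python
import math
from dataclasses import dataclass


def sample_size(population, z=1.96, margin=0.05, p=0.5):
    """Cochran's sample size with finite population correction.

    z=1.96 corresponds to a 95% confidence level; margin=0.05 is the
    5% margin of error; p=0.5 is the most conservative proportion.
    """
    n0 = (z ** 2) * p * (1 - p) / (margin ** 2)
    return math.ceil(n0 / (1 + (n0 - 1) / population))


@dataclass
class Portion:
    name: str
    significance: float  # hypothetical score estimated from a sample
    size_gb: float


@dataclass
class Resource:
    name: str
    cost_per_gb: float  # hypothetical cost model: lower means more efficient
    capacity_gb: float


def assign_portions(portions, resources, budget):
    """Greedy sketch: most significant portions go to the cheapest
    resource that still has capacity, as long as the budget allows."""
    plan = []
    remaining = budget
    free = {r.name: r.capacity_gb for r in resources}
    by_cost = sorted(resources, key=lambda r: r.cost_per_gb)
    for p in sorted(portions, key=lambda p: p.significance, reverse=True):
        for r in by_cost:
            cost = p.size_gb * r.cost_per_gb
            if free[r.name] >= p.size_gb and cost <= remaining:
                plan.append((p.name, r.name, cost))
                free[r.name] -= p.size_gb
                remaining -= cost
                break  # portion placed; move to the next one
    return plan, remaining


if __name__ == "__main__":
    print(sample_size(1_000_000))
    portions = [Portion("A", 0.9, 10), Portion("B", 0.5, 10), Portion("C", 0.1, 10)]
    resources = [Resource("fast", 1.0, 10), Resource("slow", 2.0, 20)]
    print(assign_portions(portions, resources, budget=35))
```

In this toy run the least significant portion is left unprocessed once the budget runs out, which is the trade-off the abstract describes: QoR is protected by spending the limited budget on the portions that matter most.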
Pages: 5760-5781
Page count: 21
Related papers
5 items in total
  • [1] SAIR: significance-aware approach to improve QoR of big data processing in case of budget constraint
    Ahmadvand, Hossein
    Goudarzi, Maziar
    JOURNAL OF SUPERCOMPUTING, 2019, 75 (09): 5760 - 5781
  • [2] QoR-Aware Power Capping for Approximate Big Data Processing
    Nabavinejad, Seyed Morteza
    Zhan, Xin
    Azimi, Reza
    Goudarzi, Maziar
    Reda, Sherief
    PROCEEDINGS OF THE 2018 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION (DATE), 2018: 253 - 256
  • [3] SMBSP: A Self-Tuning Approach using Machine Learning to Improve Performance of Spark in Big Data Processing
    Rahman, Md. Armanur
    Hossen, J.
    Venkataseshaiah, C.
    PROCEEDINGS OF THE 2018 7TH INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATION ENGINEERING (ICCCE), 2018: 274 - 279
  • [4] A Heuristic Approach to Improve the Data Processing in Big Data using Enhanced Salp Swarm Algorithm (ESSA) and MK-means Algorithm
    Sundarakumar, M. R.
    Nayagi, D. Salangai
    Vinodhini, V.
    VinayagaPriya, S.
    Marimuthu, M.
    Basheer, Shajahan
    Santhakumar, D.
    Renoald, A. Johny
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2023, 45 (02): 2625 - 2640
  • [5] A Novel Statistic-Based Corpus Machine Processing Approach to Refine a Big Textual Data: An ESP Case of COVID-19 News Reports
    Chen, Liang-Ching
    Chang, Kuei-Hu
    Chung, Hsiang-Yu
    APPLIED SCIENCES-BASEL, 2020, 10 (16)