SAIR: significance-aware approach to improve QoR of big data processing in case of budget constraint

被引：0

作者：

Hossein Ahmadvand

Maziar Goudarzi

机构：

[1] Sharif University of Technology,Department of Computer Engineering

来源：

The Journal of Supercomputing | 2019年 / 75卷

关键词：

Big data; Significance; Quality of Result; Data variety; Budget constraint;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

Nowadays, a wide range of enterprises are faced with big data processing in different domains such as transaction operations, business calculations and analytical computations. Large-scale computing is an approach for big data processing. Due to the cost of large-scale computing and limitations of enterprise budgets, it is hardly possible to process all the input data and therefore the Quality of Result (QoR) may be affected. SAIR is an approach to improve QoR of big data processing for aggregative usages based on significance variety when there is a budget constraint. In this paper, the most significant data portions have been assigned to the most efficient resources in terms of time and cost. If the budget is still available, other data portions have been assigned to remaining resources. In this approach, statistical methods and a sampling technique with a 95% of the confidence interval and 5% of error margin are used to identify the most and least significant data portions. By using this method, the users are able to improve QoR with respect to budget constraint and preferred finishing time. In the evaluation phase, applications from different domains such as document and text, transaction data and system logs are used. Our results indicate that SAIR improves QoR while meeting budget constraint for considered usages. This approach improves the QoR up to 15%, compared with the state of the art.

引用

下载

页码：5760 / 5781

页数：21

共 5 条

[1] SAIR: significance-aware approach to improve QoR of big data processing in case of budget constraint
Ahmadvand, Hossein
Goudarzi, Maziar
JOURNAL OF SUPERCOMPUTING, 2019, 75 (09): : 5760 - 5781
[2] QoR-Aware Power Capping for Approximate Big Data Processing
Nabavinejad, Seyed Morteza
Zhan, Xin
Azimi, Reza
Goudarzi, Maziar
Reda, Sherief
PROCEEDINGS OF THE 2018 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION (DATE), 2018, : 253 - 256
[3] SMBSP: A Self-Tuning Approach using Machine Learning to Improve Performance of Spark in Big Data Processing
Rahman, Md. Armanur
Hossen, J.
Venkataseshaiah, C.
PROCEEDINGS OF THE 2018 7TH INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATION ENGINEERING (ICCCE), 2018, : 274 - 279
[4] A Heuristic Approach to Improve the Data Processing in Big Data using Enhanced Salp Swarm Algorithm (ESSA) and MK-means Algorithm
Sundarakumar, M. R.
Nayagi, D. Salangai
Vinodhini, V.
VinayagaPriya, S.
Marimuthu, M.
Basheer, Shajahan
Santhakumar, D.
Renoald, A. Johny
JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2023, 45 (02) : 2625 - 2640
[5] A Novel Statistic-Based Corpus Machine Processing Approach to Refine a Big Textual Data: An ESP Case of COVID-19 News Reports
Chen, Liang-Ching
Chang, Kuei-Hu
Chung, Hsiang-Yu
APPLIED SCIENCES-BASEL, 2020, 10 (16):

← 1 →