Scheduling Data on Data-Driven Master/Worker Platform

被引:2
|
作者
Labidi, Mohamed [1 ]
Tang, Bing [2 ]
Fedak, Gilles [3 ]
Khemakhem, Maher [4 ]
Jemni, Mohamed [1 ]
机构
[1] Univ Tunis, LaTICE Lab, Tunis, Tunisia
[2] Hunan Univ Sci & Technol, Xiangtan 411201, Peoples R China
[3] ENS Lyon, LIP, INRIA, F-69364 Lyon, France
[4] Univ Sfax, Miracl Lab Fsegs, Sfax, Tunisia
关键词
Data scheduling; BitDew; Large Scale OCR; Scheduling heuristics;
D O I
10.1109/PDCAT.2012.122
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
With data intensive applications it can be interesting to resort to a distributed storage to reach scalability and avoid data-intensive problems. Storing data permanently on computing nodes can be an interesting approach especially with the frequent use and the large volume of this data. Moreover, processing large data is a computing intensive task which encourages parallel execution. Nevertheless, data placement on computing nodes should be optimal to reach load balancing. In this work, we investigate scheduling heuristics towards the optimization of data distribution on the computing nodes. Motivated by its capacity to control perfectly the common operations associated with data management, we use BitDew: a desktop grid middleware designed for large scale data management. With BitDew, we build a Data-Driven Master/Worker Platform to carry out the distribution of Magick, the OCR application based on Dynamic Time Warping (DTW) algorithm. We evaluate the benefit of the implementation of studied scheduling heuristics to achieve load balancing with both homogeneous and heterogeneous environment. We present experimental results which demonstrate the efficiency of our approach.
引用
收藏
页码:593 / 598
页数:6
相关论文
共 50 条
  • [1] Data-driven appointment scheduling
    Fiems, Dieter
    [J]. PROCEEDINGS OF THE 12TH EAI INTERNATIONAL CONFERENCE ON PERFORMANCE EVALUATION METHODOLOGIES AND TOOLS (VALUETOOLS 2019), 2019, : 3 - 3
  • [2] Data-Driven Batch Scheduling
    Bent, John
    Denehy, Timothy E.
    Livny, Miron
    Arpaci-Dusseau, Andrea C.
    Arpaci-Dusseau, Remzi H.
    [J]. DADC 2009: SECOND INTERNATIONAL WORKSHOP ON DATA AWARE DISTRIBUTED COMPUTING, 2009, : 1 - 10
  • [3] The Data-Driven Workplace and the Case for Worker Technology Rights
    Bernhardt, Annette
    Kresge, Lisa
    Suleiman, Reem
    [J]. ILR REVIEW, 2023, 76 (01) : 3 - 29
  • [4] Welcome to the Library: Data-Driven Student Worker Empowerment
    Carrillo, Elena
    Scoulas, Jung Mi
    [J]. EVIDENCE BASED LIBRARY AND INFORMATION PRACTICE, 2020, 15 (02): : 139 - 143
  • [5] Optimization for data-driven wireless sensor scheduling
    Vasconcelos, Marcos M.
    Mitra, Urbashi
    [J]. CONFERENCE RECORD OF THE 2019 FIFTY-THIRD ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS & COMPUTERS, 2019, : 215 - 219
  • [6] Data-Driven Suboptimal Scheduling of Switched Systems
    Zhang, Chi
    Gan, Minggang
    Zhao, Jingang
    Xue, Chenchen
    [J]. SENSORS, 2020, 20 (05)
  • [7] Data-driven robust flexible personnel scheduling
    Wang, Zilu
    Luo, Zhixing
    Shen, Huaxiao
    [J]. Computers and Operations Research, 2025, 176
  • [8] Data-driven Algorithm for Scheduling with Total Tardiness
    Bouska, Michal
    Novak, Antonin
    Sucha, Premysl
    Modos, Istvan
    Hanzalek, Zdenek
    [J]. PROCEEDINGS OF THE 9TH INTERNATIONAL CONFERENCE ON OPERATIONS RESEARCH AND ENTERPRISE SYSTEMS (ICORES), 2020, : 59 - 68
  • [9] A data-driven scheduling approach to smart manufacturing
    Alejandro Rossit, Daniel
    Tohme, Fernando
    Frutos, Mariano
    [J]. JOURNAL OF INDUSTRIAL INFORMATION INTEGRATION, 2019, 15 : 69 - 79
  • [10] A data-driven method for pipeline scheduling optimization
    Liao, Qi
    Zhang, Haoran
    Xia, Tianqi
    Chen, Quanjun
    Li, Zhengbing
    Liang, Yongtu
    [J]. CHEMICAL ENGINEERING RESEARCH & DESIGN, 2019, 144 : 79 - 94