Running the SkyMapper Science Data Pipeline: To be a Big Fish in a Small Pond, or a Small Fish in a Big Ocean?

被引:0
|
作者
Luvaul, Lance C. [1 ]
Onken, Christopher A. [1 ]
Wolf, Christian [1 ]
Smillie, Jonathan G. [2 ]
Sebo, Kim M. [1 ]
机构
[1] Australian Natl Univ, Res Sch Astron & Astrophys, Canberra, ACT 2611, Australia
[2] Australian Natl Univ, Natl Comp Infrastruct, Canberra, ACT 2601, Australia
关键词
D O I
暂无
中图分类号
P1 [天文学];
学科分类号
0704 ;
摘要
We review structure and frameworks behind the SkyMapper Science Data Pipeline (SDP), and consider the challenges of deploying on two disparate platforms: a publicly shared, massively parallel, queue-scheduled compute fabric, and a dedicated NUMA-based, multi-core, mini-supercomputer. Concepts reviewed include a) how to impose a layer of central operator control over hundreds of jobs of varying type and CPU/IO profile, all running concurrently and at different stages in their logic, b) how to maintain configuration control in an ever-changing algorithmic environment while not giving up ease of build and deployment, and c) how to configure a NUMA-architected machine for optimal cache buffer usage, process-to-memory locality, and user/system CPU cycle ratio.
引用
收藏
页码:393 / 397
页数:5
相关论文
共 50 条