Building analytical platform with Big Data solutions for log files of PanDA infrastructure

被引:3
|
作者
Alekseev, A. A. [1 ]
Megino, F. G. Barreiro [2 ]
Klimentov, A. A. [3 ]
Korchuganova, T. A. [1 ]
Maendo, T. [3 ]
Padolski, S. V. [3 ]
机构
[1] Tomsk Polytech Univ, 30 Lenina Ave, Tomsk 634050, Russia
[2] Univ Texas Arlington, 701 South Nedderman Dr, Arlington, TX 76019 USA
[3] Brookhaven Natl Lab, POB 5000, Upton, NY 11973 USA
基金
俄罗斯科学基金会;
关键词
D O I
10.1088/1742-6596/1015/3/032003
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The paper describes the implementation of a high-performance system for the processing and analysis of log files for the PanDA infrastructure of the ATLAS experiment at the Large Hadron Collider (LHC), responsible for the workload management of order of 2M daily jobs across the Worldwide LHC Computing Grid. The solution is based on the ELK technology stack, which includes several components: Filebeat, Logstash, ElasticSearch (ES), and Kibana. Filebeat is used to collect data from logs. Logstash processes data and export to Elasticsearch. ES are responsible for.entralized data storage. Accumulated data in ES can be viewed using a special software Kibana. These components were integrated with the PanDA infrastructure and replaced previous log processing systems for increased scalability and usability. The authors will describe all the components and their configuration tuning for the current tasks, the scale of the actual system and give several real-life examples of how this centralized log processing and storage service is used to showcase the advantages for daily operations.
引用
收藏
页数:6
相关论文
共 36 条
  • [21] The Design of "Smart Party Building" Platform in Colleges and Universities Based on Big Data Environment
    Shan, Hui
    Zhang, Hongshen
    2020 INTERNATIONAL CONFERENCE ON BIG DATA & ARTIFICIAL INTELLIGENCE & SOFTWARE ENGINEERING (ICBASE 2020), 2020, : 31 - 34
  • [22] IoT platform and infrastructure for data-driven optimization and control of building energy system operation
    Bruemmendorf, Erik
    Ziegeldorf, Jan Henrik
    Fuetterer, Johannes Peter
    CLIMATE RESILIENT CITIES - ENERGY EFFICIENCY & RENEWABLES IN THE DIGITAL ERA (CISBAT 2019), 2019, 1343
  • [23] TemPredict: A Big Data Analytical Platform for Scalable Exploration and Monitoring of Personalized Multimodal Data for COVID-19
    Purawat, Shweta
    Dasgupta, Subhasis
    Song, Jining
    Davis, Shakti
    Claypool, Kajal T.
    Chandra, Sandeep
    Mason, Ashley
    Viswanath, Varun
    Klein, Amit
    Kasl, Patrick
    Wen, YingJing
    Smarr, Benjamin
    Gupta, Amarnath
    Altintas, Ilkay
    2021 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2021, : 4411 - 4420
  • [24] Access efficiency of small sized files in Big Data using various Techniques on Hadoop Distributed File System platform
    Alange, Neeta
    Mathur, Anjali
    INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2021, 21 (07): : 359 - 364
  • [25] Bioinformatics solutions for big data analysis in life sciences presented by the German network for bioinformatics infrastructure
    Puehler, A.
    JOURNAL OF BIOTECHNOLOGY, 2017, 261 : 1 - 1
  • [26] HiPerData: An Autonomous Large-Scale Model Building and Management Platform for Big Data Analytics
    Duan, Rubing
    Goh, Rick Siow Mong
    Yang, Feng
    Di Shang, Richard
    Liu, Yong
    Li, Zengxiang
    Wang, Long
    Lu, Sifei
    Yang, Xulei
    Qin, Zheng
    PROCEEDINGS OF THE 2015 10TH IEEE CONFERENCE ON INDUSTRIAL ELECTRONICS AND APPLICATIONS, 2015, : 449 - 454
  • [27] Evaluating the severity of building fires with the analytical hierarchy process, big data analysis, and remote sensing
    Ching-An Lee
    Yu-Chi Sung
    Yuan-Shang Lin
    Gary Li-Kai Hsiao
    Natural Hazards, 2020, 103 : 1843 - 1856
  • [28] Evaluating the severity of building fires with the analytical hierarchy process, big data analysis, and remote sensing
    Lee, Ching-An
    Sung, Yu-Chi
    Lin, Yuan-Shang
    Hsiao, Gary Li-Kai
    NATURAL HAZARDS, 2020, 103 (02) : 1843 - 1856
  • [29] Building Platform Application Big Sensor Data for e-Health Wireless Body Area Network
    Al Rasyid, M. Udin Harun
    Yuwono, Wiratmoko
    Al Muharom, Syamsudin
    Alasiry, Ali Husein
    2016 INTERNATIONAL ELECTRONICS SYMPOSIUM (IES), 2016, : 409 - 413
  • [30] Infrastructure Optimizing through a Big Data Clustering Algorithm-Based Model for Universities' English Online Learning Platform
    Zhenzhen Y.
    Computer-Aided Design and Applications, 2023, 20 (S15): : 236 - 249