The Hopsworks Feature Store for Machine Learning

Authors
Martinez, Javier de la Rua [1, 2]
Buso, Fabio [1]
Kouzoupis, Antonios [1]
Ormenisan, Alexandru A. [1]
Niazi, Salman [1]
Bzhalava, Davit [1]
Mak, Kenneth [1]
Jouffrey, Victor [1]
Ronstrom, Mikael [1]
Cunningham, Raymond [1]
Zangis, Ralfs [1]
Mukhedkar, Dhananjay [1]
Khazanchi, Ayushman [2]
Vlassov, Vladimir [2]
Dowling, Jim [1, 2]
Affiliations
[1] Hopsworks AB, Stockholm, Sweden
[2] KTH Royal Inst Technol, Stockholm, Sweden
Funding
EU Horizon 2020
Keywords
Feature Store; MLOps; RonDB; Arrow Flight; DuckDB
DOI
10.1145/3626246.3653389
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Data management is the most challenging aspect of building Machine Learning (ML) systems. ML systems can read large volumes of historical data when training models, but inference workloads are more varied, depending on whether the system is a batch or online ML system. The feature store for ML has recently emerged as a single data platform for managing ML data throughout the ML lifecycle, from feature engineering to model training to inference. In this paper, we present the Hopsworks feature store for machine learning as a highly available platform for managing feature data with API support for columnar, row-oriented, and similarity search query workloads. We introduce and address challenges solved by feature stores related to feature reuse, how to organize data transformations, and how to ensure correct and consistent data between feature engineering, model training, and model inference. We present the engineering challenges in building high-performance query services for a feature store and show how Hopsworks outperforms existing cloud feature stores for training and online inference query workloads.
Pages: 135-147
Page count: 13