Medical data lake query assistance

被引:0
|
作者
Abdelhedi, Fatma [1 ]
Jemmali, Rym [1 ,2 ]
Zurfluh, Gilles [2 ]
机构
[1] Trimane, CBI2, Paris, France
[2] Toulouse Univ, CNRS, UMR 5505, IRIT, Toulouse, France
关键词
Data Lake; Data Warehouse; Data Mart; NoSQL; Big Data; recommendation system; collaborative filtering;
D O I
10.1109/AICCSA59173.2023.10479336
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In today's world, there is a growing need to analyze data stored in a Data Lake, which is a collection of large, heterogeneous databases. Our work is part of a medical application that aims to help healthcare professionals analyze complex data for decision-making. We propose mechanisms that promote data accessibility. The data are stored in a Data Warehouse (DW) that is periodically built from a data lake. Depending on the needs of the decision-maker, data are extracted from the DW and transferred to a Data Mart (DM) for querying. In this paper, we present a schema recommendation system based on the principle of collaborative filtering. This system can predict the DM schemas that were developed in the past that best match the data need expressed by a decision-maker. It does this by comparing the attributes present in the schemas with the attributes deduced from the need to propose a list of predictions for the most suitable schemas. The technique used is simple, while allowing us to solve the problem of periodic updates to the source data. An experiment was conducted for a medical application.
引用
收藏
页数:8
相关论文
共 50 条
  • [1] Ontario: Federated Query Processing Against a Semantic Data Lake
    Endris, Kemele M.
    Rohde, Philipp D.
    Vidal, Maria-Esther
    Auer, Soeren
    DATABASE AND EXPERT SYSTEMS APPLICATIONS, PT I, 2019, 11706 : 379 - 395
  • [2] An Effective Framework for Enhancing Query Answering in a Heterogeneous Data Lake
    Yuan, Qin
    Yuan, Ye
    Wen, Zhenyu
    Wang, He
    Tang, Shiyuan
    PROCEEDINGS OF THE 46TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, SIGIR 2023, 2023, : 770 - 780
  • [3] SPARQL Query-Builder for Medical Temporal Data
    Vcelak, Petr
    Kryl, Martin
    Kleckova, Jana
    2018 11TH INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING, BIOMEDICAL ENGINEERING AND INFORMATICS (CISP-BMEI 2018), 2018,
  • [4] Extensible Query Framework for Unstructured Medical Data - A Big Data Approach
    Istephan, Sarmad
    Siadat, Mohammad-Reza
    2015 IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOP (ICDMW), 2015, : 455 - 462
  • [5] A Novel Medical Assistance System Based On Data Mining
    Dai, Jingyuan
    PROCEEDINGS OF 2014 IEEE WORKSHOP ON ADVANCED RESEARCH AND TECHNOLOGY IN INDUSTRY APPLICATIONS (WARTIA), 2014, : 264 - 266
  • [6] Medical Assistance in Dying, Data and Casting Assertions Aside
    Chochinov, Harvey Max
    JOURNAL OF PALLIATIVE MEDICINE, 2023, 26 (01) : 9 - 12
  • [7] Investigating Query Formulation Assistance for Children
    Anuyah, Oghenemaro
    Fails, Jerry Alan
    Pera, Maria Soledad
    PROCEEDINGS OF THE 2018 ACM CONFERENCE ON INTERACTION DESIGN AND CHILDREN (IDC 2018), 2018, : 581 - 586
  • [8] Composite Pattern Query Expression Over Medical Data Streams
    Li, Feifei
    Li, Hongyan
    Qu, Qiang
    PROCEEDINGS OF THE 2009 2ND INTERNATIONAL CONFERENCE ON BIOMEDICAL ENGINEERING AND INFORMATICS, VOLS 1-4, 2009, : 1798 - 1802
  • [9] Manual query modification and data fusion for medical image retrieval
    Jensen, Jeffery R.
    Hersh, William R.
    ACCESSING MULTILINGUAL INFORMATION REPOSITORIES, 2006, 4022 : 673 - 679
  • [10] MEDICAL ASSISTANCE
    BING, K
    NEW REPUBLIC, 1966, 155 (4-5) : 40 - 40