Automatically Generating Data Exploration Sessions Using Deep Reinforcement Learning

被引:29
|
作者
Bar El, Ori [1 ]
Milo, Tova [1 ]
Somech, Amit [1 ]
机构
[1] Tel Aviv Univ, Tel Aviv, Israel
基金
以色列科学基金会;
关键词
D O I
10.1145/3318464.3389779
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Exploratory Data Analysis (EDA) is an essential yet highly demanding task. To get a head start before exploring a new dataset, data scientists often prefer to view existing EDA notebooks - illustrative, curated exploratory sessions, on the same dataset, that were created by fellow data scientists who shared them online. Unfortunately, such notebooks are not always available (e.g., if the dataset is new or confidential). To address this, we present ATENA, a system that takes an input dataset and auto-generates a compelling exploratory session, presented in an EDA notebook. We shape EDA into a control problem, and devise a novel Deep Reinforcement Learning (DRL) architecture to effectively optimize the notebook generation. Though ATENA uses a limited set of EDA operations, our experiments show that it generates useful EDA notebooks, allowing users to gain actual insights.
引用
收藏
页码:1527 / 1537
页数:11
相关论文
共 50 条
  • [1] Automatically organizing papers in conference sessions using deep learning and network modeling
    Gundogan, Esra
    Kaya, Mehmet
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (15) : 45345 - 45365
  • [2] Automatically organizing papers in conference sessions using deep learning and network modeling
    Esra Gündoğan
    Mehmet Kaya
    [J]. Multimedia Tools and Applications, 2024, 83 : 45345 - 45365
  • [3] 3D object recognition using deep learning for automatically generating semantic BIM data
    Rogage, Kay
    Doukari, Omar
    [J]. AUTOMATION IN CONSTRUCTION, 2024, 162
  • [4] Exploration of Unknown Environments Using Deep Reinforcement Learning
    McCalmon, Joseph
    [J]. THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 15970 - 15971
  • [5] Exploration in deep reinforcement learning: A survey
    Ladosz, Pawel
    Weng, Lilian
    Kim, Minwoo
    Oh, Hyondong
    [J]. INFORMATION FUSION, 2022, 85 : 1 - 22
  • [6] ATENA: An Autonomous System for Data Exploration Based on Deep Reinforcement Learning
    Bar, El Ori
    Milo, Tova
    Somech, Amit
    [J]. PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT (CIKM '19), 2019, : 2873 - 2876
  • [7] Generating Test Input with Deep Reinforcement Learning
    Kim, Junhwi
    Kwon, Minhyuk
    Yoo, Shin
    [J]. PROCEEDINGS 2018 IEEE/ACM 11TH INTERNATIONAL WORKSHOP ON SEARCH-BASED SOFTWARE TESTING (SBST), 2018, : 51 - 58
  • [8] Automatically Generating Dockerfiles via Deep Learning: Challenges and Promises
    Rosa, Giovanni
    Mastropaolo, Antonio
    Scalabrino, Simone
    Bavota, Gabriele
    Oliveto, Rocco
    [J]. 2023 IEEE/ACM INTERNATIONAL CONFERENCE ON SOFTWARE AND SYSTEM PROCESSES, ICSSP, 2023, : 1 - 12
  • [9] Causal Deep Reinforcement Learning Using Observational Data
    Zhu, Wenxuan
    Yu, Chao
    Zhang, Qiang
    [J]. PROCEEDINGS OF THE THIRTY-SECOND INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2023, 2023, : 4711 - 4719
  • [10] Representations and Exploration for Deep Reinforcement Learning using Singular Value Decomposition
    Chandak, Yash
    Thakoor, Shantanu
    Guo, Zhaohan Daniel
    Tang, Yunhao
    Munos, Remi
    Dabney, Will
    Borsa, Diana
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 202, 2023, 202