SEEDB: Efficient Data-Driven Visualization Recommendations to Support Visual Analytics

被引:152
|
作者
Vartak, Manasi [1 ]
Rahman, Sajjadur
Madden, Samuel [1 ]
Parameswaran, Aditya [2 ]
Polyzotis, Neoklis [3 ]
机构
[1] MIT, Cambridge, MA 02139 USA
[2] Univ Illinois UIUC, Champaign, IL USA
[3] Google, Mountain View, CA 94043 USA
来源
PROCEEDINGS OF THE VLDB ENDOWMENT | 2015年 / 8卷 / 13期
基金
美国国家科学基金会;
关键词
D O I
10.14778/2831360.2831371
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Data analysts often build visualizations as the first step in their analytical workflow. However, when working with high-dimensional datasets, identifying visualizations that show relevant or desired trends in data can be laborious. We propose SEEDB, a visualization recommendation engine to facilitate fast visual analysis: given a subset of data to be studied, SEEDB intelligently explores the space of visualizations, evaluates promising visualizations for trends, and recommends those it deems most "useful" or "interesting". The two major obstacles in recommending interesting visualizations are (a) scale: evaluating a large number of candidate visualizations while responding within interactive time scales, and (b) utility: identifying an appropriate metric for assessing interestingness of visualizations. For the former, SEEDB introduces pruning optimizations to quickly identify high-utility visualizations and sharing optimizations to maximize sharing of computation across visualizations. For the latter, as a first step, we adopt a deviation-based metric for visualization utility, while indicating how we may be able to generalize it to other factors influencing utility. We implement SEEDB as a middleware layer that can run on top of any DBMS. Our experiments show that our framework can identify interesting visualizations with high accuracy. Our optimizations lead to multiple orders of magnitude speedup on relational row and column stores and provide recommendations at interactive time scales. Finally, we demonstrate via a user study the effectiveness of our deviation-based utility metric and the value of recommendations in supporting visual analytics.
引用
收藏
页码:2182 / 2193
页数:12
相关论文
共 50 条
  • [1] A Data-Driven Platform for the Coordination of Independent Visual Analytics Tools
    Nonnemann, Lars
    Hogräfer, Marius
    Röhlig, Martin
    Schumann, Heidrun
    Urban, Bodo
    Schulz, Hans-Jörg
    [J]. Computers and Graphics (Pergamon), 2022, 106 : 152 - 160
  • [2] InsideInsights: Integrating Data-Driven Reporting in Collaborative Visual Analytics
    Mathisen, A.
    Horak, T.
    Klokmose, C. N.
    Grdnbaek, K.
    Elmqvist, N.
    [J]. COMPUTER GRAPHICS FORUM, 2019, 38 (03) : 649 - 661
  • [3] Visual Analytics for Data-Driven Understanding of the Substance Use Disorder Epidemic
    Qaurooni, Danial
    Herr, Bruce W.
    Zappone, Sarah Renee
    Wojciechowska, Klaudia
    Boerner, Katy
    Schleyer, Titus
    [J]. INQUIRY-THE JOURNAL OF HEALTH CARE ORGANIZATION PROVISION AND FINANCING, 2024, 61
  • [4] Data-Driven Intelligence on Innovation and Competition: Patent Overlay Network Visualization and Analytics
    Sarica, Serhad
    Yan, Bowen
    Luo, Jianxi
    [J]. INFORMATION SYSTEMS MANAGEMENT, 2020, 37 (03) : 198 - 212
  • [5] Predicting the visualization intensity for interactive spatio-temporal visual analytics: a data-driven view-dependent approach
    Li, Jing
    Zhang, Tong
    Liu, Qing
    Yu, Manzhu
    [J]. INTERNATIONAL JOURNAL OF GEOGRAPHICAL INFORMATION SCIENCE, 2017, 31 (01) : 168 - 189
  • [6] WHIZ: Data-Driven Analytics Execution
    Grandl, Robert
    Singhvi, Arjun
    Viswanathan, Raajay
    Akella, Aditya
    [J]. PROCEEDINGS OF THE 18TH USENIX SYMPOSIUM ON NETWORKED SYSTEM DESIGN AND IMPLEMENTATION, 2021, : 407 - 424
  • [7] Data-driven Digital Therapeutics Analytics
    Lee, Uichin
    Jung, Gyuwon
    Park, Sangjun
    Ma, Eun-Yeol
    Kim, Heeyoung
    Lee, Yonggeon
    Noh, Youngtae
    [J]. 2023 IEEE INTERNATIONAL CONFERENCE ON BIG DATA AND SMART COMPUTING, BIGCOMP, 2023, : 386 - 388
  • [8] IMPROVE QUALITY WITH DATA-DRIVEN ANALYTICS
    HAHN, GJ
    [J]. QUALITY PROGRESS, 1993, 26 (10) : 83 - 86
  • [9] Towards Trust-Augmented Visual Analytics for Data-Driven Energy Modeling
    Kandakatla, Akshith Reddy
    Chandan, Vikas
    Kundu, Soumya
    Chakraborty, Indrasis
    Cook, Kristin
    Dasgupta, Aritra
    [J]. 2020 IEEE WORKSHOP ON TRUST AND EXPERTISE IN VISUAL ANALYTICS (TREX 2020), 2020, : 16 - 21
  • [10] VESPa 2.0: Data-Driven Behavior Models for Visual Analytics of Movement Sequences
    Krueger, Robert
    Tremel, Tina
    Thom, Dennis
    [J]. 2017 INTERNATIONAL SYMPOSIUM ON BIG DATA VISUAL ANALYTICS (BDVA), 2017, : 1 - 8