How data scientists use computational notebooks for real-time collaboration

被引:65
|
作者
Wang A.Y. [1 ]
Mittal A. [1 ]
Brooks C. [1 ]
Oney S. [1 ]
机构
[1] University of Michigan, 105 S State St, Ann Arbor, 48103, MI
基金
美国国家科学基金会;
关键词
Collaborative systems; Computational notebooks; Data science;
D O I
10.1145/3359141
中图分类号
学科分类号
摘要
Effective collaboration in data science can leverage domain expertise from each team member and thus improve the quality and efficiency of the work. Computational notebooks give data scientists a convenient interactive solution for sharing and keeping track of the data exploration process through a combination of code, narrative text, visualizations, and other rich media. In this paper, we report how synchronous editing in computational notebooks changes the way data scientists work together compared to working on individual notebooks. We first conducted a formative survey with 195 data scientists to understand their past experience with collaboration in the context of data science. Next, we carried out an observational study of 24 data scientists working in pairs remotely to solve a typical data science predictive modeling problem, working on either notebooks supported by synchronous groupware or individual notebooks in a collaborative setting. The study showed that working on the synchronous notebooks improves collaboration by creating a shared context, encouraging more exploration, and reducing communication costs. However, the current synchronous editing features may lead to unbalanced participation and activity interference without strategic coordination. The synchronous notebooks may also amplify the tension between quick exploration and clear explanations. Building on these findings, we propose several design implications aimed at better supporting collaborative editing in computational notebooks, and thus improving efficiency in teamwork among data scientists. © 2019 Association for Computing Machinery.
引用
收藏
相关论文
共 50 条
  • [1] "Don't Step on My Toes": Resolving Editing Conflicts in Real-Time Collaboration in Computational Notebooks
    Wang, April Yi
    Wu, Zihan
    Brooks, Christopher
    Oney, Steve
    PROCEEDINGS OF THE 2024 FIRST IDE WORKSHOP, IDE 2024, 2024, : 47 - 52
  • [2] Real-time collaboration with flair
    Stowell, C
    COMMUNICATIONS NEWS, 2005, 42 (03): : 40 - +
  • [3] HOW REAL-TIME IS YOUR REAL-TIME COMPUTER
    TINHAM, B
    CONTROL AND INSTRUMENTATION, 1986, 18 (11): : 53 - &
  • [4] Collaboration delivers real-time control
    不详
    COMPUTING AND CONTROL ENGINEERING, 2006, 17 (06): : 8 - 8
  • [5] Improving real-time collaboration with highlighting
    Shen, HF
    Sun, CZ
    FUTURE GENERATION COMPUTER SYSTEMS, 2004, 20 (04) : 605 - 625
  • [6] Online real-time drilling collaboration
    不详
    JOURNAL OF PETROLEUM TECHNOLOGY, 2002, 54 (11): : 57 - 58
  • [7] How 'real time' are real-time systems?
    Gallimore, Heath
    Electronic Products (Garden City, New York), 2011, 53 (09):
  • [8] Online real-time drilling collaboration
    Kaminiski, Dean H.
    Pellerin, Nicholas M.
    Williams, Jevon H.
    JPT, Journal of Petroleum Technology, 2002, 54 (11): : 57 - 58
  • [9] Real-time flood forecasting with the use of inadequate data
    Nalbantis, I
    HYDROLOGICAL SCIENCES JOURNAL-JOURNAL DES SCIENCES HYDROLOGIQUES, 2000, 45 (02): : 269 - 284
  • [10] Real-time data and analysis of the use of office space
    Counsell, John
    Puybaraud, Marie-Cecile
    11TH INTERNATIONAL CONFERENCE INFORMATION VISUALIZATION, 2007, : 579 - +