The Types, Roles, and Practices of Documentation in Data Analytics Open Source Software Libraries

被引:17
|
作者
Geiger, R. Stuart [1 ]
Varoquaux, Nelle [1 ,2 ]
Mazel-Cabasse, Charlotte [1 ]
Holdgraf, Chris [1 ,3 ]
机构
[1] Univ Calif Berkeley, Berkeley Inst Data Sci, 190 Doe Lib, Berkeley, CA 94730 USA
[2] Univ Calif Berkeley, Dept Stat, Berkeley Inst Data Sci, Berkeley, CA 94720 USA
[3] Univ Calif Berkeley, Helen Wills Neurosci Inst, Berkeley Inst Data Sci, Berkeley, CA 94720 USA
来源
COMPUTER SUPPORTED COOPERATIVE WORK-THE JOURNAL OF COLLABORATIVE COMPUTING AND WORK PRACTICES | 2018年 / 27卷 / 3-6期
关键词
Documentation; Standards; Invisible work; Motivations; Peer production; Collaboration; Infrastructure; Ethnography; Open source; ORGANIZATIONAL PROCESS; WORK;
D O I
10.1007/s10606-018-9333-1
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Computational research and data analytics increasingly relies on complex ecosystems of open source software (OSS) "libraries" - curated collections of reusable code that programmers import to perform a specific task. Software documentation for these libraries is crucial in helping programmers/analysts know what libraries are available and how to use them. Yet documentation for open source software libraries is widely considered low-quality. This article is a collaboration between CSCW researchers and contributors to data analytics OSS libraries, based on ethnographic fieldwork and qualitative interviews. We examine several issues around the formats, practices, and challenges around documentation in these largely volunteer-based projects. There are many different kinds and formats of documentation that exist around such libraries, which play a variety of educational, promotional, and organizational roles. The work behind documentation is similarly multifaceted, including writing, reviewing, maintaining, and organizing documentation. Different aspects of documentation work require contributors to have different sets of skills and overcome various social and technical barriers. Finally, most of our interviewees do not report high levels of intrinsic enjoyment for doing documentation work (compared to writing code). Their motivation is affected by personal and project-specific factors, such as the perceived level of credit for doing documentation work versus more 'technical' tasks like adding new features or fixing bugs. In studying documentation work for data analytics OSS libraries, we gain a new window into the changing practices of data-intensive research, as well as help practitioners better understand how to support this often invisible and infrastructural work in their projects.
引用
收藏
页码:767 / 802
页数:36
相关论文
共 50 条
  • [1] The Types, Roles, and Practices of Documentation in Data Analytics Open Source Software LibrariesA Collaborative Ethnography of Documentation Work
    R. Stuart Geiger
    Nelle Varoquaux
    Charlotte Mazel-Cabasse
    Chris Holdgraf
    Computer Supported Cooperative Work (CSCW), 2018, 27 : 767 - 802
  • [2] Demystifying the Impact of Open-Source Machine Learning Libraries on Software Analytics
    Zhao, Yu
    Gong, Yihui
    Gong, Lina
    Jiang, Shujuan
    Huang, Zhiqiu
    IEEE TRANSACTIONS ON RELIABILITY, 2024,
  • [3] A Study on the Software Architecture Documentation Practices and Maturity in Open-Source Software Development
    Muszynski, Michel
    Lugtigheid, Sven
    Castor, Fernando
    Brinkkemper, Sjaak
    IEEE 19TH INTERNATIONAL CONFERENCE ON SOFTWARE ARCHITECTURE (ICSA 2022), 2022, : 47 - 57
  • [4] A Log Data Analytics based Scheduling in Open Source Cloud Software
    Srikrishnan, V.
    Sivasankar, E.
    Pitchiah, R.
    2014 INTERNATIONAL CONFERENCE ON PARALLEL, DISTRIBUTED AND GRID COMPUTING (PDGC), 2014, : 390 - 395
  • [5] A data science and open source software approach to analytics for strategic sourcing
    Boehmke, Brad
    Hazen, Benjamin
    Boone, Christopher A.
    Robinson, Jessica L.
    INTERNATIONAL JOURNAL OF INFORMATION MANAGEMENT, 2020, 54
  • [6] Possibilities for open source software in libraries
    Morgan, EL
    INFORMATION TECHNOLOGY AND LIBRARIES, 2002, 21 (01) : 12 - 15
  • [7] Open source software use in libraries
    Payne, Alexandria
    Singh, Vandana
    LIBRARY REVIEW, 2010, 59 (09) : 708 - 717
  • [8] Practical Open Source Software for Libraries
    Chawner, Brenda
    ONLINE INFORMATION REVIEW, 2011, 35 (06) : 977 - 978
  • [9] PRACTICAL OPEN SOURCE SOFTWARE FOR LIBRARIES
    O'Hagan, Emma
    JOURNAL OF WEB LIBRARIANSHIP, 2011, 5 (04) : 345 - 346
  • [10] Evaluating the Quality of the Documentation of Open Source Software
    Aversano, Lerina
    Guardabascio, Daniela
    Tortorella, Maria
    ENASE: PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON EVALUATION OF NOVEL APPROACHES TO SOFTWARE ENGINEERING, 2017, : 308 - 313