A Survey on How Test Flakiness Affects Developers and What Support They Need To Address It

Cited by: 11
Authors
Gruber, Martin [1]
Fraser, Gordon [2]
Affiliations
[1] Univ Passau, BMW Grp, Munich, Germany
[2] Univ Passau, Passau, Germany
Keywords
Flaky Tests; Empirical Study; Survey;
DOI
10.1109/ICST53961.2022.00020
Chinese Library Classification (CLC) number
TP31 [Computer Software]
Subject classification codes
081202; 0835
Abstract
Non-deterministically passing and failing test cases, so-called flaky tests, have recently become a focus area of software engineering research. While this research focus has been met with some enthusiastic endorsement from industry, prior work nevertheless mostly studied flakiness using a code-centric approach by mining software repositories. What data extracted from software repositories cannot tell us, however, is how developers perceive flakiness: How prevalent is test flakiness in developers' daily routine, how does it affect them, and most importantly: What do they want us researchers to do about it? To answer these questions, we surveyed 335 professional software developers and testers in different domains. The survey respondents confirm that flaky tests are a common and serious problem, thus reinforcing ongoing research on flaky test detection. Developers are less worried about the computational costs caused by re-running tests and more about the loss of trust in the test outcomes. Therefore, they would like to have IDE plugins to detect flaky code as well as better visualizations of the problem, particularly dashboards showing test outcomes over time; they also wish for more training and information on flakiness. These important aspects will require the attention of researchers as well as tool developers.
Pages: 82-92
Page count: 11