Grouping related stack overflow comments for software developer recommendation

被引:0
|
作者
Sheth, Viral [1 ]
Damevski, Kostadin [1 ]
机构
[1] Virginia Commonwealth Univ, Dept Comp Sci, Richmond, VA 23284 USA
关键词
Software developer discussions; Developer forums; Stack overflow; Comment grouping; Comment ranking;
D O I
10.1007/s10515-022-00339-9
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Stack Overflow is a question and answer forum widely used by developers all over the world. Contributors share their knowledge on this platform not only in the form of answers, but also as comments to those answers. With millions of developer-contributed comments, the valuable knowledge contained within them remains difficult to locate by readers. Moreover, Stack Overflow's comment hiding mechanism that only shows the top five most highly voted comments and hides the remaining leads to wealth condensation. Recently, researchers have observed that the Stack Overflow's comment display mechanism hides important and relevant comments and makes it difficult for readers to understand the conversational context, as many comments are related to other hidden comments. In this paper, we propose a set of features and a machine learning-based technique to identify the relatedness of pairs of comments. Further, we extend the relatedness into comment clustering, as, with clusters, readers can get the entire context of a set of comments that form a single conversational thread. We evaluate our methods against several baselines to show that they provide strong improvements, although the problem in general is made difficult by the short text and narrow topic of discussion in the comments.
引用
收藏
页数:21
相关论文
共 50 条
  • [1] Grouping related stack overflow comments for software developer recommendation
    Viral Sheth
    Kostadin Damevski
    [J]. Automated Software Engineering, 2022, 29
  • [2] Studying Software Developer Expertise and Contributions in Stack Overflow and GitHub
    Vadlamani, Sri Lakshmi
    Baysal, Olga
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON SOFTWARE MAINTENANCE AND EVOLUTION (ICSME 2020), 2020, : 312 - 323
  • [3] API-Related Developer Information Needs in Stack Overflow
    Liu, Mingwei
    Peng, Xin
    Marcus, Andrian
    Xing, Shuangshuang
    Treude, Christoph
    Zhao, Chengyuan
    [J]. IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 2022, 48 (11) : 4485 - 4500
  • [4] Grouping Android Tag Synonyms on Stack Overflow
    Beyer, Stefanie
    Pinzger, Martin
    [J]. 13TH WORKING CONFERENCE ON MINING SOFTWARE REPOSITORIES (MSR 2016), 2016, : 430 - 440
  • [5] Expert as a Service: Software Expert Recommendation via Knowledge Domain Embeddings in Stack Overflow
    Huang, Chaoran
    Yao, Lina
    Wang, Xianzhi
    Benatallah, Boualem
    Sheng, Quan Z.
    [J]. 2017 IEEE 24TH INTERNATIONAL CONFERENCE ON WEB SERVICES (ICWS 2017), 2017, : 317 - 324
  • [6] Sentiment overflow in the testing stack: Analyzing software testing posts on Stack Overflow
    Swillus, Mark
    Zaidman, Andy
    [J]. JOURNAL OF SYSTEMS AND SOFTWARE, 2023, 205
  • [7] IEA: an answerer recommendation approach on stack overflow
    Liting Wang
    Li Zhang
    Jing Jiang
    [J]. Science China Information Sciences, 2019, 62
  • [8] IEA: an answerer recommendation approach on stack overflow
    Wang, Liting
    Zhang, Li
    Jiang, Jing
    [J]. SCIENCE CHINA-INFORMATION SCIENCES, 2019, 62 (11)
  • [9] IEA: an answerer recommendation approach on stack overflow
    Liting WANG
    Li ZHANG
    Jing JIANG
    [J]. Science China(Information Sciences), 2019, 62 (11) : 51 - 69
  • [10] SOFTWARE ROUTINE PREVENTS STACK OVERFLOW
    EHRECKE, RJ
    [J]. EDN, 1985, 30 (20) : 304 - &