TECHSUMBOT: A Stack Overflow Answer Summarization Tool for Technical Query

被引:0
|
作者
Yang, Chengran [1 ]
Xu, Bowen [1 ]
Liu, Jiakun [1 ]
Lo, David [1 ]
机构
[1] Singapore Management Univ, Sch Comp & Informat Syst, Singapore, Singapore
关键词
Summarization; Question Retrieval;
D O I
10.1109/ICSE-Companion58688.2023.00040
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Stack Overflow is a popular platform for developers to seek solutions to programming-related problems. However, prior studies identified that developers may suffer from the redundant, useless, and incomplete information retrieved by the Stack Overflow search engine. To help developers better utilize the Stack Overflow knowledge, researchers proposed tools to summarize answers to a Stack Overflow question. However, existing tools use hand-craft features to assess the usefulness of each answer sentence and fail to remove semantically redundant information in the result. Besides, existing tools only focus on a certain programming language and cannot retrieve up-to-date new posted knowledge from Stack Overflow. In this paper, we propose TECHSUMBOT, an automatic answer summary generation tool for a technical problem. Given a question, TECH-SUMBOT first retrieves answers using the Stack Overflow search engine, then TECHSUMBOT 1) ranks each answers sentence based on the sentence's usefulness, 2) estimates the centrality of each sentence to all candidates, and 3) removes the semantic redundant information. Finally, TECHSUMBOT returns the top 5 ranked answer sentences as the answer summary. We implement TECHSUMBOT in the form of a search engine website. To evaluate TECHSUMBOT in both automatic and manual manners, we construct the first Stack Overflow multi-answer summarization benchmark and design a manual evaluation study to assess the effectiveness of TECHSUMBOT and state-of-the-art baselines from the NLP and SE domain. Both results indicate that the summaries generated by TECHSUMBOT are more diverse, useful, and similar to the ground truth summaries.
引用
收藏
页码:132 / 135
页数:4
相关论文
共 41 条
  • [31] Multi-Feature Fusion Based Structural Deep Neural Network for Predicting Answer Time on Stack Overflow
    Shi-Kai Guo
    Si-Wen Wang
    Hui Li
    Yu-Long Fan
    Ya-Qing Liu
    Bin Zhang
    Journal of Computer Science and Technology, 2023, 38 : 582 - 599
  • [32] Multi-Feature Fusion Based Structural Deep Neural Network for Predicting Answer Time on Stack Overflow
    Guo, Shi-Kai
    Wang, Si-Wen
    Li, Hui
    Fan, Yu-Long
    Liu, Ya-Qing
    Zhang, Bin
    JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2023, 38 (03) : 582 - 599
  • [33] Trouble in programmer's paradise: gender-biases in sharing and recognising technical knowledge on Stack Overflow
    Brooke, S. J.
    INFORMATION COMMUNICATION & SOCIETY, 2021, 24 (14) : 2091 - 2112
  • [34] How Do Users Revise Answers on Technical Q&A Websites? A Case Study on Stack Overflow
    Wang, Shaowei
    Chen, Tse-Hsun
    Hassan, Ahmed E.
    IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 2020, 46 (09) : 1024 - 1038
  • [35] What is discussed about Flutter on Stack Overflow (SO) question-and-answer (Q&A) website: An empirical study*
    Alanazi, Afit
    Alfayez, Reem
    JOURNAL OF SYSTEMS AND SOFTWARE, 2024, 215
  • [36] Code2Que: A Tool for Improving Question Titles from Mined Code Snippets in Stack Overflow
    Gao, Zhipeng
    Xia, Xin
    Lo, David
    Grundy, John
    Li, Yuan-Fang
    PROCEEDINGS OF THE 29TH ACM JOINT MEETING ON EUROPEAN SOFTWARE ENGINEERING CONFERENCE AND SYMPOSIUM ON THE FOUNDATIONS OF SOFTWARE ENGINEERING (ESEC/FSE '21), 2021, : 1525 - 1529
  • [37] Code2Que: A tool for improving question titles from mined code snippets in stack overflow
    Gao, Zhipeng
    Xia, Xin
    Lo, David
    Grundy, John
    Li, Yuan-Fang
    ESEC/FSE 2021 - Proceedings of the 29th ACM Joint Meeting European Software Engineering Conference and Symposium on the Foundations of Software Engineering, 2021, : 1525 - 1529
  • [38] What is asked about technical debt (TD) on Stack Exchange question-and-answer (Q&A) websites? An observational study
    Alfayez, Reem
    Ding, Yunyan
    Winn, Robert
    Alfayez, Ghaida
    Harman, Christopher
    Boehm, Barry
    EMPIRICAL SOFTWARE ENGINEERING, 2023, 28 (02)
  • [39] What is asked about technical debt (TD) on Stack Exchange question-and-answer (Q&A) websites? An observational study
    Reem Alfayez
    Yunyan Ding
    Robert Winn
    Ghaida Alfayez
    Christopher Harman
    Barry Boehm
    Empirical Software Engineering, 2023, 28
  • [40] How do developers discuss and support new programming languages in technical Q&A site? An empirical study of Go, Swift, and Rust in Stack Overflow
    Chakraborty, Partha
    Shahriyar, Rifat
    Iqbal, Anindya
    Uddin, Gias
    INFORMATION AND SOFTWARE TECHNOLOGY, 2021, 137