Leveraging Unsupervised Learning to Summarize APIs Discussed in Stack Overflow

被引：5

作者：

Naghshzan, AmirHossein ^{[1
]}

Guerrouj, Latifa ^{[1
]}

Baysal, Olga ^{[2
]}

机构：

[1] Ecole Technol Super, Montreal, PQ, Canada

[2] Carleton Univ, Ottawa, ON, Canada

来源：

IEEE 21ST INTERNATIONAL WORKING CONFERENCE ON SOURCE CODE ANALYSIS AND MANIPULATION (SCAM 2021) | 2021年

关键词：

code summarization; unsupervised learning; unofficial documentation; survey; professional developers;

D O I：

10.1109/SCAM52516.2021.00026

中图分类号：

TP31 [计算机软件];

学科分类号：

081202 ; 0835 ;

摘要：

Automated source code summarization is a task that generates summarized information about the purpose, usage, and-or implementation of methods and classes to support understanding of these code entities. Multiple approaches and techniques have been proposed for supervised and unsupervised learning in code summarization, however, they were mostly focused on generating a summary for a piece of code. In addition, very few works have leveraged unofficial documentation. This paper proposes an automatic and novel approach for summarizing Android API methods discussed in Stack Overflow that we consider as unofficial documentation in this research. Our approach takes the API method's name as an input and generates a natural language summary based on Stack Overflow discussions of that API method. We have conducted a survey that involves 16 Android developers to evaluate the quality of our automatically generated summaries and compare them with the official Android documentation. Our results demonstrate that while developers find the official documentation more useful in general, the generated summaries are also competitive, in particular for offering implementation details, and can be used as a complementary source for guiding developers in software development and maintenance tasks.

引用

页码：142 / 152

页数：11

共 50 条

[41] Web APIs: Features, Issues, and Expectations - A Large-Scale Empirical Study of Web APIs From Two Publicly Accessible Registries Using Stack Overflow and a User Survey
Zhang, Neng
Zou, Ying
Xia, Xin
Huang, Qiao
Lo, David
Li, Shanping
IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 2023, 49 (02) : 498 - 528
[42] State and tendency: an empirical study of deep learning question&answer topics on Stack Overflow
Henghui ZHAO
Yanhui LI
Fanwei LIU
Xiaoyuan XIE
Lin CHEN
Science China(Information Sciences), 2021, 64 (11) : 131 - 153
[43] A Systematic Literature Review on Using Machine Learning Algorithms for Software Requirements Identification on Stack Overflow
Ahmad, Arshad
Feng, Chong
Khan, Muzammil
Khan, Asif
Ullah, Ayaz
Nazir, Shah
Tahir, Adnan
SECURITY AND COMMUNICATION NETWORKS, 2020, 2020
[44] Poster: Learning to Mine Parallel Natural Language/Source Code Corpora from Stack Overflow
Yin, Pengcheng
Deng, Bowen
Chen, Edgar
Vasilescu, Bogdan
Neubig, Graham
PROCEEDINGS 2018 IEEE/ACM 40TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING - COMPANION (ICSE-COMPANION, 2018, : 388 - 389
[45] ERF: An Empirical Recommender Framework for Ascertaining Appropriate Learning Materials from Stack Overflow Discussions
Iqbal, Ashesh
Khatun, Sumi
Arefin, Mohammad Shamsul
Dewan, M. Ali Akber
COMPUTERS, 2020, 9 (03) : 1 - 16
[46] What are the emotions of developers towards deep learning documentation? - An exploratory study on Stack Overflow posts
Venigalla, Akhila Sri Manasa
Chimalakonda, Sridhar
INFORMATION AND SOFTWARE TECHNOLOGY, 2025, 179
[47] An Empirical Evaluation of Machine Learning Algorithms for Identifying Software Requirements on Stack Overflow: Initial Results
Ahmad, Arshad
Feng, Chong
Tahir, Adnan
Khan, Asif
Waqas, Muhammad
Ahmad, Sadique
Ullah, Ayaz
PROCEEDINGS OF 2019 IEEE 10TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING AND SERVICE SCIENCE (ICSESS 2019), 2019, : 693 - 697
[48] Time Series Forecasting Performance of the Novel Deep Learning Algorithms on Stack Overflow Website Data
Guven, Mesut
Uysal, Fatih
APPLIED SCIENCES-BASEL, 2023, 13 (08):
[49] State and tendency: an empirical study of deep learning question&answer topics on Stack Overflow
Henghui Zhao
Yanhui Li
Fanwei Liu
Xiaoyuan Xie
Lin Chen
Science China Information Sciences, 2021, 64
[50] State and tendency: an empirical study of deep learning question&answer topics on Stack Overflow
Zhao, Henghui
Li, Yanhui
Liu, Fanwei
Xie, Xiaoyuan
Chen, Lin
SCIENCE CHINA-INFORMATION SCIENCES, 2021, 64 (11)

← 1 2 3 4 5 →