Leveraging Unsupervised Learning to Summarize APIs Discussed in Stack Overflow

被引:5
|
作者
Naghshzan, AmirHossein [1 ]
Guerrouj, Latifa [1 ]
Baysal, Olga [2 ]
机构
[1] Ecole Technol Super, Montreal, PQ, Canada
[2] Carleton Univ, Ottawa, ON, Canada
关键词
code summarization; unsupervised learning; unofficial documentation; survey; professional developers;
D O I
10.1109/SCAM52516.2021.00026
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Automated source code summarization is a task that generates summarized information about the purpose, usage, and-or implementation of methods and classes to support understanding of these code entities. Multiple approaches and techniques have been proposed for supervised and unsupervised learning in code summarization, however, they were mostly focused on generating a summary for a piece of code. In addition, very few works have leveraged unofficial documentation. This paper proposes an automatic and novel approach for summarizing Android API methods discussed in Stack Overflow that we consider as unofficial documentation in this research. Our approach takes the API method's name as an input and generates a natural language summary based on Stack Overflow discussions of that API method. We have conducted a survey that involves 16 Android developers to evaluate the quality of our automatically generated summaries and compare them with the official Android documentation. Our results demonstrate that while developers find the official documentation more useful in general, the generated summaries are also competitive, in particular for offering implementation details, and can be used as a complementary source for guiding developers in software development and maintenance tasks.
引用
收藏
页码:142 / 152
页数:11
相关论文
共 50 条
  • [41] Web APIs: Features, Issues, and Expectations - A Large-Scale Empirical Study of Web APIs From Two Publicly Accessible Registries Using Stack Overflow and a User Survey
    Zhang, Neng
    Zou, Ying
    Xia, Xin
    Huang, Qiao
    Lo, David
    Li, Shanping
    IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 2023, 49 (02) : 498 - 528
  • [42] State and tendency: an empirical study of deep learning question&answer topics on Stack Overflow
    Henghui ZHAO
    Yanhui LI
    Fanwei LIU
    Xiaoyuan XIE
    Lin CHEN
    Science China(Information Sciences), 2021, 64 (11) : 131 - 153
  • [43] A Systematic Literature Review on Using Machine Learning Algorithms for Software Requirements Identification on Stack Overflow
    Ahmad, Arshad
    Feng, Chong
    Khan, Muzammil
    Khan, Asif
    Ullah, Ayaz
    Nazir, Shah
    Tahir, Adnan
    SECURITY AND COMMUNICATION NETWORKS, 2020, 2020
  • [44] Poster: Learning to Mine Parallel Natural Language/Source Code Corpora from Stack Overflow
    Yin, Pengcheng
    Deng, Bowen
    Chen, Edgar
    Vasilescu, Bogdan
    Neubig, Graham
    PROCEEDINGS 2018 IEEE/ACM 40TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING - COMPANION (ICSE-COMPANION, 2018, : 388 - 389
  • [45] ERF: An Empirical Recommender Framework for Ascertaining Appropriate Learning Materials from Stack Overflow Discussions
    Iqbal, Ashesh
    Khatun, Sumi
    Arefin, Mohammad Shamsul
    Dewan, M. Ali Akber
    COMPUTERS, 2020, 9 (03) : 1 - 16
  • [46] What are the emotions of developers towards deep learning documentation? - An exploratory study on Stack Overflow posts
    Venigalla, Akhila Sri Manasa
    Chimalakonda, Sridhar
    INFORMATION AND SOFTWARE TECHNOLOGY, 2025, 179
  • [47] An Empirical Evaluation of Machine Learning Algorithms for Identifying Software Requirements on Stack Overflow: Initial Results
    Ahmad, Arshad
    Feng, Chong
    Tahir, Adnan
    Khan, Asif
    Waqas, Muhammad
    Ahmad, Sadique
    Ullah, Ayaz
    PROCEEDINGS OF 2019 IEEE 10TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING AND SERVICE SCIENCE (ICSESS 2019), 2019, : 693 - 697
  • [48] Time Series Forecasting Performance of the Novel Deep Learning Algorithms on Stack Overflow Website Data
    Guven, Mesut
    Uysal, Fatih
    APPLIED SCIENCES-BASEL, 2023, 13 (08):
  • [49] State and tendency: an empirical study of deep learning question&answer topics on Stack Overflow
    Henghui Zhao
    Yanhui Li
    Fanwei Liu
    Xiaoyuan Xie
    Lin Chen
    Science China Information Sciences, 2021, 64
  • [50] State and tendency: an empirical study of deep learning question&answer topics on Stack Overflow
    Zhao, Henghui
    Li, Yanhui
    Liu, Fanwei
    Xie, Xiaoyuan
    Chen, Lin
    SCIENCE CHINA-INFORMATION SCIENCES, 2021, 64 (11)