Modeling the Evolution of Development Topics using Dynamic Topic Models

Cited by: 0
Authors
Hu, Jiajun [1 ]
Sun, Xiaobing [1 ,3 ]
Lo, David [2 ]
Li, Bin [1 ,3 ]
Affiliations
[1] Yangzhou Univ, Sch Informat Engn, Yangzhou, Jiangsu, Peoples R China
[2] Singapore Management Univ, Sch Informat Syst, Singapore, Singapore
[3] Nanjing Univ, State Key Lab Novel Software Technol, Nanjing, Jiangsu, Peoples R China
Keywords
DOI
Not available
Chinese Library Classification (CLC) number
TP31 [Computer Software];
Discipline classification code
081202 ; 0835 ;
Abstract
As a software project develops, its complexity grows, making it harder to understand and maintain. During software maintenance and evolution, developers and stakeholders constantly shift their focus between different tasks and topics. They need to investigate software repositories (e.g., revision control systems) to learn which tasks have recently been worked on and how much effort has been devoted to them. For example, if an important new feature request is received, a portion of the work that developers perform ought to be relevant to adding that feature. If this does not happen, project managers might wonder what developers are currently working on. Several topic analysis tools based on Latent Dirichlet Allocation (LDA) have been proposed to analyze information stored in software repositories and model software evolution, helping software stakeholders stay aware of the focus of development efforts at various times during software evolution. Previous LDA-based topic analysis tools can capture either changes in the strengths of various development topics over time (i.e., strength evolution) or changes in the content of existing topics over time (i.e., content evolution). Unfortunately, none of the existing techniques can capture both strength and content evolution. In this paper, we use Dynamic Topic Models (DTM) to analyze commit messages within a project's lifetime to capture both strength and content evolution simultaneously. We evaluate our approach by conducting a case study on the commit messages of two well-known open source software systems, jEdit and PostgreSQL. The results show that our approach can capture not only how the strengths of various development topics change over time, but also how the content of each topic (i.e., the words that form the topic) changes over time. Compared with existing topic analysis approaches, our approach provides a more complete and valuable view of software evolution, helping developers better understand the evolution of their projects.
Pages: 3-12
Number of pages: 10
Related papers
50 in total
  • [1] Darwin's Theory Of Censorship: Analysing the Evolution of Censored Topics with Dynamic Topic Models
    Waheed, Asim
    Qunaibi, Sara
    Barradas, Diogo
    Weinberg, Zachary
    [J]. PROCEEDINGS OF THE 21ST WORKSHOP ON PRIVACY IN THE ELECTRONIC SOCIETY, WPES 2022, 2022, : 103 - 108
  • [2] Discovering Health Topics in Social Media Using Topic Models
    Paul, Michael J.
    Dredze, Mark
    [J]. PLOS ONE, 2014, 9 (08):
  • [3] How to Identify Hot Topics in Psychology Using Topic Modeling
    Bittermann, Andre
    Fischer, Andreas
    [J]. ZEITSCHRIFT FUR PSYCHOLOGIE-JOURNAL OF PSYCHOLOGY, 2018, 226 (01): : 3 - 13
  • [4] Exploring Evolution of Public Opinions on Tianya Club Using Dynamic Topic Models
    Yan, Zhihua
    Tang, Xijin
    [J]. Journal of Systems Science and Information, 2020, 8 (04) : 309 - 324
  • [5] Discovering the evolution of artificial intelligence in cancer research using dynamic topic modeling
    Mosallaie, Shahab
    Rad, Mandi
    Schiffauerova, Andrea
    Ebadi, Ashkan
    [J]. COLLNET JOURNAL OF SCIENTOMETRICS AND INFORMATION MANAGEMENT, 2021, 15 (02) : 225 - 240
  • [6] Modeling the evolution of climate change assessment research using dynamic topic models and cross-domain divergence maps
    Sleeman, Jennifer
    Halem, Milton
    Finin, Tim
    Cane, Mark
    [J]. AAAI Spring Symposium - Technical Report, 2017, SS-17-01 - SS-17-08 : 85 - 94
  • [7] Detecting Favorite Topics in Computing Scientific Literature via Dynamic Topic Modeling
    Quille, Rosa Virginia Encinas
    Barros, Jose Melendez
    Barbado Junior, Marcio
    De Almeida, Felipe Valencia
    Pizzigatti Correa, Pedro Luiz
    [J]. IEEE ACCESS, 2023, 11 : 41535 - 41545
  • [8] Towards Predicting Trend of Scientific Research Topics using Topic Modeling
    Abuhay, Tesfamariam M.
    Nigatie, Yemisrach G.
    Kovalchuk, Sergey V.
    [J]. 7TH INTERNATIONAL YOUNG SCIENTISTS CONFERENCE ON COMPUTATIONAL SCIENCE, YSC2018, 2018, 136 : 304 - 310
  • [9] Extraction of Trending Topics in Microblogs Using Topic Models and Discrimination of Biterms
    Quesada Grosso, Minor
    Casasola Murillo, Edgar
    Leoni de Leon, Antonio
    [J]. REVISTA KANINA, 2016, 40 (04): : 33 - 47
  • [10] Studying software evolution using topic models
    Thomas, Stephen W.
    Adams, Bram
    Hassan, Ahmed E.
    Blostein, Dorothea
    [J]. SCIENCE OF COMPUTER PROGRAMMING, 2014, 80 : 457 - 479