Towards an Accurate Prediction of the Question Quality on Stack Overflow using a Deep-Learning-Based NLP Approach

被引:10
|
作者
Toth, Laszlo [1 ]
Nagy, Balazs [1 ]
Jantho, David [1 ]
Vidacs, Laszlo [1 ,2 ]
Gyimothy, Tibor [1 ,2 ]
机构
[1] Univ Szeged, Dept Software Engn, Szeged, Hungary
[2] Univ Szeged, MTA SZTE Res Grp Artificial Intelligence, Szeged, Hungary
关键词
Question Answering; Q&A; Stack Overflow; Quality; Natural Language Processing; NLP; Deep Learning; Doc2Vec;
D O I
10.5220/0007971306310639
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Online question answering (Q&A) forums like Stack Overflow have been playing an increasingly important role in supporting the daily tasks of developers. Stack Overflow can be considered as a meeting point of experienced developers and those who are looking for a solution for a specific problem. Since anyone with any background and experience level can ask and respond to questions, the community tries to use different solutions to maintain quality, such as closing and deleting inappropriate posts. As over 8,000 posts arrive on Stack Overflow every day, the effective automatic filtering of them is essential. In this paper, we present a novel approach for classifying questions based exclusively on their linguistic and semantic features using deep learning method. Our binary classifier relying on the textual properties of posts can predict whether the question is to be closed with an accuracy of 74% similar to the results of previous metrics-based models. In accordance with our findings we conclude that by combining deep learning and natural language processing methods, the maintenance of quality at Q&A forums could be supported using only the raw text of posts.
引用
收藏
页码:631 / 639
页数:9
相关论文
共 50 条
  • [1] Quality Prediction of a Stack Overflow Question Using Machine Learning
    Mehta, Tanvi
    Multaikar, Samruddhi
    Patil, Srushti
    Gawande, Namrata
    ARTIFICIAL INTELLIGENCE: THEORY AND APPLICATIONS, VOL 2, AITA 2023, 2024, 844 : 65 - 80
  • [2] Duplicate Question Detection With Deep Learning in Stack Overflow
    Wang, Liting
    Zhang, Li
    Jiang, Jing
    IEEE ACCESS, 2020, 8 (08): : 25964 - 25975
  • [3] Deep-learning-based Accurate Beamforming Prediction Using LiDAR-assisted Network
    Rinchi, Omar
    Alsharoa, Ahmad
    Shatnawi, Ibrahem
    2023 IEEE 34TH ANNUAL INTERNATIONAL SYMPOSIUM ON PERSONAL, INDOOR AND MOBILE RADIO COMMUNICATIONS, PIMRC, 2023,
  • [4] Deep-Learning-Based Approach for Prediction of Algal Blooms
    Zhang, Feng
    Wang, Yuanyuan
    Cao, Minjie
    Sun, Xiaoxiao
    Du, Zhenhong
    Liu, Renyi
    Ye, Xinyue
    SUSTAINABILITY, 2016, 8 (10)
  • [5] Predicting Tags of Stack Overflow Questions: A Deep Learning Approach
    Subramani, Srinivas
    Rajesh, Sangeetha
    Wankhede, Kirti
    Wukkadada, Bharati
    2023 Somaiya International Conference on Technology and Information Management, SICTIM 2023, 2023, : 64 - 66
  • [6] DeepSipred: A deep-learning-based approach on siRNA inhibition prediction
    Liu, Bin
    Huang, Huiya
    Liao, Weixi
    Pan, Xiaoyong
    Jin, Cheng
    Yuan, Ye
    PROCEEDINGS OF 2024 4TH INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND INTELLIGENT COMPUTING, BIC 2024, 2024, : 430 - 436
  • [7] A novel deep-learning-based pressure distribution prediction approach of airfoils
    Zhang, Hao
    PROCEEDINGS OF THE INSTITUTION OF MECHANICAL ENGINEERS PART G-JOURNAL OF AEROSPACE ENGINEERING, 2023, 237 (16) : 3786 - 3799
  • [8] Why Will My Question Be Closed? NLP-Based Pre-Submission Predictions of Question Closing Reasons on Stack Overflow
    Toth, Laszlo
    Nagy, Balazs
    Gyimothy, Tibor
    Vidacs, Laszlo
    2020 IEEE/ACM 42ND INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING: NEW IDEAS AND EMERGING RESULTS (ICSE-NIER 2020), 2020, : 45 - 48
  • [9] SOQDE: A Supervised Learning based Question Difficulty Estimation Model for Stack Overflow
    Hassan, Sk. Adnan
    Das, Dipto
    Iqbal, Anindya
    Bosu, Amiangshu
    Shahriyar, Rifat
    Ahmed, Toufique
    2018 25TH ASIA-PACIFIC SOFTWARE ENGINEERING CONFERENCE (APSEC 2018), 2018, : 445 - 454
  • [10] State and tendency: an empirical study of deep learning question&answer topics on Stack Overflow
    Henghui ZHAO
    Yanhui LI
    Fanwei LIU
    Xiaoyuan XIE
    Lin CHEN
    Science China(Information Sciences), 2021, 64 (11) : 131 - 153