A comparative evaluation of machine learning and deep learning algorithms for question categorization of VQA datasets

被引:1
|
作者
Asudani, Deepak Suresh [1 ]
Nagwani, Naresh Kumar [1 ]
Singh, Pradeep [1 ]
机构
[1] Natl Inst Technol, Dept Comp Sci & Engn, Raipur, Chhattisgarh, India
关键词
Question Classification; Machine Learning; Deep Learning; SMOTE; BERT-based Transformers;
D O I
10.1007/s11042-023-17797-2
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Question classification primarily involves categorizing questions based on the type of answer, with less emphasis on the words or phrases used to form the query. Question classification is crucial in the Visual Question Answering (VQA) system, and the dataset's quality plays an essential role in the system's development. The available question categorization in the VQA and TDIUC datasets shows imbalance, and the VQA model trained on imbalanced datasets performs poorly in handling language-prior problems, failing to categorize questions, and predicting incorrect outcomes. Therefore, developing a better classification method for classifying questions into appropriate categories based on phrases is necessary. This paper examines the effectiveness of the synthetic minority oversampling technique (SMOTE) in addressing the class imbalance problem within the question classification task using the LSTM, selected machine learning models and BERT-based transformer model. The preprocessing and analysis module efficiently categorizes input question sets by identifying valuable phrases and obtaining an evenly distributed dataset based on question categories from both datasets. The performance evaluation of Naive Bayes, SVM, Random Forests, and XGBoost models shows that the XGBoost model outperforms other selected classifiers, and the LSTM model achieves higher accuracy but requires more computation time. The empirical assessment indicates that the BERT-based transformer model exceeds the traditional models employed for comparison. The ablation study also reveals that utilizing SMOTE techniques for question classification tasks achieves slightly improved accuracy at the expense of higher computation time and resources. It is concluded that the BERT-based transformer model efficiently and precisely performs question classification tasks.
引用
收藏
页码:57829 / 57859
页数:31
相关论文
共 50 条
  • [1] Evaluation of Deep Learning and Machine Learning Algorithms for Building Occupancy Classification on Open Datasets
    Cretu, Georgiana
    Stamatescu, Iulia
    Stamatescu, Grigore
    [J]. 2023 31ST MEDITERRANEAN CONFERENCE ON CONTROL AND AUTOMATION, MED, 2023, : 575 - 580
  • [2] Disease Inference on Medical Datasets Using Machine Learning and Deep Learning Algorithms
    Chinnaswamy, Arunkumar
    Srinivasan, Ramakrishnan
    Gaurang, Desai Prutha
    [J]. COMPUTATIONAL VISION AND BIO-INSPIRED COMPUTING, 2020, 1108 : 902 - 908
  • [3] Deep Learning in Predicting Preterm Birth: A Comparative Study of Machine Learning Algorithms
    Zhang Fangchao
    Tong Lingling
    Shi Chen
    Zuo Rui
    Wang Liwei
    Wang Yan
    [J]. 母胎医学杂志(英文)., 2024, 06 (03)
  • [4] Comparative Performance of Deep Learning and Machine Learning Algorithms on Imbalanced Handwritten Data
    Amri, A'Inur A'Fifah
    Ismail, Amelia Ritahani
    Zarir, Abdullah Ahmad
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2018, 9 (02) : 258 - 264
  • [5] Deep Learning in Predicting Preterm Birth: A Comparative Study of Machine Learning Algorithms
    Zhang, Fangchao
    Tong, Lingling
    Shi, Chen
    Zuo, Rui
    Wang, Liwei
    Wang, Yan
    [J]. MATERNAL-FETAL MEDICINE, 2024, 6 (03) : 141 - 146
  • [6] Comparison of Machine Learning Algorithms on Different Datasets
    Uysal, Elif
    Ozturk, Ali
    [J]. 2018 26TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2018,
  • [7] Machine learning algorithms for diabetes detection: a comparative evaluation of performance of algorithms
    Surabhi Saxena
    Debashish Mohapatra
    Subhransu Padhee
    Goutam Kumar Sahoo
    [J]. Evolutionary Intelligence, 2023, 16 : 587 - 603
  • [8] Machine learning algorithms for diabetes detection: a comparative evaluation of performance of algorithms
    Saxena, Surabhi
    Mohapatra, Debashish
    Padhee, Subhransu
    Sahoo, Goutam Kumar
    [J]. EVOLUTIONARY INTELLIGENCE, 2023, 16 (02) : 587 - 603
  • [9] Comparative Analysis of Intrusion Detection System Using Machine Learning and Deep Learning Algorithms
    Note, Johan
    Ali, Maaruf
    [J]. Annals of Emerging Technologies in Computing, 2022, 6 (03) : 19 - 36
  • [10] A comparative study of machine learning and deep learning algorithms for padel tennis shot classification
    Dominguez, Guillermo Cartes
    Alvarez, Evelia Franco
    Cordoba, Alejandro Tapia
    Reina, Daniel Gutierrez
    [J]. SOFT COMPUTING, 2023, 27 (17) : 12367 - 12385