STraVEns: Sentence Transformer Voting Ensemble for Intent Classification-Based Chatbot Model

被引:0
|
作者
Pravitasari, Anindya Apriliyanti [1 ]
Hamid Asnawi, Mohammad [2 ]
Helen, Afrida [3 ]
Handoko, Budhi [1 ]
Amor Kusuma, Dianne [4 ]
Herawan, Tutut [5 ]
Hendrawati, Triyani [1 ]
机构
[1] Univ Padjadjaran, Fac Math & Nat Sci, Dept Stat, Bandung 45363, Indonesia
[2] Monash Univ, Fac Informat Technol, Dept Data Sci & Artificial Intelligence, Clayton, Vic 3800, Australia
[3] Univ Padjadjaran, Fac Math & Nat Sci, Dept Comp Sci, Bandung 45363, Indonesia
[4] Univ Padjadjaran, Fac Math & Nat Sci, Dept Math, Bandung 45363, Indonesia
[5] Univ Malaya, Fac Comp Sci & Informat Technol, Dept Informat Syst, Kuala Lumpur 50603, Malaysia
来源
IEEE ACCESS | 2024年 / 12卷
关键词
Chatbots; Intent recognition; Transformers; Accuracy; Ensemble learning; Atmospheric modeling; Robustness; Training; Oral communication; Business; Chatbot; ensemble learning; intent classification; machine learning; natural language processing; sentence transformer; voting classifier;
D O I
10.1109/ACCESS.2024.3519223
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Natural Language Processing has experienced significant advancements in recent years, leading to the widespread adoption of Large Language Model-based chatbots. These chatbots are popular due to their ability to engage in context-aware conversations. However, deploying LLM-based chatbots can be resource-intensive, making them less suitable for smaller applications or focused tasks. To address this issue, we propose a robust and flexible approach to intent classification for chatbots using STraVEns (Sentence Transformer Voting Ensemble), which includes both hard voting and soft voting ensembles of sentence transformers. Our proposed method aims to improve accuracy and versatility in intent-based chatbots model. We use five sentence transformer models for this ensemble framework: RoBERTa, DistilRoBERTa, MPNet, MiniLM L6, and MiniLM L12, and evaluated our approach by training and testing using four distinct datasets: ATIS, IDE, Small Talk, and CLINC150 which cover a range of scenarios from general conversation to specific tasks and out-of-scope intent classification. The results demonstrate that the STraVEns approach is a promising solution for intent classification-based chatbot model. Results show that our ensemble models outperformed previous benchmarks, achieving the highest accuracy and F1-scores across all datasets. The soft voting method provided flexibility and robustness, while hard voting ensured stability in specific contexts. Overall, our study suggests that ensemble-based approaches can enhance the performance of intent classification chatbots model, providing a scalable solution for various applications.
引用
收藏
页码:197187 / 197200
页数:14
相关论文
共 50 条
  • [1] Unifying Sentence Transformer Embedding and Softmax Voting Ensemble for Accurate News Category Prediction
    Khosa, Saima
    Mehmood, Arif
    Rizwan, Muhammad
    COMPUTERS, 2023, 12 (07)
  • [2] Probability-Weighted Voting Ensemble Learning for Classification ModelProbability-Weighted Voting Ensemble Learning for Classification Model
    Rojarath, Artitayapron
    Songpan, Wararat
    JOURNAL OF ADVANCES IN INFORMATION TECHNOLOGY, 2020, 11 (04) : 217 - 227
  • [3] ResNet and Transformer Hybrid Malware Classification Model Based on Ensemble Learning
    Li, Kewei
    Liu, Fudong
    PROCEEDINGS OF 2023 7TH INTERNATIONAL CONFERENCE ON ELECTRONIC INFORMATION TECHNOLOGY AND COMPUTER ENGINEERING, EITCE 2023, 2023, : 1269 - 1275
  • [4] Voting Ensemble SVM Model for Deep CNN Based Breast Histopathology Classification
    Chowdhary, Jyoti
    Sankaran, Praveen
    2022 IEEE 19TH INDIA COUNCIL INTERNATIONAL CONFERENCE, INDICON, 2022,
  • [5] An ensemble classification-based approach to detect attack level of SQL injections
    Kasim, Omer
    JOURNAL OF INFORMATION SECURITY AND APPLICATIONS, 2021, 59
  • [6] An Ensemble Classification-Based Approach Applied to Retinal Blood Vessel Segmentation
    Fraz, Muhammad Moazam
    Remagnino, Paolo
    Hoppe, Andreas
    Uyyanonvara, Bunyarit
    Rudnicka, Alicja R.
    Owen, Christopher G.
    Barman, Sarah A.
    IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING, 2012, 59 (09) : 2538 - 2548
  • [7] Quantification and Mitigation of Directional Pairwise Class Confusion Bias in a Chatbot Intent Classification Model
    Sayenju, Sudhashree
    Aygun, Ramazan
    Boardman, Jonathan
    Don, Duleep Prasanna Rathgamage
    Zhang, Yifan
    Franks, Bill
    Johnston, Sereres
    Lee, George
    Sullivan, Dan
    Modgil, Girish
    INTERNATIONAL JOURNAL OF SEMANTIC COMPUTING, 2022, 16 (04) : 497 - 520
  • [8] Intent recognition model based on sequential information and sentence features
    Wu, Tiefeng
    Wang, Miao
    Xi, Yunfang
    Zhao, Zhichao
    NEUROCOMPUTING, 2024, 566
  • [9] Intent recognition model based on sequential information and sentence features
    School of Information and Control Engineering, Qingdao University of Technology, Shandong, Qingdao
    266520, China
    Neurocomputing, 2024,
  • [10] Improved Ensemble learning for Classification Techniques Based on Majority Voting
    Rojarath, Artittayapron
    Songpan, Wararat
    Pong-inwong, Chakrit
    PROCEEDINGS OF 2016 IEEE 7TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING AND SERVICE SCIENCE (ICSESS 2016), 2016, : 107 - 110