STraVEns: Sentence Transformer Voting Ensemble for Intent Classification-Based Chatbot Model

Cited by: 0

Authors:

Pravitasari, Anindya Apriliyanti ^{[1]}

Hamid Asnawi, Mohammad ^{[2]}

Helen, Afrida ^{[3]}

Handoko, Budhi ^{[1]}

Amor Kusuma, Dianne ^{[4]}

Herawan, Tutut ^{[5]}

Hendrawati, Triyani ^{[1]}

Affiliations:

[1] Univ Padjadjaran, Fac Math & Nat Sci, Dept Stat, Bandung 45363, Indonesia

[2] Monash Univ, Fac Informat Technol, Dept Data Sci & Artificial Intelligence, Clayton, Vic 3800, Australia

[3] Univ Padjadjaran, Fac Math & Nat Sci, Dept Comp Sci, Bandung 45363, Indonesia

[4] Univ Padjadjaran, Fac Math & Nat Sci, Dept Math, Bandung 45363, Indonesia

[5] Univ Malaya, Fac Comp Sci & Informat Technol, Dept Informat Syst, Kuala Lumpur 50603, Malaysia

Source:

IEEE ACCESS | 2024 / Volume 12

Keywords:

Chatbots; Intent recognition; Transformers; Accuracy; Ensemble learning; Atmospheric modeling; Robustness; Training; Oral communication; Business; Chatbot; ensemble learning; intent classification; machine learning; natural language processing; sentence transformer; voting classifier

DOI:

10.1109/ACCESS.2024.3519223

Chinese Library Classification (CLC) code:

TP [Automation Technology, Computer Technology]

Discipline classification code:

0812

Abstract:

Natural Language Processing has advanced significantly in recent years, leading to the widespread adoption of chatbots based on Large Language Models (LLMs). These chatbots are popular because they can engage in context-aware conversations. However, deploying LLM-based chatbots can be resource-intensive, making them less suitable for smaller applications or focused tasks. To address this issue, we propose a robust and flexible approach to intent classification for chatbots, STraVEns (Sentence Transformer Voting Ensemble), which includes both hard voting and soft voting ensembles of sentence transformers. The proposed method aims to improve the accuracy and versatility of intent-based chatbot models. The ensemble framework uses five sentence transformer models: RoBERTa, DistilRoBERTa, MPNet, MiniLM L6, and MiniLM L12. We evaluated the approach by training and testing on four distinct datasets, ATIS, IDE, Small Talk, and CLINC150, which cover a range of scenarios from general conversation to specific tasks and out-of-scope intent classification. The results show that our ensemble models outperformed previous benchmarks, achieving the highest accuracy and F1-scores across all datasets, and indicate that STraVEns is a promising solution for intent classification-based chatbot models. The soft voting method provided flexibility and robustness, while hard voting ensured stability in specific contexts. Overall, our study suggests that ensemble-based approaches can enhance the performance of intent classification chatbot models, providing a scalable solution for various applications.
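
The abstract distinguishes hard voting from soft voting but does not spell out the rules. The sketch below illustrates the standard definitions only, and is not taken from the paper: hard voting takes the majority of each model's top-1 prediction, while soft voting averages the class probabilities and picks the largest mean. The function names, the tie-breaking rule (lowest class index wins), and the probability values are all assumptions made for illustration.

```python
from collections import Counter

def hard_vote(prob_rows):
    """Majority vote over each model's top-1 class.

    Ties are broken in favor of the lowest class index (an assumption).
    """
    top1 = [max(range(len(p)), key=p.__getitem__) for p in prob_rows]
    counts = Counter(top1)
    best = max(counts.values())
    return min(c for c, n in counts.items() if n == best)

def soft_vote(prob_rows):
    """Average class probabilities across models, then take the argmax."""
    n_classes = len(prob_rows[0])
    mean = [sum(p[c] for p in prob_rows) / len(prob_rows) for c in range(n_classes)]
    return max(range(n_classes), key=mean.__getitem__)

# Hypothetical probabilities from three classifiers over three intents.
probs = [
    [0.40, 0.35, 0.25],  # model A narrowly favors intent 0
    [0.45, 0.40, 0.15],  # model B narrowly favors intent 0
    [0.05, 0.90, 0.05],  # model C strongly favors intent 1
]

print(hard_vote(probs))  # two of three models pick intent 0
print(soft_vote(probs))  # averaged probabilities favor intent 1
```

With these made-up inputs the two rules disagree: hard voting counts only the winning class per model, whereas soft voting lets one confident model outweigh two uncertain ones.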


Pages: 197187-197200

Page count: 14

50 related records in total

[1] Unifying Sentence Transformer Embedding and Softmax Voting Ensemble for Accurate News Category Prediction
Khosa, Saima
Mehmood, Arif
Rizwan, Muhammad
COMPUTERS, 2023, 12 (07)
[2] Probability-Weighted Voting Ensemble Learning for Classification Model
Rojarath, Artittayapron
Songpan, Wararat
JOURNAL OF ADVANCES IN INFORMATION TECHNOLOGY, 2020, 11 (04) : 217 - 227
[3] ResNet and Transformer Hybrid Malware Classification Model Based on Ensemble Learning
Li, Kewei
Liu, Fudong
PROCEEDINGS OF 2023 7TH INTERNATIONAL CONFERENCE ON ELECTRONIC INFORMATION TECHNOLOGY AND COMPUTER ENGINEERING, EITCE 2023, 2023, : 1269 - 1275
[4] Voting Ensemble SVM Model for Deep CNN Based Breast Histopathology Classification
Chowdhary, Jyoti
Sankaran, Praveen
2022 IEEE 19TH INDIA COUNCIL INTERNATIONAL CONFERENCE, INDICON, 2022,
[5] An ensemble classification-based approach to detect attack level of SQL injections
Kasim, Omer
JOURNAL OF INFORMATION SECURITY AND APPLICATIONS, 2021, 59
[6] An Ensemble Classification-Based Approach Applied to Retinal Blood Vessel Segmentation
Fraz, Muhammad Moazam
Remagnino, Paolo
Hoppe, Andreas
Uyyanonvara, Bunyarit
Rudnicka, Alicja R.
Owen, Christopher G.
Barman, Sarah A.
IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING, 2012, 59 (09) : 2538 - 2548
[7] Quantification and Mitigation of Directional Pairwise Class Confusion Bias in a Chatbot Intent Classification Model
Sayenju, Sudhashree
Aygun, Ramazan
Boardman, Jonathan
Don, Duleep Prasanna Rathgamage
Zhang, Yifan
Franks, Bill
Johnston, Sereres
Lee, George
Sullivan, Dan
Modgil, Girish
INTERNATIONAL JOURNAL OF SEMANTIC COMPUTING, 2022, 16 (04) : 497 - 520
[8] Intent recognition model based on sequential information and sentence features
Wu, Tiefeng
Wang, Miao
Xi, Yunfang
Zhao, Zhichao
NEUROCOMPUTING, 2024, 566
[10] Improved Ensemble learning for Classification Techniques Based on Majority Voting
Rojarath, Artittayapron
Songpan, Wararat
Pong-inwong, Chakrit
PROCEEDINGS OF 2016 IEEE 7TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING AND SERVICE SCIENCE (ICSESS 2016), 2016, : 107 - 110
