STraVEns: Sentence Transformer Voting Ensemble for Intent Classification-Based Chatbot Model

Cited by: 0
Authors
Pravitasari, Anindya Apriliyanti [1]
Hamid Asnawi, Mohammad [2]
Helen, Afrida [3]
Handoko, Budhi [1]
Amor Kusuma, Dianne [4]
Herawan, Tutut [5]
Hendrawati, Triyani [1]
Affiliations
[1] Univ Padjadjaran, Fac Math & Nat Sci, Dept Stat, Bandung 45363, Indonesia
[2] Monash Univ, Fac Informat Technol, Dept Data Sci & Artificial Intelligence, Clayton, Vic 3800, Australia
[3] Univ Padjadjaran, Fac Math & Nat Sci, Dept Comp Sci, Bandung 45363, Indonesia
[4] Univ Padjadjaran, Fac Math & Nat Sci, Dept Math, Bandung 45363, Indonesia
[5] Univ Malaya, Fac Comp Sci & Informat Technol, Dept Informat Syst, Kuala Lumpur 50603, Malaysia
Source
IEEE ACCESS | 2024 / Volume 12
Keywords
Chatbots; Intent recognition; Transformers; Accuracy; Ensemble learning; Atmospheric modeling; Robustness; Training; Oral communication; Business; Chatbot; ensemble learning; intent classification; machine learning; natural language processing; sentence transformer; voting classifier;
DOI
10.1109/ACCESS.2024.3519223
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Natural Language Processing has advanced significantly in recent years, leading to the widespread adoption of chatbots based on Large Language Models (LLMs). These chatbots are popular because they can engage in context-aware conversations. However, deploying LLM-based chatbots can be resource-intensive, which makes them less suitable for smaller applications or focused tasks. To address this issue, we propose STraVEns (Sentence Transformer Voting Ensemble), a robust and flexible approach to intent classification for chatbots that includes both hard voting and soft voting ensembles of sentence transformers. The proposed method aims to improve the accuracy and versatility of intent-based chatbot models. The ensemble framework uses five sentence transformer models: RoBERTa, DistilRoBERTa, MPNet, MiniLM L6, and MiniLM L12. We evaluated the approach by training and testing on four distinct datasets, ATIS, IDE, Small Talk, and CLINC150, which cover a range of scenarios from general conversation to specific tasks and out-of-scope intent classification. The results show that the ensemble models outperformed previous benchmarks, achieving the highest accuracy and F1-scores across all datasets, which makes STraVEns a promising solution for intent classification-based chatbot models. The soft voting method provided flexibility and robustness, while hard voting ensured stability in specific contexts. Overall, our study suggests that ensemble-based approaches can enhance the performance of intent classification chatbot models and provide a scalable solution for various applications.
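The difference between the two voting schemes named in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the intent labels, the three-model setup, and the probability values are hypothetical, and each inner list stands in for the class probabilities that one sentence-transformer-based classifier might output for a single utterance.

```python
from collections import Counter

# Hypothetical intent labels; not taken from the paper's datasets.
INTENTS = ["book_flight", "check_fare", "out_of_scope"]

def hard_vote(probs):
    """Majority vote: each model casts one vote for its top class.

    probs[m][c] is the probability model m assigns to class c.
    Ties go to the class that was voted for first.
    """
    votes = [max(range(len(p)), key=p.__getitem__) for p in probs]
    return Counter(votes).most_common(1)[0][0]

def soft_vote(probs):
    """Average the class probabilities across models, then take the argmax."""
    n_classes = len(probs[0])
    mean = [sum(p[c] for p in probs) / len(probs) for c in range(n_classes)]
    return max(range(n_classes), key=mean.__getitem__)

# Made-up outputs of three classifiers for one utterance.
utterance_probs = [
    [0.90, 0.10, 0.00],  # model A: very confident in class 0
    [0.40, 0.60, 0.00],  # model B: mildly prefers class 1
    [0.45, 0.55, 0.00],  # model C: mildly prefers class 1
]

hard_label = INTENTS[hard_vote(utterance_probs)]
soft_label = INTENTS[soft_vote(utterance_probs)]
print(hard_label, soft_label)
```

In this made-up case the two schemes disagree: hard voting follows the two models that narrowly prefer the second intent, while soft voting lets the one highly confident model outweigh them.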
Pages: 197187-197200
Number of pages: 14
Related papers
50 in total
  • [21] A novel model based on a transformer for intent detection and slot filling
    Dapeng Li
    Shuliang Wang
    Boxiang Zhao
    Zhiqiang Ma
    Leixiao Li
    Urban Informatics, 3 (1):
  • [22] Chatbot Interaction with Artificial Intelligence: human data augmentation with T5 and language transformer ensemble for text classification
    Bird, Jordan J.
    Ekart, Aniko
    Faria, Diego R.
    JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2021, 14 (4) : 3129 - 3144
  • [23] A Classification-Based Graduates Employability Model for Tracer Study by MOHE
    Sapaat, Myzatul Akmam
    Mustapha, Aida
    Ahmad, Johanna
    Chamili, Khadijah
    Muhamad, Rahamirzam
    DIGITAL INFORMATION PROCESSING AND COMMUNICATIONS, PT 1, 2011, 188 : 277 - 287
  • [24] (Partial) user preference similarity as classification-based model similarity
    Bouza, Amancio
    Bernstein, Abraham
    SEMANTIC WEB, 2014, 5 (01) : 47 - 64
  • [25] Classification-based behavior model for detection of abnormal states in systems
    Vachkov, G
    Komatsu, K
    Fujii, S
    PROCEEDINGS OF THE 2004 INTERNATIONAL CONFERENCE ON INTELLIGENT MECHATRONICS AND AUTOMATION, 2004, : 611 - 616
  • [26] A classification-based prediction model of messenger RNA polyadenylation sites
    Ji, Guoli
    Wu, Xiaohui
    Shen, Yingjia
    Huang, Jiangyin
    Li, Qingshun Quinn
    JOURNAL OF THEORETICAL BIOLOGY, 2010, 265 (03) : 287 - 296
  • [27] Chatbot Interaction with Artificial Intelligence: human data augmentation with T5 and language transformer ensemble for text classification
    Jordan J. Bird
    Anikó Ekárt
    Diego R. Faria
    Journal of Ambient Intelligence and Humanized Computing, 2023, 14 : 3129 - 3144
  • [28] A classification-based glioma diffusion model using MRI data
    Morris, Marianne
    Greiner, Russell
    Sander, Joerg
    ADVANCES IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2006, 4013 : 98 - 109
  • [29] A Robust Ensemble Machine Learning Model with Advanced Voting Techniques for Comment Classification
    Shiplu, Ariful Islam
    Rahman, Md Mostafizer
    Watanobe, Yutaka
    BIG DATA ANALYTICS IN ASTRONOMY, SCIENCE, AND ENGINEERING, BDA 2023, 2024, 14516 : 141 - 159
  • [30] VOTING-BASED ENSEMBLE MODEL FOR NETWORK ANOMALY DETECTION
    Yang, Tzu-Hsin
    Lin, Yu-Tai
    Wu, Chao-Lun
    Wang, Chih-Yu
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 8543 - 8547