A Hybrid Neural Network BERT-Cap Based on Pre-Trained Language Model and Capsule Network for User Intent Classification

被引:3
|
作者
Liu, Hai [1 ,2 ]
Liu, Yuanxia [1 ]
Wong, Leung-Pun [3 ]
Lee, Lap-Kei [3 ]
Hao, Tianyong [1 ,4 ]
机构
[1] South China Normal Univ, Sch Comp Sci, Guangzhou 510000, Peoples R China
[2] Guangzhou Key Lab Big Data & Intelligent Educ, Guangzhou 510000, Peoples R China
[3] Open Univ Hong Kong, Sch Sci & Technol, Kowloon, Hong Kong 999077, Peoples R China
[4] South China Normal Univ, Inst Adv Study Educ Dev Guangdong Hong Kong Macao, Guangzhou 510000, Peoples R China
基金
中国国家自然科学基金;
关键词
Signal encoding - Semantics - Speech processing - Text processing - Encoding (symbols);
D O I
10.1155/2020/8858852
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
User intent classification is a vital component of a question-answering system or a task-based dialogue system. In order to understand the goals of users' questions or discourses, the system categorizes user text into a set of pre-defined user intent categories. User questions or discourses are usually short in length and lack sufficient context; thus, it is difficult to extract deep semantic information from these types of text and the accuracy of user intent classification may be affected. To better identify user intents, this paper proposes a BERT-Cap hybrid neural network model with focal loss for user intent classification to capture user intents in dialogue. The model uses multiple transformer encoder blocks to encode user utterances and initializes encoder parameters with a pre-trained BERT. Then, it extracts essential features using a capsule network with dynamic routing after utterances encoding. Experiment results on four publicly available datasets show that our model BERT-Cap achieves a F1 score of 0.967 and an accuracy of 0.967, outperforming a number of baseline methods, indicating its effectiveness in user intent classification.
引用
收藏
页数:11
相关论文
共 50 条
  • [1] Research on Chinese Intent Recognition Based on BERT pre-trained model
    Zhang, Pan
    Huang, Li
    [J]. 2020 5TH INTERNATIONAL CONFERENCE ON MATHEMATICS AND ARTIFICIAL INTELLIGENCE (ICMAI 2020), 2020, : 128 - 132
  • [2] Patent classification with pre-trained Bert model
    Kahraman, Selen Yuecesoy
    Durmusoglu, Alptekin
    Dereli, Tuerkay
    [J]. JOURNAL OF THE FACULTY OF ENGINEERING AND ARCHITECTURE OF GAZI UNIVERSITY, 2024, 39 (04): : 2485 - 2496
  • [3] BSTC: A Fake Review Detection Model Based on a Pre-Trained Language Model and Convolutional Neural Network
    Lu, Junwen
    Zhan, Xintao
    Liu, Guanfeng
    Zhan, Xinrong
    Deng, Xiaolong
    [J]. ELECTRONICS, 2023, 12 (10)
  • [4] Painting Classification Using a Pre-trained Convolutional Neural Network
    Banerji, Sugata
    Sinha, Atreyee
    [J]. COMPUTER VISION, GRAPHICS, AND IMAGE PROCESSING, ICVGIP 2016, 2017, 10481 : 168 - 179
  • [5] Pre-trained Language Models with Limited Data for Intent Classification
    Kasthuriarachchy, Buddhika
    Chetty, Madhu
    Karmakar, Gour
    Walls, Darren
    [J]. 2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
  • [6] A Technique to Pre-trained Neural Network Language Model Customization to Software Development Domain
    Dudarin, Pavel, V
    Tronin, Vadim G.
    Svyatov, Kirill, V
    [J]. ARTIFICIAL INTELLIGENCE: (RCAI 2019), 2019, 1093 : 169 - 176
  • [7] Pre-Trained Convolutional Neural Network for Classification of Tanning Leather Image
    Winiarti, Sri
    Prahara, Adhi
    Murinto
    Ismi, Dewi Pramudi
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2018, 9 (01) : 212 - 217
  • [8] Classification of Atrial Fibrillation with Pre-Trained Convolutional Neural Network Models
    Qayyum, Abdul
    Meriaudeau, Fabrice
    Chan, Genevieve C. Y.
    [J]. 2018 IEEE-EMBS CONFERENCE ON BIOMEDICAL ENGINEERING AND SCIENCES (IECBES), 2018, : 594 - 599
  • [9] Classification of Pistachio Varieties Using Pre-trained Architectures and a Proposed Convolutional Neural Network Model
    Idress, Khaled Adil Dawood
    Oztekin, Yesim Benal
    Gadalla, Omsalma Alsadig Adam
    Baitu, Geofrey Prudence
    [J]. 15TH INTERNATIONAL CONGRESS ON AGRICULTURAL MECHANIZATION AND ENERGY IN AGRICULTURE, ANKAGENG 2023, 2024, 458 : 148 - 163
  • [10] Transfer Learning for Mammogram Classification Using Pre-Trained Convolutional Neural Network
    Yasuda, K.
    Tsuru, H.
    Ohki, M.
    [J]. MEDICAL PHYSICS, 2017, 44 (06) : 3102 - 3102