Intent Identification in Unattended Customer Queries Using an Unsupervised Approach

被引:1
|
作者
Rebelo, Hugo D. [1 ]
de Oliveira, Lucas A. F. [2 ]
Almeida, Gustavo M. [3 ]
Sotomayor, Cesar A. M. [1 ]
Rochocz, Geraldo L. [1 ]
Melo, Willian E. D. [4 ]
机构
[1] Passeio Corp, Radix Engn & Software, R Passeio 38,Tower 2, BR-20021290 Rio De Janeiro, RJ, Brazil
[2] Radix Engn & Software, R Santa Rita Durao 444, BR-30140110 Belo Horizonte, MG, Brazil
[3] Univ Fed Minas Gerais, Sch Engn, Dept Chem Engn, Av Antonio Carlos 6627, BR-31270901 Belo Horizonte, MG, Brazil
[4] Cemig Distribut SA, Av Barbacena 1200, BR-30190924 Belo Horizonte, MG, Brazil
关键词
Customer behaviour; customer intent; unsupervised learning; information system; text analytics; ML; LATENT; BUSINESS; REVIEWS; QUALITY;
D O I
10.1142/S0219649221500374
中图分类号
G25 [图书馆学、图书馆事业]; G35 [情报学、情报工作];
学科分类号
1205 ; 120501 ;
摘要
Customer's satisfaction is crucial for companies worldwide. An integrated strategy composes omnichannel communication systems, in which chabot is widely used. This system is supervised, and the key point is that the required training data are originally unlabelled. Labelling data manually is unfeasible mainly nowadays due to the considerable volume. Moreover, customer behaviour is often hidden in the data even for experts. This work proposes a methodology to find unknown entities and intents automatically using unsupervised learning. This is based on natural language processing (NLP) for text data preparation and on machine learning (ML) for clustering model identification. Several combinations for preprocessing, vectorisation, dimensionality reduction and clustering techniques, were investigated. The case study refers to a Brazilian electric energy company, with a data set of failed customer queries, that is, not met by the company for any reason. They correspond to about 30% (4,044 queries) of the original data set. The best identified intent model employed stemming for preprocessing, word frequency analysis for vectorisation, latent Dirichlet allocation (LDA) for dimensionality reduction, and mini-batch k-means for clustering. This system was able to allocate 62% of the failed queries in one of the seven found intents. For instance, this new labelled data can be used for the training of NLP-based chatbots contributing to a greater generalisation capacity, and ultimately, to increase customer satisfaction.
引用
收藏
页数:26
相关论文
共 50 条
  • [1] Classifying the user intent of User intent of web queries web queries using k-means clustering
    Kathuria, Ashish
    Jansen, Bernard J.
    Hafernik, Carolyn
    Spink, Amanda
    [J]. INTERNET RESEARCH, 2010, 20 (05) : 563 - 581
  • [2] Intent Identification and Entity Extraction for Healthcare Queries in Indic Languages
    Mullick, Ankan
    Mondal, Ishani
    Ray, Sourjyadip
    Raghav, R.
    Sai Chaitanya, G.
    Goyal, Pawan
    [J]. arXiv, 2023,
  • [3] Intent Identification and Entity Extraction for Healthcare Queries in Indic Languages
    Mullick, Ankan
    Mondal, Ishani
    Ray, Sourjyadip
    Raghav, R.
    Chaitanya, G. Sai
    Goyal, Pawan
    [J]. 17TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EACL 2023, 2023, : 1870 - 1881
  • [4] Unsupervised Affix Identification Approach using Probabilistic Dependence
    Alsheddi, Abeer
    Khorsi, Ahmed
    [J]. 2018 1ST INTERNATIONAL CONFERENCE ON COMPUTER APPLICATIONS & INFORMATION SECURITY (ICCAIS' 2018), 2018,
  • [5] Unsupervised Identification of Synonymous Query Intent Templates for Attribute Intents
    Li, Yanen
    Hsu, Bo-June
    Zhai, ChengXiang
    [J]. PROCEEDINGS OF THE 22ND ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT (CIKM'13), 2013, : 2029 - 2038
  • [6] An unsupervised approach to language identification
    Pellegrino, F
    André-Obrecht, R
    [J]. ICASSP '99: 1999 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS VOLS I-VI, 1999, : 833 - 836
  • [7] A fast approach for unsupervised karst feature identification using GPU
    Afonso, Luis C. S.
    Basso, Mateus
    Kuroda, Michelle C.
    Vidal, Alexandre C.
    Papa, Joao P.
    [J]. COMPUTERS & GEOSCIENCES, 2018, 119 : 1 - 8
  • [8] Identification of rhinitis phenotypes using an unsupervised approach in the Constances cohort
    Savoure, Marine
    Bousquet, Jean
    Orsi, Laurent
    Goldberg, Marcel
    Leynaert, Benedicte
    Nadif, Mohamed
    Ribet, Celine
    Zins, Marie
    Jacquemin, Benedicte
    Nadif, Rachel
    [J]. EUROPEAN RESPIRATORY JOURNAL, 2023, 62
  • [9] Customer Intent Prediction using Sentiment Analysis Techniques
    Lye, Say Hong
    Teh, Phoey Lee
    [J]. PROCEEDINGS OF THE THE 11TH IEEE INTERNATIONAL CONFERENCE ON INTELLIGENT DATA ACQUISITION AND ADVANCED COMPUTING SYSTEMS: TECHNOLOGY AND APPLICATIONS (IDAACS'2021), VOL 1, 2021, : 185 - 190
  • [10] Subjective Search Intent Predictions using Customer Reviews
    Boteanu, Adrian
    Dutile, Emily
    Kiezun, Adam
    Artzi, Shay
    [J]. CHIIR'20: PROCEEDINGS OF THE 2020 CONFERENCE ON HUMAN INFORMATION INTERACTION AND RETRIEVAL, 2020, : 303 - 307