Document representation based on probabilistic word clustering in customer-voice classification

Authors
Younghoon Lee
Seokmin Song
Sungzoon Cho
Jinhae Choi
Affiliations
[1] Seoul National University, Department of Industrial Engineering and Institute for Industrial Systems Innovation
[2] LG Electronics, Data Driven User Experience Team, Mobile Communication Lab
Keywords
Probabilistic word clustering; Document representation; Customer-voice; Classification
DOI
Not available
Abstract
Customer-voice data play an important role in fields such as marketing, product planning, and quality assurance. However, classifying customer-voice data is problematic because it relies on manual processes. This study focuses on building automatic classifiers for customer-voice data using newly proposed document representation methods based on neural embedding and probabilistic word clustering. Semantically similar terms are grouped into a common cluster: the words represented by neural embedding are clustered according to the membership strength of each word relative to each cluster, which is derived from a probabilistic clustering method such as fuzzy C-means clustering or a Gaussian mixture model. Because it considers membership strength, the proposed method is expected to suit the classification of customer-voice data consisting of unstructured text. The results show that the proposed method achieved an accuracy of 89.24% in terms of representational effectiveness and an accuracy of 87.76% in terms of classification performance on customer-voice data comprising 12 classes. Further, the method provided an intuitive interpretation of the generated representation.
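The idea of representing a document through the membership strength of its words can be illustrated with a small sketch. This is not the authors' implementation: the 2-D vectors below are made-up stand-ins for neural word embeddings, the cluster centers are hypothetical and assumed to come from an earlier clustering run, and averaging the per-word memberships is only one plausible way to aggregate them. The membership formula is the standard fuzzy C-means one with fuzzifier m.

```python
import math

# Toy 2-D "embeddings" standing in for neural word vectors (made-up values).
EMBEDDINGS = {
    "battery": (0.9, 0.1),
    "charger": (0.8, 0.2),
    "screen": (0.1, 0.9),
    "display": (0.2, 0.8),
    "slow": (0.5, 0.5),
}

# Hypothetical cluster centers, assumed to come from an earlier clustering run.
CENTERS = [(0.85, 0.15), (0.15, 0.85)]


def fcm_memberships(vec, centers, m=2.0):
    """Fuzzy C-means membership of one vector in each cluster (fuzzifier m > 1)."""
    dists = [math.dist(vec, c) for c in centers]
    # A vector that coincides with a center belongs fully to that center.
    if any(d == 0.0 for d in dists):
        hits = [1.0 if d == 0.0 else 0.0 for d in dists]
        return [h / sum(hits) for h in hits]
    power = 2.0 / (m - 1.0)
    # u_i = 1 / sum_k (d_i / d_k)^(2 / (m - 1)); the memberships sum to 1.
    return [1.0 / sum((d_i / d_k) ** power for d_k in dists) for d_i in dists]


def represent(tokens, embeddings, centers):
    """Average the membership vectors of the known words in a document."""
    rows = [fcm_memberships(embeddings[t], centers) for t in tokens if t in embeddings]
    if not rows:
        return [0.0] * len(centers)
    return [sum(col) / len(rows) for col in zip(*rows)]


doc = "battery and charger are slow".split()
print(represent(doc, EMBEDDINGS, CENTERS))
```

Each component of the resulting vector can be read as how strongly the document leans toward one word cluster, which is the kind of interpretability the abstract refers to. A vector of this form could then be passed to any standard classifier.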
Pages: 221 - 232
Number of pages: 11