A Machine Learning-Based Pipeline for the Extraction of Insights from Customer Reviews

被引:0
|
作者
Lakatos, Robert [1 ,2 ]
Bogacsovics, Gergo [1 ,2 ]
Harangi, Balazs [1 ]
Lakatos, Istvan [1 ,2 ]
Tiba, Attila [1 ]
Toth, Janos [1 ]
Szabo, Marianna [2 ,3 ]
Hajdu, Andras [1 ]
机构
[1] Univ Debrecen, Fac Informat, Dept Data Sci & Visualizat, H-4032 Debrecen, Hungary
[2] Univ Debrecen, Doctoral Sch Informat, H-4032 Debrecen, Hungary
[3] Univ Debrecen, Fac Informat, Dept Appl Math & Probabil Theory, H-4032 Debrecen, Hungary
关键词
machine and deep learning; topic modeling; keyphrase extracting; natural language processing; INFERENCE;
D O I
10.3390/bdcc8030020
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The efficiency of natural language processing has improved dramatically with the advent of machine learning models, particularly neural network-based solutions. However, some tasks are still challenging, especially when considering specific domains. This paper presents a model that can extract insights from customer reviews using machine learning methods integrated into a pipeline. For topic modeling, our composite model uses transformer-based neural networks designed for natural language processing, vector-embedding-based keyword extraction, and clustering. The elements of our model have been integrated and tailored to better meet the requirements of efficient information extraction and topic modeling of the extracted information for opinion mining. Our approach was validated and compared with other state-of-the-art methods using publicly available benchmark datasets. The results show that our system performs better than existing topic modeling and keyword extraction methods in this task.
引用
收藏
页数:24
相关论文
共 50 条
  • [1] Data Representation in Machine Learning-Based Sentiment Analysis of Customer Reviews
    Shamshurin, Ivan
    [J]. PATTERN RECOGNITION AND MACHINE INTELLIGENCE, 2011, 6744 : 254 - 260
  • [2] Interpretable machine learning-based approach for customer segmentation for new product development from online product reviews
    Joung, Junegak
    Kim, Harrison
    [J]. INTERNATIONAL JOURNAL OF INFORMATION MANAGEMENT, 2023, 70
  • [3] Extraction of affective responses from customer reviews: an opinion mining and machine learning approach
    Li, Z.
    Tian, Z. G.
    Wang, J. W.
    Wang, W. M.
    [J]. INTERNATIONAL JOURNAL OF COMPUTER INTEGRATED MANUFACTURING, 2020, 33 (07) : 670 - 685
  • [4] MACHINE LEARNING-BASED SEVERITY ASSESSMENT OF PIPELINE DENTS
    Tang, Huang
    Sun, Jialin
    Di Blasi, Martin
    [J]. PROCEEDINGS OF 2022 14TH INTERNATIONAL PIPELINE CONFERENCE, IPC2022, VOL 1, 2022,
  • [5] A Machine Learning-Based Topic Extraction and Categorization of State Universities and Colleges (SUC) Customer Feedbacks
    Soriano, Lorna T.
    Palaoag, Thelma D.
    [J]. PROCEEDINGS OF THE 8TH INTERNATIONAL CONFERENCE ON INFORMATION COMMUNICATION AND MANAGEMENT (ICICM 2018), 2018, : 1 - 6
  • [6] Machine Learning-Based Feature Extraction and Selection
    Ruano-Ordas, David
    [J]. APPLIED SCIENCES-BASEL, 2024, 14 (15):
  • [7] Application of Machine Learning to Mining Customer Reviews
    Darbanibasmanj, Amir Abbas
    Persaud, Ajax
    Ruhi, Umar
    [J]. 25TH AMERICAS CONFERENCE ON INFORMATION SYSTEMS (AMCIS 2019), 2019,
  • [8] Machine Learning-Based Pipeline for High Accuracy Bioparticle Sizing
    Luo, Shaobo
    Zhang, Yi
    Nguyen, Kim Truc
    Feng, Shilun
    Shi, Yuzhi
    Liu, Yang
    Hutchinson, Paul
    Chierchia, Giovanni
    Talbot, Hugues
    Bourouina, Tarik
    Jiang, Xudong
    Liu, Ai Qun
    [J]. MICROMACHINES, 2020, 11 (12) : 1 - 12
  • [9] Machine Learning-Based Risk Model for Pipeline Integrity Management
    Zhang, Xiaoyue
    Tao, Chengcheng
    Huang, Ying
    [J]. COMPUTING IN CIVIL ENGINEERING 2023-RESILIENCE, SAFETY, AND SUSTAINABILITY, 2024, : 689 - 696
  • [10] OPINION EXTRACTION FROM CUSTOMER REVIEWS
    Loh, Han Tong
    Sun, Jie
    Wang, Jingjing
    Lu, Wen Feng
    [J]. ASME INTERNATIONAL DESIGN ENGINEERING TECHNICAL CONFERENCES AND COMPUTERS AND INFORMATION IN ENGINEERING CONFERENCE, PROCEEDINGS, VOL 2, PTS A AND B, 2010, : 753 - 758