A Machine Learning-Based Pipeline for the Extraction of Insights from Customer Reviews

被引:0
|
作者
Lakatos, Robert [1 ,2 ]
Bogacsovics, Gergo [1 ,2 ]
Harangi, Balazs [1 ]
Lakatos, Istvan [1 ,2 ]
Tiba, Attila [1 ]
Toth, Janos [1 ]
Szabo, Marianna [2 ,3 ]
Hajdu, Andras [1 ]
机构
[1] Univ Debrecen, Fac Informat, Dept Data Sci & Visualizat, H-4032 Debrecen, Hungary
[2] Univ Debrecen, Doctoral Sch Informat, H-4032 Debrecen, Hungary
[3] Univ Debrecen, Fac Informat, Dept Appl Math & Probabil Theory, H-4032 Debrecen, Hungary
关键词
machine and deep learning; topic modeling; keyphrase extracting; natural language processing; INFERENCE;
D O I
10.3390/bdcc8030020
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The efficiency of natural language processing has improved dramatically with the advent of machine learning models, particularly neural network-based solutions. However, some tasks are still challenging, especially when considering specific domains. This paper presents a model that can extract insights from customer reviews using machine learning methods integrated into a pipeline. For topic modeling, our composite model uses transformer-based neural networks designed for natural language processing, vector-embedding-based keyword extraction, and clustering. The elements of our model have been integrated and tailored to better meet the requirements of efficient information extraction and topic modeling of the extracted information for opinion mining. Our approach was validated and compared with other state-of-the-art methods using publicly available benchmark datasets. The results show that our system performs better than existing topic modeling and keyword extraction methods in this task.
引用
收藏
页数:24
相关论文
共 50 条
  • [31] Customer Reviews Analysis Based on Information Extraction Approaches
    Zhang, Haiqing
    Sekhari, Aicha
    Fourli-Kartsouni, Florendia
    Ouzrout, Yacine
    Bouras, Abdelaziz
    [J]. PRODUCT LIFECYCLE MANAGEMENT IN THE ERA OF INTERNET OF THINGS, PLM 2015, 2016, 467 : 227 - 237
  • [32] Technical Aspect Extraction from Customer Reviews Based on Seeded Word Clustering
    Davril, Jean-Marc
    Leclercq, Tony
    Cordy, Maxime
    Heymans, Patrick
    [J]. NATURAL LANGUAGE PROCESSING AND INFORMATION SYSTEMS, NLDB 2017, 2017, 10260 : 97 - 109
  • [33] Feature extraction for machine learning-based intrusion detection in IoT networks
    Mohanad Sarhan
    Siamak Layeghy
    Nour Moustafa
    Marcus Gallagher
    Marius Portmann
    [J]. Digital Communications and Networks., 2024, 10 (01) - 216
  • [34] Feature extraction for machine learning-based intrusion detection in IoT networks
    Sarhan, Mohanad
    Layeghy, Siamak
    Moustafa, Nour
    Gallagher, Marcus
    Portmann, Marius
    [J]. DIGITAL COMMUNICATIONS AND NETWORKS, 2024, 10 (01) : 205 - 216
  • [35] Evaluation of machine learning-based information extraction algorithms: criticisms and recommendations
    Alberto Lavelli
    Mary Elaine Califf
    Fabio Ciravegna
    Dayne Freitag
    Claudio Giuliano
    Nicholas Kushmerick
    Lorenza Romano
    Neil Ireson
    [J]. Language Resources and Evaluation, 2008, 42 : 361 - 393
  • [36] Evaluation of machine learning-based information extraction algorithms: criticisms and recommendations
    Lavelli, Alberto
    Califf, Mary Elaine
    Ciravegna, Fabio
    Freitag, Dayne
    Giuliano, Claudio
    Kushmerick, Nicholas
    Romano, Lorenza
    Ireson, Neil
    [J]. LANGUAGE RESOURCES AND EVALUATION, 2008, 42 (04) : 361 - 393
  • [37] Machine learning-based inference analysis for customer preference on e-service features
    Lu, Zi
    Zhang, Zui
    Bai, Chenggang
    Zhang, Guangquan
    [J]. DYNAMICS OF CONTINUOUS DISCRETE AND IMPULSIVE SYSTEMS-SERIES B-APPLICATIONS & ALGORITHMS, 2006, 13 : 61 - 65
  • [38] Machine learning-based e-commerce platform repurchase customer prediction model
    Liu, Cheng-Ju
    Huang, Tien-Shou
    Ho, Ping-Tsan
    Huang, Jui-Chan
    Hsieh, Ching-Tang
    [J]. PLOS ONE, 2020, 15 (12):
  • [39] Context-based sentiment analysis on customer reviews using machine learning linear models
    Chinnalagu, Anandan
    Durairaj, Ashok Kumar
    [J]. PEERJ COMPUTER SCIENCE, 2021, 7
  • [40] DEEP LEARNING-BASED CUSTOMER COMPLAINT MANAGEMENT
    Anagun, Yildiray
    Bolel, Nur Sultan
    Isik, Sahin
    Ozkan, Serif Ercan
    [J]. JOURNAL OF ORGANIZATIONAL COMPUTING AND ELECTRONIC COMMERCE, 2022, 32 (3-4) : 217 - 231