Iterative threshold-based Naive bayes classifier

被引:0
|
作者
Romano, Maurizio [1 ]
Zammarchi, Gianpaolo [1 ]
Conversano, Claudio [1 ]
机构
[1] Univ Cagliari, Dept Econ & Business Sci, Viale Fra Ignazio 17, I-09123 Cagliari, Italy
来源
STATISTICAL METHODS AND APPLICATIONS | 2024年 / 33卷 / 01期
关键词
Naive bayes; Post-hoc analysis; Customer satisfaction; Sentiment analysis; Natural language processing; Booking.com; SENTIMENT ANALYSIS; REVIEWS;
D O I
10.1007/s10260-023-00721-1
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
The iterative Threshold-based Naive Bayes (iTb-NB) classifier is introduced as a (simple) improved version of the previously introduced non-iterative Threshold-based Naive Bayes (Tb-NB) classifier. iTb-NB starts from a Natural Language text-corpus and allows the user to quantify with a numeric value a sentiment (positive or negative) from a specific test. Differently from Tb-NB, iTb-NB is an algorithm aimed at estimating multiple threshold values that concur to refine Tb-NB's decision rules when classifying a text into positive (negative) based on its content. Observations with sentiment scores close to the threshold are marked to be reclassified, hence a new decision rule is defined for them. Such "iterative" process improves the quality of predictions w.r.t. Tb-NB but keeping the possibility to utilize its results as the input of useful post-hoc analyses. The effectiveness of iTb-NB is evaluated analyzing hotel guests' reviews from all hotels located in the Sardinia region and available on Booking.com. Furthermore, iTb-NB is compared with Tb-NB in terms of model accuracy, resistance to noise, and computational efficiency.
引用
收藏
页码:235 / 265
页数:31
相关论文
共 50 条
  • [21] A Smoothed Naive Bayes-Based Classifier for Activity Recognition
    Sarkar, A. M. Jehad
    Lee, Young-Koo
    Lee, Sungyoung
    [J]. IETE TECHNICAL REVIEW, 2010, 27 (02) : 107 - 119
  • [22] RBNBC: Repeat Based Naive Bayes Classifier for Biological Sequences
    Rani, Pratibha
    Pudi, Vikrarn
    [J]. ICDM 2008: EIGHTH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2008, : 989 - 994
  • [23] Naive Bayes Classifier based watermark detection in wavelet transform
    Elbasi, Ersin
    Eskicioglu, Ahmet M.
    [J]. MULTIMEDIA CONTENT REPRESENTATION, CLASSIFICATION AND SECURITY, 2006, 4105 : 232 - 240
  • [24] Naive Bayes Based Classifier for Credit Card Fraud Discovery
    Ogundokun, Roseline Oluwaseun
    Misra, Sanjay
    Fatigun, Olufunmilayo Joyce
    Adeniyi, Jide Kehinde
    [J]. INFORMATION SYSTEMS (EMCIS 2021), 2022, 437 : 515 - 526
  • [25] A Hybrid Distance-Based and Naive Bayes Online Classifier
    Jedrzejowicz, Joanna
    Jedrzejowicz, Piotr
    [J]. COMPUTATIONAL COLLECTIVE INTELLIGENCE (ICCCI 2015), PT II, 2015, 9330 : 213 - 222
  • [26] A Distributed Chinese Naive Bayes Classifier Based on Word Embedding
    Feng, Mengke
    Wu, Guoshi
    [J]. PROCEEDINGS OF THE 2016 4TH INTERNATIONAL CONFERENCE ON MACHINERY, MATERIALS AND COMPUTING TECHNOLOGY, 2016, 60 : 1121 - 1127
  • [27] Opinion Based Book Recommendation Using Naive Bayes Classifier
    Tewari, Anand Shanker
    Ansari, Tasif Sultan
    Barman, Asim Gopal
    [J]. 2014 INTERNATIONAL CONFERENCE ON CONTEMPORARY COMPUTING AND INFORMATICS (IC3I), 2014, : 139 - 144
  • [28] Dynamic Classifier Selection Based on Imprecise Probabilities: A Case Study for the Naive Bayes Classifier
    Li, Meizhu
    De Bock, Jasper
    de Cooman, Gert
    [J]. UNCERTAINTY MODELLING IN DATA SCIENCE, 2019, 832 : 149 - 156
  • [29] One generalization of the naive Bayes to fuzzy sets and the design of the fuzzy naive Bayes classifier
    Zheng, JC
    Tang, YC
    [J]. ARTIFICIAL INTELLIGENCE AND KNOWLEDGE ENGINEERING APPLICATIONS: A BIOINSPIRED APPROACH, PT 2, PROCEEDINGS, 2005, 3562 : 281 - 290
  • [30] Weighted Naive Bayes Classifier on Categorical Features
    Omura, Kazuhiro
    Kudo, Mineichi
    Endo, Tomomi
    Murai, Tetsuya
    [J]. 2012 12TH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS DESIGN AND APPLICATIONS (ISDA), 2012, : 865 - 870