Iterative threshold-based Naive bayes classifier

被引:1
|
作者
Romano, Maurizio [1 ]
Zammarchi, Gianpaolo [1 ]
Conversano, Claudio [1 ]
机构
[1] Univ Cagliari, Dept Econ & Business Sci, Viale Fra Ignazio 17, I-09123 Cagliari, Italy
来源
STATISTICAL METHODS AND APPLICATIONS | 2024年 / 33卷 / 01期
关键词
Naive bayes; Post-hoc analysis; Customer satisfaction; Sentiment analysis; Natural language processing; Booking.com; SENTIMENT ANALYSIS; REVIEWS;
D O I
10.1007/s10260-023-00721-1
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
The iterative Threshold-based Naive Bayes (iTb-NB) classifier is introduced as a (simple) improved version of the previously introduced non-iterative Threshold-based Naive Bayes (Tb-NB) classifier. iTb-NB starts from a Natural Language text-corpus and allows the user to quantify with a numeric value a sentiment (positive or negative) from a specific test. Differently from Tb-NB, iTb-NB is an algorithm aimed at estimating multiple threshold values that concur to refine Tb-NB's decision rules when classifying a text into positive (negative) based on its content. Observations with sentiment scores close to the threshold are marked to be reclassified, hence a new decision rule is defined for them. Such "iterative" process improves the quality of predictions w.r.t. Tb-NB but keeping the possibility to utilize its results as the input of useful post-hoc analyses. The effectiveness of iTb-NB is evaluated analyzing hotel guests' reviews from all hotels located in the Sardinia region and available on Booking.com. Furthermore, iTb-NB is compared with Tb-NB in terms of model accuracy, resistance to noise, and computational efficiency.
引用
收藏
页码:235 / 265
页数:31
相关论文
共 50 条
  • [1] Threshold-based Naive Bayes classifier
    Romano, Maurizio
    Contu, Giulia
    Mola, Francesco
    Conversano, Claudio
    ADVANCES IN DATA ANALYSIS AND CLASSIFICATION, 2024, 18 (02) : 325 - 361
  • [2] Stairway to heaven: An emotional journey in Divina Commedia with threshold-based Naïve Bayes classifier
    Romano, Maurizio
    Conversano, Claudio
    MACHINE LEARNING WITH APPLICATIONS, 2025, 19
  • [3] A Naive Bayes Classifier Based on Neighborhood Granulation
    Fu, Xingyu
    Chen, Yingyue
    Yao, Zhiyuan
    Chen, Yumin
    Zeng, Nianfeng
    ROUGH SETS, IJCRS 2022, 2022, 13633 : 132 - 142
  • [4] A Focused Crawler Based on Naive Bayes Classifier
    Wang, Wenxian
    Chen, Xingshu
    Zou, Yongbin
    Wang, Haizhou
    Dai, Zongkun
    2010 THIRD INTERNATIONAL SYMPOSIUM ON INTELLIGENT INFORMATION TECHNOLOGY AND SECURITY INFORMATICS (IITSI 2010), 2010, : 517 - 521
  • [5] Naive Bayes Classifier Based Partitioner for MapReduce
    Chen, Lei
    Lu, Wei
    Bao, Ergude
    Wang, Liqiang
    Xing, Weiwei
    Cai, Yuanyuan
    IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2018, E101A (05) : 778 - 786
  • [6] Improving Usual Naive Bayes Classifier Performances with Neural Naive Bayes based Models
    Azeraf, Elie
    Monfrini, Emmanuel
    Pieczynski, Wojciech
    PROCEEDINGS OF THE 11TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION APPLICATIONS AND METHODS (ICPRAM), 2021, : 315 - 322
  • [7] The Optimization of Threshold-Based Naive Bayesian Algorithm
    Wang Xin
    Jiang Hua
    THIRD INTERNATIONAL CONFERENCE ON GENETIC AND EVOLUTIONARY COMPUTING, 2009, : 762 - 764
  • [8] Iterative naive Bayes
    Gama, J
    DISCOVERY SCIENCE, PROCEEDINGS, 1999, 1721 : 80 - 91
  • [9] Chinese text classification by the Naive Bayes Classifier and the associative classifier with multiple confidence threshold values
    Lu, Shing-Hwa
    Chiang, Ding-An
    Keh, Huan-Chao
    Huang, Hui-Hua
    KNOWLEDGE-BASED SYSTEMS, 2010, 23 (06) : 598 - 604
  • [10] Naive Bayes text classifier
    Zhang, Haiyi
    Li, Di
    GRC: 2007 IEEE INTERNATIONAL CONFERENCE ON GRANULAR COMPUTING, PROCEEDINGS, 2007, : 708 - 711