Iterative threshold-based Naive bayes classifier

被引:0
|
作者
Romano, Maurizio [1 ]
Zammarchi, Gianpaolo [1 ]
Conversano, Claudio [1 ]
机构
[1] Univ Cagliari, Dept Econ & Business Sci, Viale Fra Ignazio 17, I-09123 Cagliari, Italy
来源
STATISTICAL METHODS AND APPLICATIONS | 2024年 / 33卷 / 01期
关键词
Naive bayes; Post-hoc analysis; Customer satisfaction; Sentiment analysis; Natural language processing; Booking.com; SENTIMENT ANALYSIS; REVIEWS;
D O I
10.1007/s10260-023-00721-1
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
The iterative Threshold-based Naive Bayes (iTb-NB) classifier is introduced as a (simple) improved version of the previously introduced non-iterative Threshold-based Naive Bayes (Tb-NB) classifier. iTb-NB starts from a Natural Language text-corpus and allows the user to quantify with a numeric value a sentiment (positive or negative) from a specific test. Differently from Tb-NB, iTb-NB is an algorithm aimed at estimating multiple threshold values that concur to refine Tb-NB's decision rules when classifying a text into positive (negative) based on its content. Observations with sentiment scores close to the threshold are marked to be reclassified, hence a new decision rule is defined for them. Such "iterative" process improves the quality of predictions w.r.t. Tb-NB but keeping the possibility to utilize its results as the input of useful post-hoc analyses. The effectiveness of iTb-NB is evaluated analyzing hotel guests' reviews from all hotels located in the Sardinia region and available on Booking.com. Furthermore, iTb-NB is compared with Tb-NB in terms of model accuracy, resistance to noise, and computational efficiency.
引用
收藏
页码:235 / 265
页数:31
相关论文
共 50 条
  • [1] Threshold-based Naive Bayes classifier
    Romano, Maurizio
    Contu, Giulia
    Mola, Francesco
    Conversano, Claudio
    [J]. ADVANCES IN DATA ANALYSIS AND CLASSIFICATION, 2024, 18 (02) : 325 - 361
  • [2] A Naive Bayes Classifier Based on Neighborhood Granulation
    Fu, Xingyu
    Chen, Yingyue
    Yao, Zhiyuan
    Chen, Yumin
    Zeng, Nianfeng
    [J]. ROUGH SETS, IJCRS 2022, 2022, 13633 : 132 - 142
  • [3] A Focused Crawler Based on Naive Bayes Classifier
    Wang, Wenxian
    Chen, Xingshu
    Zou, Yongbin
    Wang, Haizhou
    Dai, Zongkun
    [J]. 2010 THIRD INTERNATIONAL SYMPOSIUM ON INTELLIGENT INFORMATION TECHNOLOGY AND SECURITY INFORMATICS (IITSI 2010), 2010, : 517 - 521
  • [4] Naive Bayes Classifier Based Partitioner for MapReduce
    Chen, Lei
    Lu, Wei
    Bao, Ergude
    Wang, Liqiang
    Xing, Weiwei
    Cai, Yuanyuan
    [J]. IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2018, E101A (05) : 778 - 786
  • [5] Improving Usual Naive Bayes Classifier Performances with Neural Naive Bayes based Models
    Azeraf, Elie
    Monfrini, Emmanuel
    Pieczynski, Wojciech
    [J]. PROCEEDINGS OF THE 11TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION APPLICATIONS AND METHODS (ICPRAM), 2021, : 315 - 322
  • [6] The Optimization of Threshold-Based Naive Bayesian Algorithm
    Wang Xin
    Jiang Hua
    [J]. THIRD INTERNATIONAL CONFERENCE ON GENETIC AND EVOLUTIONARY COMPUTING, 2009, : 762 - 764
  • [7] Iterative naive Bayes
    Gama, J
    [J]. DISCOVERY SCIENCE, PROCEEDINGS, 1999, 1721 : 80 - 91
  • [8] Chinese text classification by the Naive Bayes Classifier and the associative classifier with multiple confidence threshold values
    Lu, Shing-Hwa
    Chiang, Ding-An
    Keh, Huan-Chao
    Huang, Hui-Hua
    [J]. KNOWLEDGE-BASED SYSTEMS, 2010, 23 (06) : 598 - 604
  • [9] Naive Bayes text classifier
    Zhang, Haiyi
    Li, Di
    [J]. GRC: 2007 IEEE INTERNATIONAL CONFERENCE ON GRANULAR COMPUTING, PROCEEDINGS, 2007, : 708 - 711
  • [10] An automatic document classifier system based on Naive Bayes Classifier and Ontology
    Chang, Yi-Hsing
    Huang, Hsiu-Yi
    [J]. PROCEEDINGS OF 2008 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2008, : 3144 - 3149