Comparison of Tree-Based Machine Learning Algorithms to Predict Reporting Behavior of Electronic Billing Machines

Cited by: 7
Authors
Murorunkwere, Belle Fille [1]
Ihirwe, Jean Felicien [2]
Kayijuka, Idrissa [3]
Nzabanita, Joseph [4]
Haughton, Dominique [5, 6, 7]
Affiliations
[1] Univ Rwanda, African Ctr Excellence Data Sci, POB 4285, Kigali, Rwanda
[2] Univ L'Aquila, Dept Informat Engn Comp Sci & Math, I-56121 Pisa, Italy
[3] Univ Rwanda, Dept Appl Stat, POB 4285, Kigali, Rwanda
[4] Univ Rwanda, Coll Sci & Technol, Dept Math, POB 3900, Kigali, Rwanda
[5] Bentley Univ, Dept Math Sci & Global Studies, Waltham, MA 02452 USA
[6] Univ Paris 1 SAMM, Dept Math Sci & Global Studies, F-75634 Paris, France
[7] Univ Toulouse 1 TSE R, Dept Math Sci & Global Studies, F-31042 Toulouse, France
Keywords
tree-based machine learning algorithms; compliance; value added tax; machine learning; electronic billing machines; reporting behavior
DOI
10.3390/info14030140
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Tax fraud is a common problem for many tax administrations, costing billions of dollars. Different tax administrations have considered several options to optimize revenue; among them, there is the so-called electronic billing machine (EBM), which aims to monitor all business transactions and, as a result, boost value added tax (VAT) revenue and compliance. Most of the current research has focused on the impact of EBMs on VAT revenue collection and compliance rather than understanding how EBM reporting behavior influences future compliance. The essential contribution of this study is that it leverages both EBM's historical reporting behavior and actual business characteristics to understand and predict the future reporting behavior of EBMs. Herein, tree-based machine learning algorithms such as decision trees, random forest, gradient boost, and XGBoost are utilized, tested, and compared for better performance. The results exhibit the robustness of the random forest model, among others, with an accuracy of 92.3%. This paper clearly presents our approach contribution with respect to existing approaches through well-defined research questions, analysis mechanisms, and constructive discussions. Once applied, we believe that our approach could ultimately help the tax-collecting agency conduct timely interventions on EBM compliance, which will help achieve the EBM objective of improving VAT compliance.
Pages: 21
Related papers
50 items in total
  • [41] COMPARISON OF TREE-BASED CLASSIFICATION ALGORITHMS IN MAPPING BURNED FOREST AREAS
    Matci, Dilek Kucuk
    Comert, Resul
    Avdan, Ugur
    GEODETSKI VESTNIK, 2020, 64 (03) : 348 - 360
  • [42] Evaluation of tree-based ensemble learning algorithms for building energy performance estimation
    Papadopoulos, Sokratis
    Azar, Elie
    Woon, Wei-Lee
    Kontokosta, Constantine E.
    JOURNAL OF BUILDING PERFORMANCE SIMULATION, 2018, 11 (03) : 322 - 332
  • [43] Flash Flood Susceptibility Modeling Using New Approaches of Hybrid and Ensemble Tree-Based Machine Learning Algorithms
    Band, Shahab S.
    Janizadeh, Saeid
    Pal, Subodh Chandra
    Saha, Asish
    Chakrabortty, Rabin
    Melesse, Assefa M.
    Mosavi, Amirhosein
    REMOTE SENSING, 2020, 12 (21) : 1 - 23
  • [44] An explainable model for the mass appraisal of residences: The application of tree-based Machine Learning algorithms and interpretation of value determinants
    Iban, Muzaffer Can
    HABITAT INTERNATIONAL, 2022, 128
  • [45] Tree-Based Automated Machine Learning to Predict Biogas Production for Anaerobic Co-digestion of Organic Waste
    Wang, Yan
    Huntington, Tyler
    Scown, Corinne D.
    ACS SUSTAINABLE CHEMISTRY & ENGINEERING, 2021, 9 (38) : 12990 - 13000
  • [46] Coastal vulnerability assessment using the machine learning tree-based algorithms modeling in the north coast of Java, Indonesia
    Fajar Yulianto
    Mardi Wibowo
    Ardila Yananto
    Dhedy Husada Fadjar Perdana
    Edwin Adi Wiguna
    Yudhi Prabowo
    Nurkhalis Rahili
    Amalia Nurwijayanti
    Marindah Yulia Iswari
    Esti Ratnasari
    Amien Rusdiutomo
    Sapto Nugroho
    Andan Sigit Purwoko
    Hilmi Aziz
    Imam Fachrudin
    EARTH SCIENCE INFORMATICS, 2023, 16 : 3981 - 4008
  • [47] Modeling interfacial tension of surfactant-hydrocarbon systems using robust tree-based machine learning algorithms
    Rashidi-Khaniabadi, Ali
    Rashidi-Khaniabadi, Elham
    Amiri-Ramsheh, Behnam
    Mohammadi, Mohammad-Reza
    Hemmati-Sarapardeh, Abdolhossein
    SCIENTIFIC REPORTS, 2023, 13 (01)
  • [48] Tree-Based and Machine Learning Algorithm Analysis for Breast Cancer Classification
    Bhardwaj, Arpit
    Bhardwaj, Harshit
    Sakalle, Aditi
    Uddin, Ziya
    Sakalle, Maneesha
    Ibrahim, Wubshet
    COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2022, 2022
  • [49] Uncovering Sociological Effect Heterogeneity Using Tree-Based Machine Learning
    Brand, Jennie E.
    Xu, Jiahui
    Koch, Bernard
    Geraldo, Pablo
    SOCIOLOGICAL METHODOLOGY, VOL 51, ISSUE 2, 2021, 51 (02) : 189 - 223
  • [50] A general tree-based machine learning accelerator with memristive analog CAM
    Pedretti, Giacomo
    Serebryakov, Sergey
    Strachan, John Paul
    Graves, Catherine E.
    2022 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS 22), 2022, : 220 - 224